Introduction
This is model fitting and inference code for CLIP aesthetic regressions trained on Simulacra Aesthetic Captions. These remarkably simple models emulate human aesthetic judgment. They can be used for tasks such as dataset filtering, removing obviously poor-quality images from a corpus before training. The following grids, one sorted by John David Pressman and one sorted by the machine, give some idea of the models' capabilities:
Manually Sorted Grid
Model Sorted Grid
Installation
Git clone this repository:
git clone https://github.com/crowsonkb/simulacra-aesthetic-models.git
Install PyTorch if you don't already have it:
pip3 install torch==1.10.1+cu113 torchvision==0.11.2+cu113 torchaudio==0.10.1+cu113 -f https://download.pytorch.org/whl/cu113/torch_stable.html
Then pip install our other dependencies:
pip3 install tqdm pillow scikit-learn numpy
If you don't already have it installed, you'll need to install CLIP:
git clone https://github.com/openai/CLIP.git
cd CLIP
pip3 install .
cd ..
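To confirm the install worked, a quick smoke test along these lines (not part of the original instructions, just a suggested check) should import CLIP and load a model:

# Smoke test: verify that PyTorch and CLIP import cleanly and a model loads.
import clip
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/16", device=device)  # downloads weights on first run
print(clip.available_models())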
Usage
The models are largely meant to be used as a library, i.e. you'll need to write specific code for your use case. But to get you started we've provided a sample script, rank_images.py, which finds all the .jpg or .png images in a directory tree and ranks the top N (default 50) with the aesthetic model:
python3 rank_images.py demo_images/
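If you want to call the model from your own code, the following is a minimal sketch of the basic pattern: embed an image with CLIP, then feed the embedding to a small regression head. Note the checkpoint path, the Linear(512, 1) head shape, the embedding normalization step, and the demo image filename are illustrative assumptions rather than this repository's exact API; see the model code and rank_images.py here for the actual classes and state dict layout.

# Sketch: score one image with a CLIP embedding and a linear aesthetic head.
# The head shape, checkpoint path, normalization step, and image filename are
# assumptions for illustration; consult this repo's model code for the real ones.
import clip
import torch
from torch import nn
from torch.nn import functional as F
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
clip_model, preprocess = clip.load("ViT-B/16", device=device)

head = nn.Linear(512, 1).to(device)  # assumed head for ViT-B/16's 512-dim embeddings
# head.load_state_dict(torch.load("aesthetic_head.pth"))  # hypothetical checkpoint

image = preprocess(Image.open("demo_images/example.jpg")).unsqueeze(0).to(device)
with torch.no_grad():
    embed = clip_model.encode_image(image).float()
    embed = F.normalize(embed, dim=-1)  # assumes the head was trained on unit-norm embeddings
    score = head(embed).item()
print(f"Predicted aesthetic rating: {score:.2f}")

The same loop over a directory of files, keeping the top-scoring images, is essentially what rank_images.py does and what a dataset-filtering pipeline would build on.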