DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers
- Authors: Jaemin Cho, Abhay Zala, and Mohit Bansal (UNC Chapel Hill)
- Paper: https://arxiv.org/abs/2202.04053
Visual Reasoning
Please see ./paintskills for our DETR-based visual reasoning skill evaluation.
(Optional) Please see https://github.com/aszala/PaintSkills-Simulator for our 3D Simulator implementation.
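For the DETR-based skill evaluation above, here is a minimal sketch of a detection-based check (e.g., the object-counting skill), using the off-the-shelf facebook/detr-resnet-50 checkpoint from Hugging Face transformers as a stand-in for the fine-tuned detector in ./paintskills; the model id, confidence threshold, and counting logic are illustrative, not the repo's exact pipeline.

```python
# Hedged sketch: count detected objects of a target class in a generated image.
# facebook/detr-resnet-50 is a stand-in, NOT the repo's fine-tuned DETR.
import torch
from PIL import Image
from transformers import DetrImageProcessor, DetrForObjectDetection

processor = DetrImageProcessor.from_pretrained("facebook/detr-resnet-50")
model = DetrForObjectDetection.from_pretrained("facebook/detr-resnet-50")

def count_objects(image_path: str, target_label: str, threshold: float = 0.9) -> int:
    """Count detections whose predicted class name matches target_label."""
    image = Image.open(image_path).convert("RGB")
    inputs = processor(images=image, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)
    target_sizes = torch.tensor([image.size[::-1]])  # (height, width)
    results = processor.post_process_object_detection(
        outputs, threshold=threshold, target_sizes=target_sizes
    )[0]
    names = [model.config.id2label[i.item()] for i in results["labels"]]
    return sum(name == target_label for name in names)

# A "count" prompt such as "3 dogs" would then score as correct when
# count_objects("generated.png", "dog") == 3.
```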
Image Quality & Image-Text Alignment
Please see ./quality for our image quality evaluation based on FID (Fréchet Inception Distance).
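As a rough illustration of the metric, here is a hedged FID sketch using torchmetrics (an assumption for illustration; the scripts in ./quality may rely on a different FID implementation):

```python
# Hedged sketch: FID between a set of real and generated images.
# Requires: pip install torchmetrics[image]
import torch
from torchmetrics.image.fid import FrechetInceptionDistance

fid = FrechetInceptionDistance(feature=2048)

# Placeholders: uint8 image batches of shape (N, 3, H, W) in [0, 255].
# In practice, load the real/generated image sets from disk instead.
real_images = torch.randint(0, 256, (16, 3, 299, 299), dtype=torch.uint8)
fake_images = torch.randint(0, 256, (16, 3, 299, 299), dtype=torch.uint8)

fid.update(real_images, real=True)
fid.update(fake_images, real=False)
print(f"FID: {fid.compute().item():.2f}")  # lower is better
```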
Please see ./retrieval for our image-text alignment evaluation with CLIP-based R-precision.
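The idea behind R-precision is that a well-aligned image should retrieve its own caption from a pool of distractors. Below is a minimal sketch with Hugging Face's CLIP; the model id and the pool construction are illustrative assumptions, not the repo's exact setup.

```python
# Hedged sketch: CLIP-based R-precision for one image.
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

def retrieves_true_caption(image_path: str, true_caption: str, distractors: list[str]) -> bool:
    """True if the true caption ranks first among the candidate captions."""
    captions = [true_caption] + distractors
    image = Image.open(image_path).convert("RGB")
    inputs = processor(text=captions, images=image, return_tensors="pt", padding=True)
    with torch.no_grad():
        outputs = model(**inputs)
    # logits_per_image: (1, num_captions) image-text similarity scores
    return outputs.logits_per_image.argmax(dim=-1).item() == 0

# R-precision over a dataset = fraction of images for which this returns True.
```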
Please see ./captioning for our image-text alignment evaluation with VL-T5 captioning.
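The captioning-based check generates a caption for the image and compares it against the input prompt. The sketch below uses BLIP as a runnable stand-in, since VL-T5 has no drop-in Hugging Face interface; the model choice and the BLEU comparison are assumptions, not the repo's exact setup.

```python
# Hedged sketch: caption the generated image, then compare with the prompt.
# BLIP here is a stand-in for the repo's VL-T5 captioner.
import torch
from PIL import Image
from transformers import BlipProcessor, BlipForConditionalGeneration

processor = BlipProcessor.from_pretrained("Salesforce/blip-image-captioning-base")
model = BlipForConditionalGeneration.from_pretrained("Salesforce/blip-image-captioning-base")

def caption_image(image_path: str) -> str:
    image = Image.open(image_path).convert("RGB")
    inputs = processor(images=image, return_tensors="pt")
    with torch.no_grad():
        out = model.generate(**inputs, max_new_tokens=30)
    return processor.decode(out[0], skip_special_tokens=True)

# Compare the generated caption to the input prompt with any text metric,
# e.g., BLEU via sacrebleu (an illustrative choice):
#   import sacrebleu
#   score = sacrebleu.sentence_bleu(caption_image("gen.png"), [prompt]).score
```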
Social Bias
Please see ./biases for our social (gender and skin tone) bias evaluation.
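The evaluation generates images from neutral prompts, predicts attributes per image, and measures how far the resulting attribute distribution deviates from uniform. Here is a minimal sketch of that last step; the mean-absolute-deviation metric is illustrative, and the attribute classifiers themselves live in ./biases.

```python
# Hedged sketch: skew of a predicted attribute distribution vs. uniform.
from collections import Counter

def attribute_skew(labels: list[str], categories: list[str]) -> float:
    """Mean absolute deviation of the label distribution from uniform."""
    counts = Counter(labels)
    total = len(labels)
    uniform = 1.0 / len(categories)
    return sum(abs(counts[c] / total - uniform) for c in categories) / len(categories)

# Perfectly balanced predictions give 0.0; fully skewed ones are maximal.
print(attribute_skew(["male", "male", "female", "male"], ["male", "female"]))  # 0.25
```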
Models
We provide inference scripts for DALLE-small (DALLE-pytorch), minDALL-E, X-LXMERT, and Stable Diffusion.
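For example, here is a minimal Stable Diffusion inference sketch via diffusers; the model id and sampling hyperparameters are assumptions and may differ from the provided scripts.

```python
# Hedged sketch: text-to-image inference with Stable Diffusion v1.4.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4", torch_dtype=torch.float16
)
pipe = pipe.to("cuda")

prompt = "a photo of 3 dogs on the grass"
image = pipe(prompt, num_inference_steps=50, guidance_scale=7.5).images[0]
image.save("generated.png")
```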
Acknowledgments
We thank the developers of DETR, DALLE-pytorch, minDALL-E, X-LXMERT, and Stable Diffusion for publicly releasing their code.
Reference
Please cite our paper if you use our dataset in your work:
@article{Cho2022DallEval,
  title         = {DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers},
  author        = {Jaemin Cho and Abhay Zala and Mohit Bansal},
  year          = {2022},
  archivePrefix = {arXiv},
  primaryClass  = {cs.CV},
  eprint        = {2202.04053}
}