• Stars
    star
    215
  • Rank 183,925 (Top 4 %)
  • Language
  • License
    MIT License
  • Created almost 6 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A collection of arbitrary text to image papers with code (constantly updating)

arbitrary-text-to-image-papers

A collection of arbitrary kinds of text to image papers, organized by Tzu-Heng Lin and Haoran Mo.

Papers are ordered in arXiv first version submitting time (if applicable).

Feel free to send a PR or an issue.

TOC

general text to image

Note Model Paper Conference paper link code link
GAN-INT-CLS Generative Adversarial Text to Image Synthesis ICML 2016 1605.05396 reedscot/icml2016
StackGAN StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks ICCV 2017 1612.03242 hanzhanggit/StackGAN
StackGAN++ StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks TPAMI 2018 1710.10916 hanzhanggit/StackGAN-v2
AttnGAN AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks CVPR 2018 1711.10485 taoxugit/AttnGAN
HD-GAN Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial Network CVPR 2018 1802.09178 ypxie/HDGan
StoryGAN StoryGAN: A Sequential Conditional GAN for Story Visualization CVPR 2019 1812.02784 yitong91/StoryGAN
MirrorGAN MirrorGAN: Learning Text-to-image Generation by Redescription CVPR 2019 1903.05854
DM-GAN DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis CVPR 2019 1904.01310
SD-GAN Semantics Disentangling for Text-to-Image Generation CVPR 2019 1904.01480

scene graph/layout to image

Note Model Paper Conference paper link code link
GAWWN Learning What and Where to Draw NIPS 2016 1610.02454 reedscot/nips2016
Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis CVPR 2018 1801.05091
sg2im Image Generation from Scene Graphs CVPR 2018 1804.01622 google/sg2im
Text2Scene Text2Scene: Generating Abstract Scenes from Textual Descriptions CVPR 2019 1809.01110 uvavision/Text2Image
Layout2Im Image Generation from Layout CVPR 2019 1811.11389
LayoutGAN LayoutGAN: Generating Graphic Layouts with Wireframe Discriminator ICLR 2019 openreview
Object Pathways Generating Multiple Objects at Spatially Distinct Locations ICLR 2019 1901.00686 tohinz/multiple-objects-gan
Using Scene Graph Context to Improve Image Generation 1901.03762
Obj-GAN Object-driven Text-to-Image Synthesis via Adversarial Training CVPR 2019 1902.10740 jamesli1618/Obj-GAN

dialog to image

Note Model Paper Conference paper link code link
CoDraw CoDraw: Visual Dialog for Collaborative Drawing 1712.05558 CoDraw dataset
ChatPainter ChatPainter: Improving Text to Image Generation using Dialogue ICLRW 2018 1802.08216
Keep Drawing It: Iterative language-based image generation and editing NIPSW 2018 1811.09845 CLEVR dataset
Chat-crowd Chat-crowd: A Dialog-based Platform for Visual Layout Composition 1812.04081 uvavision/chat-crowd
SeqAttnGAN Sequential Attention GAN for Interactive Image Editing via Dialogue 1812.08352

More Repositories

1

image-to-image-papers

🦓<->🦒 🌃<->🌆 A collection of image to image papers with code (constantly updating)
1,088
star
2

Cool-Fashion-Papers

👔👗🕶️🎩 Cool resources about Fashion + AI! (papers, datasets, workshops, companies, ...) (constantly updating)
Shell
573
star
3

Clothes-3D

clothes research in 3D
122
star
4

metrics

IS, FID score Pytorch and TF implementation, TF implementation is a wrapper of the official ones.
Python
106
star
5

deepfashion2-kps-agg-finetune

1st place solution (Team StylingAI Inc. & PKU AIIC) for CVPR 2020 DeepFashion2 Clothes Landmark Detection Track. Aggregation and Finetuning for Clothes Landmark Detection
Cuda
74
star
6

Awesome-BEV-Papers

16
star
7

Fast-Fourier-Transform

A C++ Implementation of Fast Fourier Transform (Project of Digital Signal Processing course)
TeX
13
star
8

alias-free-gan-explanation

Trying to understand alias-free-gan.
Python
12
star
9

RBM-DBN-theano-DL4J

Train a DBN to classify a set of test data similar to MNIST, Using DL4J & theano (Project of Pattern Recognition course)
Python
10
star
10

Palm-Live-Detection

Palm Live Detection (Project of Digital Image Processing Course)
C++
8
star
11

Resources

A curated list of resources in Software Engineering related field
7
star
12

bigdatacup2021

2nd Place of BigData Cup 2021 Track1
Jupyter Notebook
7
star
13

MCMC

An introduction of MCMC, implementation of TAP, AIS, RTS algorithm to estimate the partition function of an RBM (Project of Stochastic Process course)
PostScript
6
star
14

texture-synthesis-papers

about texture synthesis papers
6
star
15

pysc2-minigame-ai

Rule based AI for StarCraft 2 Minigames
Python
5
star
16

SinGAN-pytorch

pytorch reimplementation of SinGAN
Python
4
star
17

GAN-image-synthesis-papers-collection

about all kinds of image synthesis with GANs
3
star
18

gem5-NVP-DFS

Dynamic Frequency Selection (Project of Modern Computer Architecture course)
C++
2
star
19

gpr

Gaussian Process Regression: A practical overview (Project of Stochastic Process course)
MATLAB
1
star
20

DTMF

An implementation of DTMF detection and Goertzel Algorithm (Project of Digital Signal Processing course)
C++
1
star