Guided Image-to-Image Translation papers
Feel free to send a PR or issue. (constantly updating)
- Class Label Guided
- Action Unit Guided
- Facial Landmark Guided
- Pose Guided Person Image Generation
- Segmentation Map Guided Scene Image Generation
- Texture Patch Guided
- Example Guided
- Attention Guided
- Mask Guided
- Text Guided
- Audio Guided
Class Label Guided
Model | Paper | Conference | Arxiv | Code |
---|---|---|---|---|
IcGAN | Invertible Conditional GANs for image editing | NeurIPSW 2016 | 1611.06355 | Guim3/IcGAN |
Conditional CycleGAN | Conditional CycleGAN for Attribute Guided Face Image Generation | ECCV 2018 | 1705.09966 | |
StarGAN | StarGAN: Uniο¬ed Generative Adversarial Networks for Multi-Domain Image-to-Image Translation | CVPR 2018 | 1711.09020 | yunjey/StarGAN |
AGUIT | Attribute Guided Unpaired Image-to-Image Translation with Semi-supervised Learning | 1904.12428 | imlixinyang/AGUIT | |
AttGAN | AttGAN: Facial Attribute Editing by Only Changing What You Want | TIP 2019 | 1711.10678 | LynnHo/AttGAN-Tensorflow |
SGGAN | Sparsely Grouped Multi-task Generative Adversarial Networks for Facial Attribute Manipulation | MM 2018 | 1805.07509 | zhangqianhui/Sparsely-Grouped-GAN |
RelGAN | RelGAN: Multi-Domain Image-to-Image Translation via Relative Attributes | ICCV 2019 | 1908.07269 | elvisyjlin/RelGAN-PyTorch, willylulu/RelGAN |
Action Unit Guided
Model | Paper | Conference | Arxiv | Code |
---|---|---|---|---|
GANimation | GANimation: Anatomically-aware Facial Animation from a Single Image | ECCV 2018 | 1807.09251 | albertpumarola/GANimation |
Facial Landmark Guided
Model | Paper | Conference | Arxiv | Code |
---|---|---|---|---|
G2GAN | Geometry Guided Adversarial Facial Expression Synthesis | MM 2018 | 1712.03474 | |
CMM-Net | Every Smile is Unique: Landmark-Guided Diverse Smile Generation | CVPR 2018 | 1802.01873 | |
C2GAN | Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation | MM 2019 | 1908.00999 | Ha0Tang/C2GAN |
Few-Shot Adversarial Learning of Realistic Neural Talking Head Models | ICCV 2019 | 1905.08233 | grey-eye/talking-heads |
Pose Guided Person Image Generation
Model | Paper | Conference | Arxiv | Code |
---|---|---|---|---|
PG2 | Pose Guided Person Image Generation | NeurIPS 2017 | 1705.09368 | charliememory/Pose-Guided-Person-Image-Generation |
PoseGAN | Deformable GANs for Pose-Based Human Image Generation | CVPR 2018 | 1801.00055 | AliaksandrSiarohin/pose-gan |
VUnet | A Variational U-Net for Conditional Appearance and Shape Generation | CVPR 2018 | 1804.04694 | CompVis/vunet |
PoseWarp | Synthesizing Images of Humans in Unseen Poses | CVPR 2018 | 1804.07739 | posewarp-cvpr2018 |
DPIG | Disentangled Person Image Generation | CVPR 2018 | 1712.02621 | charliememory/Disentangled-Person-Image-Generation |
FD-GAN | FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification | NeurIPS 2018 | 1810.02936 | yxgeee/FD-GAN |
PN-GAN | Pose-Normalized Image Generation for Person Re-identification | ECCV 2018 | 1712.02225 | naiq/PN_GAN |
GestureGAN | GestureGAN for Hand Gesture-to-Gesture Translation in the Wild | MM 2018 | 1808.04859 | Ha0Tang/GestureGAN |
PATN | Progressive Pose Attention for Person Image Generation | CVPR 2019 | 1904.03349 | tengteng95/Pose-Transfer |
SPT | Unsupervised Person Image Generation with Semantic Parsing Transformation | CVPR 2019 | 1904.03379 | SijieSong/person_generation_spt |
Coordinate-based Texture Inpainting for Pose-Guided Human Image Generation | CVPR 2019 | 1811.11459 | project | |
IntrinsicFlow | Dense intrinsic appearance flow for human pose transfer | CVPR 2019 | 1903.11326 | ly015/intrinsic_flow |
TriangleGAN | Gesture-to-Gesture Translation in the Wild via Category-Independent Conditional Maps | MM 2019 | 1907.05916 | yhlleo/TriangleGAN |
Pix2pixHD + Temporal Smoothing + FaceGAN | Everybody Dance Now | ICCV 2019 | 1808.07371 | project |
LiquidWarpingGAN | Liquid warping gan: A unified framework for human motion imitation, appearance transfer and novel view synthesis | ICCV 2019 | 1909.12224 | svip-lab/impersonator |
Global-Flow-Local-Attention | Deep Image Spatial Transformation for Person Image Generation | CVPR 2020 | 2003.00696 | RenYurui/Global-Flow-Local-Attention |
ADGAN | Controllable Person Image Synthesis With Attribute-Decomposed GAN | CVPR 2020 | 2003.12267 | menyifang/ADGAN |
CoCosNet | Cross-domain Correspondence Learning for Exemplar-based Image Translation | CVPR 2020 | 2004.05571 | microsoft/CoCosNet |
SMIS | Semantically Multi-modal Image Synthesis | CVPR 2020 | 2003.12697 | Seanseattle/SMIS |
MISC | MISC: Multi-Condition Injection and Spatially-Adaptive Compositing for Conditional Person Image Synthesis | CVPR 2020 | cvpr20 | |
Warp3d_Reposing | Reposing Humans by Warping 3D Features | CVPR 2020 Workshop | 2006.04898 | MKnoche/warp3d_reposing |
Wish You Were Here: Context-Aware Human Generation | CVPR 2020 | 2005.10663 | ||
PoseStylizer | Generating Person Images with Appearance-aware Pose Stylizer | IJCAI 2020 | 2007.09077 | siyuhuang/PoseStylizer |
XingGAN | XingGAN for Person Image Generation | ECCV 2020 | 2007.09278 | Ha0Tang/XingGAN |
Segmentation Map Guided Scene Image Generation
Model | Paper | Conference | Arxiv | Code |
---|---|---|---|---|
CRN | Photographic Image Synthesis with Cascaded Refinement Networks | ICCV 2017 | 1707.09405 | CQFIO/PhotographicImageSynthesis |
CrossNet | Predicting Ground-Level Scene Layout from Aerial Imagery | CVPR 2017 | 1612.02709 | viibridges/crossnet |
SIMS | Semi-parametric Image Synthesis | CVPR 2018 | 1804.10992 | xjqicuhk/SIMS |
Pix2PixHD | High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs | CVPR 2018 | 1711.11585 | NVIDIA/pix2pixHD |
X-Fork & X-Seq | Cross-View Image Synthesis using Conditional GANs | CVPR 2018 | 1803.03396 | kregmi/cross-view-image-synthesis |
Vid2Vid | Video-to-Video Synthesis | NeurIPS 2018 | 1808.06601 | NVIDIA/vid2vid |
SPADE | Semantic Image Synthesis with Spatially-Adaptive Normalization | CVPR 2019 | 1903.07291 | NVlabs/SPADE |
SelectionGAN | Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation | CVPR 2019 | 1904.06807 | Ha0Tang/SelectionGAN |
Art2Real | Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation | CVPR 2019 | 1811.10666 | aimagelab/art2real |
Mask-Guided Portrait Editing with Conditional GANs | CVPR 2019 | 1905.10346 | cientgu/Mask_Guided_Portrait_Editing | |
Seg2Vid | Video Generation from Single Semantic Label Map | CVPR 2019 | 1903.04480 | junting/seg2vid |
Semantic Bottleneck Scene Generation | 1911.11357 | |||
Few-shot Vid2Vid | Few-shot Video-to-Video Synthesis | NeurIPS 2019 | 1910.12713 | NVlabs/few-shot-vid2vid |
CC-FPSE | Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis | NeurIPS 2019 | 1910.06809 | xh-liu/CC-FPSE |
SEAN | SEAN: Image Synthesis with Semantic Region-Adaptive Normalization | CVPR 2020 | 1911.12861 | ZPdesu/SEAN |
BachGAN | BachGAN: High-Resolution Image Synthesis from Salient Object Layout | CVPR 2020 | 2003.11690 | Cold-Winter/BachGAN |
Panoptic-based Image Synthesis | CVPR 2020 | 2004.10289 | ||
SMIS | Semantically Multi-modal Image Synthesis | CVPR 2020 | 2003.12697 | Seanseattle/SMIS |
GAN Compression | GAN Compression: Efficient Architectures for Interactive Conditional GANs | CVPR 2020 | 2003.08936 | mit-han-lab/gan-compression |
LGGAN | Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation | CVPR 2020 | 1912.12215 | Ha0Tang/LGGAN |
TSIT | TSIT: A Simple and Versatile Framework for Image-to-Image Translation | ECCV 2020 | 2007.12072 | EndlessSora/TSIT |
SegVAE | Controllable Image Synthesis via SegVAE | ECCV 2020 | 2007.08397 | yccyenchicheng/SegVAE |
SESAME | SESAME: Semantic Editing of Scenes by Adding, Manipulating or Erasing Objects | ECCV 2020 | 2004.04977 | |
Style Semantics | Controlling Style and Semantics in Weakly-Supervised Image Generation | ECCV 2020 | 1912.03161 | dariopavllo/style-semantics |
Texture Patch Guided
Model | Paper | Conference | Arxiv | Code |
---|---|---|---|---|
TextureGAN | TextureGAN: Controlling Deep Image Synthesis with Texture Patches | CVPR 2018 | 1706.02823 | janesjanes/Pytorch-TextureGAN |
Guided-pix2pix | Guided Image-to-Image Translation with Bi-Directional Feature Transformation | ICCV 2019 | 1910.11328 | vt-vl-lab/Guided-pix2pix |
Example Guided
Model | Paper | Conference | Arxiv | Code |
---|---|---|---|---|
EG-UNIT | Exemplar Guided Unsupervised Image-to-Image Translation | ICLR 2019 | 1805.11145 | charliememory/EGSC-IT |
Pix2pixSC | Example-Guided Style-Consistent Image Synthesis from Semantic Labeling | CVPR 2019 | 1906.01314 | cxjyxxme/pix2pixSC |
Attention Guided
Model | Paper | Conference | Arxiv | Code |
---|---|---|---|---|
DA-GAN | DA-GAN: Instance-level Image Translation by Deep Attention Generative Adversarial Networks | CVPR 2018 | 1802.06454 | |
Attention-GAN | Attention-GAN for Object Transfiguration in Wild Images | ECCV 2018 | 1803.06798 | |
UAIT | Unsupervised Attention-guided Image to Image Translation | NeurIPS 2018 | 1806.02311 | AlamiMejjati/Unsupervised-Attention-guided-Image-to-Image-Translation |
Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention | TIP 2019 | 1806.06195 | ||
AttentionGAN | Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation | IJCNN 2019 | 1903.12296 | Ha0Tang/AttentionGAN |
U-GAT-IT | U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation | ICLR 2020 | 1907.10830 | taki0112/UGATIT, znxlwm/UGATIT-pytorch |
Mask Guided
Model | Paper | Conference | Arxiv | Code |
---|---|---|---|---|
ContrastGAN | Generative Semantic Manipulation with Mask-Contrasting GAN | ECCV 2018 | 1708.00315 | |
InstaGAN | Instance-aware image-to-image translation | ICLR 2019 | 1812.10889 | sangwoomo/instagan |
INIT | Towards Instance-level Image-to-Image Translation | CVPR 2019 | 1905.01744 | project |
Text Guided
Model | Paper | Conference | Arxiv | Code |
---|---|---|---|---|
ControlGAN | Controllable Text-to-Image Generation | NeurIPS 2019 | 1909.07083 | mrlibw/ControlGAN |
DMIT | Multi-mapping Image-to-Image Translation via Learning Disentanglement | NeurIPS 2019 | 1909.07877 | Xiaoming-Yu/DMIT |
ManiGAN | ManiGAN: Text-Guided Image Manipulation | 1912.06203 | ||
RefinedGAN | Image-to-Image Translation with Text Guidance | 2002.05235 |
Audio Guided
Model | Paper | Conference | Arxiv | Code |
---|---|---|---|---|
X2Face | X2Face: A Network for Controlling Face Generation using Images, Audio, and Pose Codes | ECCV 2018 | 1807.10550 | oawiles/X2Face |