• Stars
    star
    561
  • Rank 79,400 (Top 2 %)
  • Language
    Python
  • Created over 1 year ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Official implementation of "Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation"

Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation


This is official implementation of the paper "Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation". The last column of each example is our result.

⚡️News

❗️2023.04.10: We've opened the HuggingFace Demo! Also, we fixed minor issues, including the seed not being fixed.

❗️2023.03.31: We found that we typed an incorrect version of the model for point cloud inference. The fixed commit produces much better results.

Introduction

We introduce 3DFuse, a novel framework that incorporates 3D awareness into pretrained 2D diffusion models, enhancing the robustness and 3D consistency of score distillation-based methods. For more details, please visit our project page!

🔥TODO

  • 3D Generation/Gradio Demo Code
  • HuggingFace🤗 Demo Release
  • Colab Demo Release
  • Mesh Converting Code

Installation

Please follow installation.

Interactive Gradio App

for Text-to-3D / Image-to-3D

Enter your own prompt and enjoy! With this gradio app, you can preview the point cloud before 3D generation and determine the desired shape.

python gradio_app.py
# or python gradio_app.py --share

Text-to-3D Generation

After modifying the run.sh file with the desired prompt and hyperparameters, please execute the following command:

sh run.sh

Acknowledgement

We would like to acknowledge the contributions of public projects, including SJC and ControlNet whose code has been utilized in this repository.

Citation

@article{seo2023let,
  title={Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation},
  author={Seo, Junyoung and Jang, Wooseok and Kwak, Min-Seop and Ko, Jaehoon and Kim, Hyeonsu and Kim, Junho and Kim, Jin-Hwa and Lee, Jiyoung and Kim, Seungryong},
  journal={arXiv preprint arXiv:2303.07937},
  year={2023}
}

More Repositories

1

RAIN-GS

Code for "Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting" by Jaewoo Jung, Jisang Han, Honggyu An, Jiwon Kang, Seonghoon Park, and Seungryong Kim
Python
287
star
2

CAT-Seg

Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"
Python
244
star
3

GaussianTalker

Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn and Seungryong Kim
Python
209
star
4

LANIT

Official repository for LANIT: Language-Driven Image-to-Image Translation for Unlabeled Data (CVPR 2023)
Python
137
star
5

DiffMatch

Official implementation of "Diffusion Model for Dense Matching" (ICLR'24 Oral)
Python
135
star
6

MoDiTalker

Python
133
star
7

Talk3D

Python
127
star
8

3DGAN-Inversion

Official Implementation of WACV 2023 paper "3D GAN Inversion with Pose Optimization".
Python
107
star
9

locotrack

Official implementation of "Local All-Pair Correspondence for Point Tracking" (ECCV 2024)
Python
106
star
10

NeMF

Official code implementation of NeMF (NeurIPS'22)
Python
83
star
11

DirecT2V

Python
75
star
12

MIDMs

Official code implementation of MIDMs: Matching Interleaved Diffusion Models for Exemplar-based Image Translation (AAAI'23)
Python
65
star
13

DaRF

Official code implementation of "DäRF: Boosting Radiance Fields from Sparse Inputs with Monocular Depth Adaptation"(NeurIPS 2023)
Python
64
star
14

InstaFormer

Official repository for InstaFormer: Instance-aware Image-to-Image Translation with Transformer (CVPR 2022)
Python
51
star
15

CATs-PlusPlus

Official repository for CATs++: Boosting Cost Aggregation with Convolutions and Transformers (TPAMI'22)
Python
42
star
16

MaskingDepth

Python
39
star
17

DAG

33
star
18

INR-st

Official repository for Controllable Style Transfer via Test-time Training of Implicit Neural Representation
Python
28
star
19

GeCoNeRF

HTML
28
star
20

SE-NeRF

16
star
21

MoA

Mixture-of-Adapters
Python
15
star
22

SplitNet

11
star
23

RetDream

Official implementation of "Retrieval-Augmented Score Distillation for Text-to-3D Generation"
5
star
24

CATs-PlusPlus-Project-Page

HTML
5
star
25

3DFuse-threestudio

Threestudio extension of the paper "Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation".
Python
2
star
26

GSD

Geometry-Aware Score Distillation via 3D Consistent Noising and Gradient Consistency Modeling
1
star