• Stars: 450
• Rank: 96,492 (Top 2%)
• Language: Jupyter Notebook
• License: MIT License
• Created over 1 year ago
• Updated over 1 year ago

Repository Details

Official Repository of ChatCaptioner

Interactive ChatCaptioner for image and video

Official repository of ChatCaptioner and Video ChatCaptioner.

ChatCaptioner paper: "ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions" (arXiv:2303.06594)

Video ChatCaptioner paper: "Video ChatCaptioner: Towards the Enriched Spatiotemporal Descriptions" (arXiv:2304.04227)

Acknowledgement

If you use ChatCaptioner or Video ChatCaptioner, please cite them using the following BibTeX entries:

@article{zhu2023chatgpt,
  title={ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions},
  author={Deyao Zhu and Jun Chen and Kilichbek Haydarov and Xiaoqian Shen and Wenxuan Zhang and Mohamed Elhoseiny},
  journal={arXiv preprint arXiv:2303.06594},
  year={2023}
}
@article{chen2023video,
  title={Video ChatCaptioner: Towards the Enriched Spatiotemporal Descriptions},
  author={Jun Chen and Deyao Zhu and Kilichbek Haydarov and Xiang Li and Mohamed Elhoseiny},
  journal={arXiv preprint arXiv:2304.04227},
  year={2023}
}

License

ChatCaptioner and Video ChatCaptioner are released under the MIT license.

More Repositories

1. MiniGPT-4 (Python, 25,271 stars)
   Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

2. MiniGPT4-video (Python, 486 stars)
   Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

3. VisualGPT (Python, 316 stars)
   VisualGPT, CVPR 2022 Proceeding, GPT as a decoder for vision-language models

4. 3DCoMPaT-v2 (Python, 75 stars)
   3DCoMPaT++: An improved large-scale 3D vision dataset for compositional recognition

5. MiniGPT-Med (Python, 63 stars)
   Open-sourced code of miniGPT-Med

6. LTVRR (Python, 35 stars)

7. RelTransformer (Python, 29 stars)

8. MammalNet (Python, 25 stars)

9. artemis-v2 (Jupyter Notebook, 17 stars)
   Code for the paper: It is Okay to Not Be Okay: Overcoming Emotional Bias in Affective Image Captioning by Contrastive Data Collection

10. 3DCoMPaT (Jupyter Notebook, 16 stars)
    Official repository for the 3DCoMPaT dataset (ECCV2022 Oral)

11. saai-factory-tutorial-creative-ai (11 stars)
    Creative AI for Visual Art and Music slides and demos.

12. AF-Guide (Python, 11 stars)
    Official repository of Action-Free Guide

13. InfiniBench (Python, 10 stars)

14. affectiveVisDial (Python, 9 stars)

15. CWAN (Python, 7 stars)
    Creative Walk Adversarial Networks: Novel Art Generation with Probabilistic Random Walk Deviation from Style Norms

16. WAGA (Jupyter Notebook, 6 stars)
    Code for Wölfflin Affective Generative Analysis paper published in ICCC 2021

17. CIZSLv2 (Python, 6 stars)
    CIZSL++: Creativity Inspired Generative Zero-Shot Learning. T-PAMI under review.

18. HalentNet (Python, 6 stars)

19. cs326-few-shot-classification (Python, 5 stars)
    CS326 Practical assignment #2: few-shot classification

20. GRaWD (Python, 4 stars)
    Imaginative Walks: Generative Random Walk Deviation Loss for Improved Unseen Learning Representation. CVPR 2022 Workshop, ICCC 2022.

21. artelingo (Jupyter Notebook, 3 stars)

22. UnlikelihoodMotionForecasting (Jupyter Notebook, 3 stars)