This repository is the official codebase of "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation" [paper]. Our paper has been accepted at the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024 (CVPR 2024).
🤣👉Click [project page] for more funny examples👈.
To the best of our knowledge, we are the first to profoundly explore the Leap-of-Thought (LoT) ability in multimodal large language models (LLMs). This involves challenging LLMs to think outside the box, a non-sequential thinking skill equally crucial alongside popular sequential thinking abilities, such as Chain-of-Thought based methods. In this study, we delve into the LLM's LoT ability through the lens of a humor generation game called Oogiri (大喜利). The Oogiri game serves as an ideal platform for exploring the LLM's LoT ability, as it compels participants to think outside the box and provide unexpected and humorous responses to multimodal information (including I2T, T2T, and IT2T).
We provide a simple Chinese example in inference.py
for using CLoT with zero-shot inference. Before you start, make sure you install the following packages:
pip install -r requirements.txt
Then run the command below:
python inference.py
Try launching the Gradio web interface with the following code!
python gradio_demo.py
2024/3/16 - We released our dataset and checkpoint. 👈😆Please try it!
2023/12/6 - We released our project page. 👈😆Please check it out!
2023/12/5 - We released our paper [arxiv]. Please check it out!
- project page
- preprint paper
-
online demo (Hugging Face etc.)checkpoint - dataset
- code
@misc{zhong2023clot,
title={Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation},
author={Zhong, Shanshan and Huang, Zhongzhan and Gao, Shanghua and Wen, Weushao and Lin, Liang and Zitnik, Marinka and Zhou, Pan},
journal={arXiv preprint arXiv:2312.02439},
year={2023}
}