Top Rating
- Top Contributors
  Discover the Top Open Source contributors by country or by language
- Interviews
  Discover real stories from Open Source developers
Discover

Discover your Favorite Language
Discover the top trending repositories and projects on Github. Explore the latest trends in your preferred languages.

Rust

R

MATLAB

Groovy

Nix

Lua

Perl

JavaScript

More Languages
Awesome

Awesome repositories
Discover the most awesome repositories and projects of your favorite languages. Inspired by the Awesome-* lists trend in GitHub.

Python

Elixir

Groovy

Shell

C++

PowerShell

Zig

Perl

More Languages
By Country

Rankings by Country
Discover the community of talented open source contributors in each country.

🇨🇮 Côte d'Ivoire

🇧🇮 Burundi

🇧🇪 Belgium

🇬🇱 Greenland

🇿🇼 Zimbabwe

🇲🇾 Malaysia

🇾🇹 Mayotte

🇸🇪 Sweden

All Countries Compare Countries

showlab/TTC-Tuning

Stars
2
Language
Created over 1 year ago
Updated over 1 year ago

showlab/TTC-Tuning

showlab

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

Show-1

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Tune-A-Video

Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Image2Paragraph

[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

MotionDirector

MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

Show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

VideoSwap

Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

Awesome-MLLM-Hallucination

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

all-in-one

[CVPR2023] All in One: Exploring Unified Video-Language Pre-training

BoxDiff

[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

DeVRF

The Pytorch implementation of "DeVRF: Fast Deformable Voxel Radiance Fields for Dynamic Scenes"

EgoVLP

[NeurIPS2022] Egocentric Video-Language Pretraining

VisorGPT

[NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT

Awesome-GUI-Agent

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

Awesome-Unified-Multimodal-Models

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

ShowAnything

Jupyter Notebook

cosmo

loveu-tgve-2023

Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.

sparseformer

(ICLR 2024, CVPR 2024) SparseFormer

datacentric.vlp

Compress conventional Vision-Language Pre-training data

Region_Learner

The Pytorch implementation for "Video-Text Pre-training with Learned Regions"

ShowRoom3D

This is the project page of ShowRoom3D

Long-form-Video-Prior

DemoVLP

[Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training

CLVQA

[AAAI2023 (Oral)] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task

BYOC

[IEEE-VR 2024] Bring Your Own Character: A Holistic Solution for Automatic Facial Animation Generation of Customized Characters

Q2A

[ECCV 2022] AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant

HOSNeRF

This is the project page for the HOSNeRF

headshot

GEB-Plus

[ECCV 2022] GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval

LOVA3

[NeurIPS 2024] "Learning to Visual Question Answering, Asking and Assessment"

Show-Anything-3D

Edit and Generate Anything in 3D world!

Awesome-Long-Context

A curated list of resources about long-context in large-language models and video understanding.

SCT

[IJCV2023] Offical implementation of "SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels"

VisInContext

Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning

SOIS

The Pytorch implementation of "Single-Stage Open-world Instance Segmentation with Cross-task Consistency Regularization"

AVA-AVD

Efficient-CLS

[arXiv2022] Label-Efficient Online Continual Object Detection in Streaming Video

videollm-online

VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)

Tune-An-Ellipse

[CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want

mist

ColonNeRF

This is the project page for ColonNeRF.

DynVideo-E

This is the project page for DynVideo-E.

VideoLISA

[NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos

assistq