There are no reviews yet. Be the first to send feedback to the community and the maintainers!
MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"VoxelNeXt
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)LLaMA-VID
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)DeepUPE
Underexposed Photo Enhancement Using Deep Illumination Estimation3D-Box-Segment-Anything
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.LLMGA
This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, LoRAPanopticFCN
Fully Convolutional Networks for Panoptic Segmentation (CVPR2021 Oral)PointGroup
PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation3DSSD
3DSSD: Point-based 3D Single Stage Object Detector (CVPR 2020)Video-P2P
Video-P2P: Video Editing with Cross-attention ControlFocalsConv
Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)Stratified-Transformer
Stratified Transformer for 3D Point Cloud Segmentation (CVPR 2022)DSGN
DSGN: Deep Stereo Geometry Network for 3D Object Detection (CVPR 2020)PFENet
PFENet: Prior Guided Feature Enrichment Network for Few-shot Segmentation (TPAMI).SphereFormer
The official implementation for "Spherical Transformer for LiDAR-based 3D Recognition" (CVPR 2023).GridMask
ReviewKD
Distilling Knowledge via Knowledge Review, CVPR 2021Parametric-Contrastive-Learning
Parametric Contrastive Learning (ICCV2021) & GPaCo (TPAMI 2023)Step-DPO
Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"Simple-SR
Include MuCAN, LAPAR, etc.UVTR
Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022)Facelet_Bank
Facelet-Bank for Fast Portrait Manipulation (pytorch)SA-AutoAug
Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)LargeKernel3D
LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs (CVPR 2023)SNR-Aware-Low-Light-Enhance
This is the official implementation for the paper "SNR-aware low-light image enhancement" in CVPR2022MASA-SR
MASA-SR: Matching Acceleration and Spatial Adaptation for Reference-Based Image Super-Resolution (CVPR2021)ECCV22-P3AFormer-Tracking-Objects-as-Pixel-wise-Distributions
The official code for our ECCV22 oral paper: tracking objects as pixel-wise distributions.Context-Aware-Consistency
Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CVPR 2021)SparseTransformer
A fast and memory-efficient libarary for sparse transformer with varying token numbers (e.g., 3D point cloud).spconv-plus
EfficientNeRF
The official code for "Efficient Neural Radiance Fields" in CVPR2022.MiSLAS
Improving Calibration for Long-Tailed Recognition (CVPR2021)RIVAL
[NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion Inversion ChainMOOD
Official PyTorch implementation of MOOD series: (1) MOODv1: Rethinking Out-of-distributionDetection: Masked Image Modeling Is All You Need. (2) MOODv2: Masked Image Modeling for Out-of-Distribution Detection.outpainting_srn
Wide-Context Semantic Image Extrapolation, CVPR2019MSAD
Multi-Scale Aligned Distillation for Low-Resolution Detection (CVPR2021)DeepVision3D
DeepVision3D is an open source toolbox for point-cloud understanding.Ref-NPR
[CVPR 2023] Ref-NPR: Reference-Based Non-PhotoRealistic Radiance FieldsVFIformer
Video Frame Interpolation with Transformer (CVPR2022)Prompt-Highlighter
[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMsVFF
Voxel Field Fusion for 3D Object Detection (CVPR2022)SMR
Self-Supervised 3D Mesh Reconstruction from Single Images (CVPR2021)SCGAN
The implementation of 'Image synthesis via semantic composition', ICCV2021.JigsawClustering
This is the code for CVPR 2021 oral paper: Jigsaw Clustering for Unsupervised Visual Representation LearningAttenNorm
Attentive Normalization for Conditional Image GenerationGFS-Seg
The official implementation of Generalized Few-shot Semantic Segmentation (CVPR 2022)Mask-Attention-Free-Transformer
Official Implementation for "Mask-Attention-Free Transformer for 3D Instance Segmentation"MoTCoder
This is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks.SDSD
Seeing Dynamic Scene in the Dark: High-Quality Video Dataset with Mechatronic Alignment (ICCV2021)GroupContrast
[CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D UnderstandingProposeReduce
Video Instance Segmentation with a Propose-Reduce Paradigm (ICCV 2021)Robust-Semantic-Segmentation
Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation (ICCV2021)Mr-Ben
This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"BAL
BAL: Balancing Diversity and Novelty for Active Learning - Official Pytorch ImplementationTriVol
The official code of TriVol in CVPR-2023MR-GSM8K
Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMsDecoupleNet
Official implementation for our ECCV 2022 paper "DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation"Dsig
Deep Structured Instance Graph for Distilling Object Detectors (ICCV 2021)LBGAT
Learnable Boundary Guided Adversarial Training (ICCV2021)Q-LLM
This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"AGSS-VOS
AGSS-VOS: Attention Guided Single-Shot Video Object SegmentationMAT
MAT: Mask-Aware Transformer for Large Hole Image InpaintingMSN
Memory Selection Network for Video Propagation (ECCV 2020)APD
Point2Pix
The official code of Point2pix in CVPR-2023TagCLIP
Love Open Source and this site? Check out how you can help us