Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model4DGaussians
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene RenderingYOLOP
You Only Look Once for Panopitic Driving Perception.(MIR2022)MapTR
[ICLR'23 Spotlight] MapTR: Structured Modeling and Learning for Online Vectorized HD Map ConstructionYOLOS
[NeurIPS 2021] You Only Look at One SequenceGaussianDreamer
GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models (CVPR 2024)VAD
[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous DrivingSparseInst
[CVPR 2022] SparseInst: Sparse Instance Activation for Real-Time Instance SegmentationMatte-Anything
[Image and Vision Computing (Vol.147 Jul. '24)] Interactive Natural Image Matting with Segment Anything ModelsQueryInst
[ICCV 2021] Instances as QueriesTopFormer
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR2022MIMDet
[ICCV 2023] You Only Look at One Partial SequenceTiNeuVox
TiNeuVox: Fast Dynamic Radiance Fields with Time-Aware Neural Voxels (SIGGRAPH Asia 2022)ViTMatte
[Information Fusion] Boosting Image Matting with Pretrained Plain Vision TransformersTeViT
Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022, OralGKT
Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel TransformerBMaskR-CNN
[ECCV 2020] Boundary-preserving Mask R-CNNHAIS
Hierarchical Aggregation for 3D Instance Segmentation (ICCV 2021)Symphonies
[CVPR 2024] Symphonies (Scene-from-Insts): Symphonize 3D Semantic Scene Completion with Contextual Instance QueriesVMA
A general map auto annotation framework based on MapTR, with high flexibility in terms of spatial scale and element typeWeakTr
WeakTr: Exploring Plain Vision Transformer for Weakly-supervised Semantic SegmentationLaneGAP
[ECCV 2024] Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph ConstructionSparseTrack
Official PyTorch implementation of SparseTrack (the new version of code will come soon)CrossVIS
[ICCV 2021] Crossover Learning for Fast Online Video Instance SegmentationMSG-Transformer
MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens (CVPR 2022)PolarDETR
BoxTeacher
[CVPR 2023] Exploring High-Quality Pseudo Masks for Weakly Supervised Instance SegmentationTinyDet
osp
[ECCV 2024] Occupancy as Set of PointsGNeuVox
GNeuVox: Generalizable Neural Voxels for Fast Human Radiance FieldsAziNorm
AziNorm: Exploiting the Radial Symmetry of Point Cloud for Azimuth-Normalized 3D Perception, CVPR 2022.Featurized-QueryRCNN
Featurized Query R-CNNRILS
[CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)PD-Quant
[CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference MetricMIM4D
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation LearningNeuSample
Code of "NeuSample: Neural Sample Field for Efficient View Synthesis"SAUNet
A Simple Adaptive Unfolding Network for Hyperspectral Image ReconstructionQuery6DoF
Query6DoF: Learning Sparse Queries as Implicit Shape Prior for Category-Level 6DoF Pose EstimationHDR-HexPlane
3DV 2024: Fast High Dynamic Range Radiance Fields for Dynamic ScenesWeakSAM
WeakSAM: Segment Anything Meets Weakly-supervised Instance-level RecognitionViTGaze
CircuitFormer
[NeurIPS 2023] CircuitFormer: Circuit as Set of PointsEfficientPose
MMIL-Transformer
LSFA
Real-Time and Accurate Object Detection in Compressed Video by Long Short-term Feature AggregationOpenInst
BoxCaseg
mancs
Mancs: A multi-task attentional network with curriculum sampling for person re-identificationRND-SCI
A Range-Null Space Decomposition Approach for Fast and Flexible Spectral Compressive ImagingDGCN
PySA
Pyramid Self-Attention for Semantic SegmentationEM-OLN
BCF
Xinggang Wang, Bin Feng, Xiang Bai, Wenyu Liu, and Longin Jan Latecki. Bag of Contour Fragments for Robust Shape Classification. Pattern Recognition, Volume 47, Issue 6, June 2014, Pages 2116-2125.DiG
TOGS
The official code of "TOGS: Gaussian Splatting with Temporal Opacity Offset for Real-Time 4D DSA Rendering"tbcl
DeepTunel
Love Open Source and this site? Check out how you can help us