Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
4DGaussians
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
YOLOP
You Only Look Once for Panoptic Driving Perception. (MIR 2022)
MapTR
[ICLR'23 Spotlight] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction
YOLOS
[NeurIPS 2021] You Only Look at One Sequence
GaussianDreamer
GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models (CVPR 2024)
VAD
[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving
SparseInst
[CVPR 2022] SparseInst: Sparse Instance Activation for Real-Time Instance Segmentation
Matte-Anything
[Image and Vision Computing (Vol.147 Jul. '24)] Interactive Natural Image Matting with Segment Anything Models
QueryInst
[ICCV 2021] Instances as Queries
TopFormer
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR 2022
MIMDet
[ICCV 2023] You Only Look at One Partial Sequence
TiNeuVox
TiNeuVox: Fast Dynamic Radiance Fields with Time-Aware Neural Voxels (SIGGRAPH Asia 2022)
ViTMatte
[Information Fusion] Boosting Image Matting with Pretrained Plain Vision Transformers
TeViT
Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022, Oral
GKT
Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer
BMaskR-CNN
[ECCV 2020] Boundary-preserving Mask R-CNN
HAIS
Hierarchical Aggregation for 3D Instance Segmentation (ICCV 2021)
Symphonies
[CVPR 2024] Symphonies (Scene-from-Insts): Symphonize 3D Semantic Scene Completion with Contextual Instance Queries
VMA
A general map auto annotation framework based on MapTR, with high flexibility in terms of spatial scale and element type
WeakTr
WeakTr: Exploring Plain Vision Transformer for Weakly-supervised Semantic Segmentation
LaneGAP
[ECCV 2024] Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph Construction
SparseTrack
Official PyTorch implementation of SparseTrack (the new version of code will come soon)
CrossVIS
[ICCV 2021] Crossover Learning for Fast Online Video Instance Segmentation
MSG-Transformer
MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens (CVPR 2022)
PolarDETR
BoxTeacher
[CVPR 2023] Exploring High-Quality Pseudo Masks for Weakly Supervised Instance Segmentation
TinyDet
osp
[ECCV 2024] Occupancy as Set of Points
GNeuVox
GNeuVox: Generalizable Neural Voxels for Fast Human Radiance Fields
AziNorm
AziNorm: Exploiting the Radial Symmetry of Point Cloud for Azimuth-Normalized 3D Perception, CVPR 2022.
Featurized-QueryRCNN
Featurized Query R-CNN
RILS
[CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)
PD-Quant
[CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric
MIM4D
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning
NeuSample
Code of "NeuSample: Neural Sample Field for Efficient View Synthesis"
SAUNet
A Simple Adaptive Unfolding Network for Hyperspectral Image Reconstruction
Query6DoF
Query6DoF: Learning Sparse Queries as Implicit Shape Prior for Category-Level 6DoF Pose Estimation
HDR-HexPlane
3DV 2024: Fast High Dynamic Range Radiance Fields for Dynamic Scenes
WeakSAM
WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition
ViTGaze
CircuitFormer
[NeurIPS 2023] CircuitFormer: Circuit as Set of Points
EfficientPose
MMIL-Transformer
LSFA
Real-Time and Accurate Object Detection in Compressed Video by Long Short-term Feature Aggregation
OpenInst
BoxCaseg
mancs
Mancs: A multi-task attentional network with curriculum sampling for person re-identification
RND-SCI
A Range-Null Space Decomposition Approach for Fast and Flexible Spectral Compressive Imaging
DGCN
PySA
Pyramid Self-Attention for Semantic Segmentation
EM-OLN
DiG
TOGS
The official code of "TOGS: Gaussian Splatting with Temporal Opacity Offset for Real-Time 4D DSA Rendering"
tbcl
DeepTunel