Vim
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

YOLOP
You Only Look Once for Panoptic Driving Perception (MIR 2022)

4DGaussians
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

MapTR
[ICLR'23 Spotlight] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction

YOLOS
[NeurIPS 2021] You Only Look at One Sequence

SparseInst
[CVPR 2022] SparseInst: Sparse Instance Activation for Real-Time Instance Segmentation

GaussianDreamer
GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models (CVPR 2024)

Matte-Anything
Matte Anything: Interactive Natural Image Matting with Segment Anything Models

QueryInst
[ICCV 2021] Instances as Queries

VAD
[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving

TopFormer
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR 2022

MIMDet
[ICCV 2023] You Only Look at One Partial Sequence

TiNeuVox
TiNeuVox: Fast Dynamic Radiance Fields with Time-Aware Neural Voxels (SIGGRAPH Asia 2022)

ViTMatte
[Information Fusion] Boosting Image Matting with Pretrained Plain Vision Transformers

TeViT
Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022, Oral

GKT
Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer

BMaskR-CNN
[ECCV 2020] Boundary-preserving Mask R-CNN

HAIS
Hierarchical Aggregation for 3D Instance Segmentation (ICCV 2021)

VMA
A general map auto annotation framework based on MapTR, with high flexibility in terms of spatial scale and element type

Symphonies
[CVPR 2024] Symphonies (Scene-from-Insts): Symphonize 3D Semantic Scene Completion with Contextual Instance Queries

WeakTr
WeakTr: Exploring Plain Vision Transformer for Weakly-supervised Semantic Segmentation

LaneGAP
Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph Construction

SparseTrack
Official PyTorch implementation of SparseTrack (the new version of code will come soon)

CrossVIS
[ICCV 2021] Crossover Learning for Fast Online Video Instance Segmentation

MSG-Transformer
MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens (CVPR 2022)

PolarDETR

BoxTeacher
[CVPR 2023] Exploring High-Quality Pseudo Masks for Weakly Supervised Instance Segmentation

TinyDet

GNeuVox
GNeuVox: Generalizable Neural Voxels for Fast Human Radiance Fields

AziNorm
AziNorm: Exploiting the Radial Symmetry of Point Cloud for Azimuth-Normalized 3D Perception, CVPR 2022.

Featurized-QueryRCNN
Featurized Query R-CNN

RILS
[CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)

PD-Quant
[CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric

MIM4D
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning

NeuSample
Code of "NeuSample: Neural Sample Field for Efficient View Synthesis"

SAUNet
A Simple Adaptive Unfolding Network for Hyperspectral Image Reconstruction

Query6DoF
Query6DoF: Learning Sparse Queries as Implicit Shape Prior for Category-Level 6DoF Pose Estimation

HDR-HexPlane
3DV 2024: Fast High Dynamic Range Radiance Fields for Dynamic Scenes

WeakSAM
WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition

CircuitFormer
[NeurIPS 2023] CircuitFormer: Circuit as Set of Points

MMIL-Transformer

LSFA
Real-Time and Accurate Object Detection in Compressed Video by Long Short-term Feature Aggregation

EfficientPose

OpenInst

BoxCaseg

mancs
Mancs: A multi-task attentional network with curriculum sampling for person re-identification

RND-SCI
A Range-Null Space Decomposition Approach for Fast and Flexible Spectral Compressive Imaging

DGCN

PySA
Pyramid Self-Attention for Semantic Segmentation

EM-OLN

BCF
Xinggang Wang, Bin Feng, Xiang Bai, Wenyu Liu, and Longin Jan Latecki. Bag of Contour Fragments for Robust Shape Classification. Pattern Recognition, Volume 47, Issue 6, June 2014, Pages 2116-2125.

tbcl

DeepTunel