There are no reviews yet. Be the first to send feedback to the community and the maintainers!
OneFormer
OneFormer: One Transformer to Rule Universal Image Segmentation, arxiv 2022 / CVPR 2023Versatile-Diffusion
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023Neighborhood-Attention-Transformer
Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022Prompt-Free-Diffusion
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024Matting-Anything
Matting Anything Model (MAM), an efficient and versatile framework for estimating the alpha matte of any instance in an image with flexible and interactive visual or linguistic user prompt guidance.Compact-Transformers
Escaping the Big Data Paradigm with Compact Transformers, 2021 (Train your Vision Transformers in 30 mins on CIFAR-10 with a single GPU!)Cross-Scale-Non-Local-Attention
PyTorch code for our paper "Image Super-Resolution with Cross-Scale Non-Local Attention and Exhaustive Self-Exemplars Mining" (CVPR2020).Pyramid-Attention-Networks
[IJCV] Pyramid Attention Networks for Image Restoration: new SOTA results on multiple image restoration tasks: denoising, demosaicing, compression artifact reduction, super-resolutionNATTEN
Neighborhood Attention Extension. Bringing attention to a neighborhood near you!Smooth-Diffusion
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024VCoder
VCoder: Versatile Vision Encoders for Multimodal Large Language Models, arXiv 2023 / CVPR 2024Rethinking-Text-Segmentation
[CVPR 2021] Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement ApproachAgriculture-Vision
[CVPR 2020 & 2021 & 2022 & 2023] Agriculture-Vision Dataset, Prize Challenge and Workshop: A joint effort with many great collaborators to bring Agriculture and Computer Vision/AI communities together to benefit humanity!Self-Similarity-Grouping
Self-similarity Grouping: A Simple Unsupervised Cross Domain Adaptation Approach for Person Re-identification (ICCV 2019, Oral)FcF-Inpainting
[WACV 2023] Keys to Better Image Inpainting: Structure and Texture Go Hand in HandDecoupled-Classification-Refinement
Revisiting RCNN: On Awakening the Classification Power of Faster RCNN (ECCV 2018)Convolutional-MLPs
[Preprint] ConvMLP: Hierarchical Convolutional MLPs for Vision, 20213D-Point-Cloud-Learning
CuMo
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-ExpertsForget-Me-Not
Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models, 2023VMFormer
[Preprint] VMFormer: End-to-End Video Matting with TransformerSemi-Supervised-Transfer-Learning
[CVPR 2021] Adaptive Consistency Regularization for Semi-Supervised Transfer LearningSGL-Retinal-Vessel-Segmentation
[MICCAI 2021] Study Group Learning: Improving Retinal Vessel Segmentation Trained with Noisy Labels: New SOTA on both DRIVE and CHASE_DB1.StyleNAT
New flexible and efficient image generation framework that sets new SOTA on FFHQ-256 with FID 2.05, 2022Unsupervised-Domain-Adaptation-with-Differential-Treatment
[CVPR 2020] Differential Treatment for Stuff and Things: A Simple Unsupervised Domain Adaptation Method for Semantic SegmentationText2Video-Zero-sd-webui
GFR-DSOD
Improving Object Detection from Scratch via Gated Feature Reuse (BMVC 2019)SH-GAN
[WACV 2023] Image Completion with Heterogeneously Filtered Spectral HintsVIM
UltraSR-Arbitrary-Scale-Super-Resolution
[Preprint] UltraSR: Spatial Encoding is a Missing Key for Implicit Image Function-based Arbitrary-Scale Super-Resolution, 2021Any-Precision-DNNs
Any-Precision Deep Neural Networks (AAAI 2021)Pseudo-IoU-for-Anchor-Free-Object-Detection
Pseudo-IoU: Improving Label Assignment in Anchor-Free Object DetectionHuman-Object-Interaction-Detection
CompFeat-for-Video-Instance-Segmentation
CompFeat: Comprehensive Feature Aggregation for Video Instance Segmentation (AAAI 2021)Diffusion-Driven-Test-Time-Adaptation-via-Synthetic-Domain-Alignment
Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain AlignmentOneFormer-Colab
[Colab Demo Code] OneFormer: One Transformer to Rule Universal Image Segmentation.DiSparse-Multitask-Model-Compression
[CVPR 2022] DiSparse: Disentangled Sparsification for Multitask Model CompressionInterpretable-Visual-Reasoning
[ICCV 2021] Interpretable Visual Reasoning via Induced Symbolic SpaceMask-Selection-Networks
[CVPR 2021] Youtube-VIS 2021 3rd place, [CVPR 2020] winner DAVIS 2020. Code for mask selection based methods.Activity-Recognition
Boosted-Dynamic-Networks
Boosted Dynamic Neural Networks, AAAI 2023Aneurysm-Segmentation-with-Multi-Teacher-Pseudo-Labels
Love Open Source and this site? Check out how you can help us