There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate AnythingGroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"DINO
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"T-Rex
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt SynergyDWPose
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)detrex
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.awesome-detection-transformer
Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)MaskDINO
[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"Grounding-DINO-1.5-API
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model SeriesOpenSeeD
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"Motion-X
[NeurIPS 2023] Official implementation of the paper "Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset"DN-DETR
[CVPR 2022 Oral] Official implementation of DN-DETRDAB-DETR
[ICLR 2022] Official implementation of the paper "DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR"OSX
[CVPR 2023] Official implementation of the paper "One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer"HumanTOMATO
[ICML 2024] ๐ HumanTOMATO: Text-aligned Whole-body Motion GenerationMotionLLM
[Arxiv-2024] MotionLLM: Understanding Human Behaviors from Human Motions and Videosdeepdataspace
The Go-To Choice for CV Data Visualization, Annotation, and Model Analysis.Stable-DINO
[ICCV 2023] Official implementation of the paper "Detection Transformer with Stable Matching"Lite-DETR
[CVPR 2023] Official implementation of the paper "Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR"DreamWaltz
[NeurIPS 2023] Official implementation of the paper "DreamWaltz: Make a Scene with Complex 3D Animatable Avatars".MP-Former
[CVPR 2023] Official implementation of the paper: MP-Former: Mask-Piloted Transformer for Image SegmentationHumanSD
The official implementation of paper "HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation"HumanArt
The official implementation of CVPR 2023 paper "Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes"ED-Pose
The official repo for [ICLR'23] "Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation "DQ-DETR
[AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and GroundingDisCo-CLIP
Official PyTorch implementation of the paper "DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training".LipsFormer
DiffHOI
Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"TOSS
[ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"IYFC
TAPTR
detrex-storage
Love Open Source and this site? Check out how you can help us