π² Stable-DINO: Detection Transformer with Stable Matching
Shilong Liu*, Tianhe Ren*, Jiayu Chen*, Zhaoyang Zeng, Hao Zhang, Feng Li, Hongyang Li, Jun Huang, Hang Su, Jun Zhu, Lei Zhang
(*) equal contribution, (
[Stable-DINO Paper
] [Focal-Stable-DINO Report
] [BibTex
] [Code in detrex (coming soon)
]
β¨ News
26 Apr, 2023
: By combining with FocalNet-Huge backbone, Focal-Stable-DINO achieves 64.6 AP on COCO val2017 and 64.8 AP on COCO test-dev without any test time augmentation! Check our Technical Report for more details.12 Apr, 2023
: Preprint our paper on ArXiv!
π‘ Highlight
- High performance. Maybe the strongest object detector. 63.8 AP on COCO with Swin-Large backbones (only 218M parameters).
- Scalable. Combining with larger backbone FocalNet-Huge (only 689M parameters), Stable-DINO still improves the performance to 64.6 AP on COCO val2017 and 64.8 AP on COCO test-dev without any test time augmentation.
- Easy to use. Only a few lines of code to be modified on DINO.
- Lightweight. Nearly no extra cost during training and inference compared with DINO.
- Generalization. Easy to combine with the existing DETR variants and boost the performance.
π Methods:
π Results:
π Related Projects:
π₯ Citing Stable-DINO
If you use Stable-DINO in your research or wish to refer to the baseline results published here, please use the following BibTeX entry.
@misc{liu2023detection,
title={Detection Transformer with Stable Matching},
author={Shilong Liu and Tianhe Ren and Jiayu Chen and Zhaoyang Zeng and Hao Zhang and Feng Li and Hongyang Li and Jun Huang and Hang Su and Jun Zhu and Lei Zhang},
year={2023},
eprint={2304.04742},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
@misc{ren2023strong,
title={A Strong and Reproducible Object Detector with Only Public Datasets},
author={Tianhe Ren and Jianwei Yang and Shilong Liu and Ailing Zeng and Feng Li and Hao Zhang and Hongyang Li and Zhaoyang Zeng and Lei Zhang},
year={2023},
eprint={2304.13027},
archivePrefix={arXiv},
primaryClass={cs.CV}
}