• Stars
    star
    4,781
  • Rank 8,401 (Top 0.2 %)
  • Language
    Python
  • License
    Apache License 2.0
  • Created almost 4 years ago
  • Updated about 1 month ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

OpenMMLab's next-generation platform for general 3D object detection.
 
OpenMMLab website HOT      OpenMMLab platform TRY IT OUT
 

docs badge codecov license

News:

We have renamed the branch 1.1 to main and switched the default branch from master to main. We encourage users to migrate to the latest version, though it comes with some cost. Please refer to Migration Guide for more details.

v1.1.1 was released in 30/5/2023

We have constructed a comprehensive LiDAR semantic segmentation benchmark on SemanticKITTI, including Cylinder3D, MinkUNet and SPVCNN methods. Noteworthy, the improved MinkUNetv2 can achieve 70.3 mIoU on the validation set of SemanticKITTI. We have also supported the training of BEVFusion and an occupancy prediction method, TPVFomrer, in our projects. More new features about 3D perception are on the way. Please stay tuned!

Introduction

English | 简体中文

The main branch works with PyTorch 1.8+.

MMDetection3D is an open source object detection toolbox based on PyTorch, towards the next-generation platform for general 3D detection. It is a part of the OpenMMLab project developed by MMLab.

demo image

Major features

  • Support multi-modality/single-modality detectors out of box

    It directly supports multi-modality/single-modality detectors including MVXNet, VoteNet, PointPillars, etc.

  • Support indoor/outdoor 3D detection out of box

    It directly supports popular indoor and outdoor 3D detection datasets, including ScanNet, SUNRGB-D, Waymo, nuScenes, Lyft, and KITTI. For nuScenes dataset, we also support nuImages dataset.

  • Natural integration with 2D detection

    All the about 300+ models, methods of 40+ papers, and modules supported in MMDetection can be trained or used in this codebase.

  • High efficiency

    It trains faster than other codebases. The main results are as below. Details can be found in benchmark.md. We compare the number of samples trained per second (the higher, the better). The models that are not supported by other codebases are marked by .

    Methods MMDetection3D OpenPCDet votenet Det3D
    VoteNet 358 77
    PointPillars-car 141 140
    PointPillars-3class 107 44
    SECOND 40 30
    Part-A2 17 14

Like MMDetection and MMCV, MMDetection3D can also be used as a library to support different projects on top of it.

License

This project is released under the Apache 2.0 license.

Changelog

1.1.0 was released in 6/4/2023.

Please refer to changelog.md for details and release history.

Benchmark and model zoo

Results and models are available in the model zoo.

Components
Backbones Heads Features
Architectures
3D Object Detection Monocular 3D Object Detection Multi-modal 3D Object Detection 3D Semantic Segmentation
  • Outdoor
  • Indoor
  • Outdoor
  • Indoor
  • Outdoor
  • Indoor
  • Outdoor
  • Indoor
  • ResNet PointNet++ SECOND DGCNN RegNetX DLA MinkResNet Cylinder3D MinkUNet
    SECOND
    PointPillars
    FreeAnchor
    VoteNet
    H3DNet
    3DSSD
    Part-A2
    MVXNet
    CenterPoint
    SSN
    ImVoteNet
    FCOS3D
    PointNet++
    Group-Free-3D
    ImVoxelNet
    PAConv
    DGCNN
    SMOKE
    PGD
    MonoFlex
    SA-SSD
    FCAF3D
    PV-RCNN
    Cylinder3D
    MinkUNet
    SPVCNN

    Note: All the about 300+ models, methods of 40+ papers in 2D detection supported by MMDetection can be trained or used in this codebase.

    Installation

    Please refer to get_started.md for installation.

    Get Started

    Please see get_started.md for the basic usage of MMDetection3D. We provide guidance for quick run with existing dataset and with new dataset for beginners. There are also tutorials for learning configuration systems, customizing dataset, designing data pipeline, customizing models, customizing runtime settings and Waymo dataset.

    Please refer to FAQ for frequently asked questions. When updating the version of MMDetection3D, please also check the compatibility doc to be aware of the BC-breaking updates introduced in each version.

    Citation

    If you find this project useful in your research, please consider cite:

    @misc{mmdet3d2020,
        title={{MMDetection3D: OpenMMLab} next-generation platform for general {3D} object detection},
        author={MMDetection3D Contributors},
        howpublished = {\url{https://github.com/open-mmlab/mmdetection3d}},
        year={2020}
    }

    Contributing

    We appreciate all contributions to improve MMDetection3D. Please refer to CONTRIBUTING.md for the contributing guideline.

    Acknowledgement

    MMDetection3D is an open source project that is contributed by researchers and engineers from various colleges and companies. We appreciate all the contributors as well as users who give valuable feedbacks. We wish that the toolbox and benchmark could serve the growing research community by providing a flexible toolkit to reimplement existing methods and develop their own new 3D detectors.

    Projects in OpenMMLab

    • MMEngine: OpenMMLab foundational library for training deep learning models.
    • MMCV: OpenMMLab foundational library for computer vision.
    • MMEval: A unified evaluation library for multiple machine learning libraries.
    • MIM: MIM installs OpenMMLab packages.
    • MMPreTrain: OpenMMLab pre-training toolbox and benchmark.
    • MMDetection: OpenMMLab detection toolbox and benchmark.
    • MMDetection3D: OpenMMLab's next-generation platform for general 3D object detection.
    • MMRotate: OpenMMLab rotated object detection toolbox and benchmark.
    • MMYOLO: OpenMMLab YOLO series toolbox and benchmark.
    • MMSegmentation: OpenMMLab semantic segmentation toolbox and benchmark.
    • MMOCR: OpenMMLab text detection, recognition, and understanding toolbox.
    • MMPose: OpenMMLab pose estimation toolbox and benchmark.
    • MMHuman3D: OpenMMLab 3D human parametric model toolbox and benchmark.
    • MMSelfSup: OpenMMLab self-supervised learning toolbox and benchmark.
    • MMRazor: OpenMMLab model compression toolbox and benchmark.
    • MMFewShot: OpenMMLab fewshot learning toolbox and benchmark.
    • MMAction2: OpenMMLab's next-generation action understanding toolbox and benchmark.
    • MMTracking: OpenMMLab video perception toolbox and benchmark.
    • MMFlow: OpenMMLab optical flow toolbox and benchmark.
    • MMagic: OpenMMLab Advanced, Generative and Intelligent Creation toolbox.
    • MMGeneration: OpenMMLab image and video generative models toolbox.
    • MMDeploy: OpenMMLab model deployment framework.

    More Repositories

    1

    mmdetection

    OpenMMLab Detection Toolbox and Benchmark
    Python
    27,899
    star
    2

    mmsegmentation

    OpenMMLab Semantic Segmentation Toolbox and Benchmark.
    Python
    7,372
    star
    3

    mmagic

    OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
    Jupyter Notebook
    6,575
    star
    4

    mmcv

    OpenMMLab Computer Vision Foundation
    Python
    5,626
    star
    5

    mmpose

    OpenMMLab Pose Estimation Toolbox and Benchmark.
    Python
    4,969
    star
    6

    OpenPCDet

    OpenPCDet Toolbox for LiDAR-based 3D Object Detection.
    Python
    4,342
    star
    7

    mmocr

    OpenMMLab Text Detection, Recognition and Understanding Toolbox
    Python
    4,099
    star
    8

    Amphion

    Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
    Python
    3,947
    star
    9

    mmaction2

    OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
    Python
    3,903
    star
    10

    mmtracking

    OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
    Python
    3,384
    star
    11

    mmpretrain

    OpenMMLab Pre-training Toolbox and Benchmark
    Python
    3,138
    star
    12

    mmselfsup

    OpenMMLab Self-Supervised Learning Toolbox and Benchmark
    Python
    3,084
    star
    13

    mmskeleton

    A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
    Python
    2,855
    star
    14

    mmyolo

    OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated,YOLOv5, YOLOv6, YOLOv7, YOLOv8,YOLOX, PPYOLOE, etc.
    Python
    2,719
    star
    15

    mmdeploy

    OpenMMLab Model Deployment Framework
    Python
    2,520
    star
    16

    mmaction

    An open-source toolbox for action understanding based on PyTorch
    Python
    1,847
    star
    17

    mmgeneration

    MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.
    Python
    1,796
    star
    18

    mmrotate

    OpenMMLab Rotated Object Detection Toolbox and Benchmark
    Python
    1,724
    star
    19

    Multimodal-GPT

    Multimodal-GPT
    Python
    1,397
    star
    20

    mmrazor

    OpenMMLab Model Compression Toolbox and Benchmark.
    Python
    1,366
    star
    21

    mmfashion

    Open-source toolbox for visual fashion analysis based on PyTorch
    Python
    1,204
    star
    22

    mmhuman3d

    OpenMMLab 3D Human Parametric Model Toolbox and Benchmark
    Python
    1,129
    star
    23

    mmengine

    OpenMMLab Foundational Library for Training Deep Learning Models
    Python
    1,029
    star
    24

    playground

    A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.
    Python
    1,017
    star
    25

    OpenMMLabCourse

    OpenMMLab course index and stuff
    Jupyter Notebook
    919
    star
    26

    mmflow

    OpenMMLab optical flow toolbox and benchmark
    Python
    889
    star
    27

    PIA

    [CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画
    Python
    716
    star
    28

    mmfewshot

    OpenMMLab FewShot Learning Toolbox and Benchmark
    Python
    662
    star
    29

    awesome-vit

    396
    star
    30

    OpenUnReID

    PyTorch open-source toolbox for unsupervised or domain adaptive object re-ID.
    Python
    389
    star
    31

    labelbee-client

    Out-of-the-box Annotation Toolbox
    JavaScript
    374
    star
    32

    mim

    MIM Installs OpenMMLab Packages
    Python
    329
    star
    33

    denseflow

    Extracting optical flow and frames
    C++
    286
    star
    34

    MMGEN-FaceStylor

    Python
    245
    star
    35

    mmeval

    A unified evaluation library for multiple machine learning libraries
    Python
    243
    star
    36

    labelbee

    LabelBee is an annotation Library
    TypeScript
    223
    star
    37

    OpenMMLabCamp

    Jupyter Notebook
    92
    star
    38

    polynet

    The Github Repo for PolyNet
    77
    star
    39

    CLUE

    C++ Lightweight Utility Extensions
    C++
    70
    star
    40

    mim-example

    Python
    57
    star
    41

    mmengine-template

    Python
    47
    star
    42

    ecosystem

    36
    star
    43

    mmstyles

    Latex style file to facilitate writing of technical papers
    TeX
    36
    star
    44

    mmpose-webcam-demo

    Python
    23
    star
    45

    pre-commit-hooks

    Python
    17
    star
    46

    mdformat-openmmlab

    Python
    8
    star
    47

    .github

    3
    star