LiDAR R-CNN: An Efficient and Universal 3D Object Detector

Introduction

This is the official code of LiDAR R-CNN: An Efficient and Universal 3D Object Detector. In this work, we present LiDAR R-CNN, a second stage detector that can generally improve any existing 3D detector. We find a common problem in Point-based RCNN, which is the learned features ignore the size of proposals, and propose several methods to remedy it. Evaluated on WOD benchmarks, our method significantly outperforms previous state-of-the-art.

中文介绍：https://zhuanlan.zhihu.com/p/359800738

News

We provide the training code for multi-frame setting, and show 3 frame results based PointPillars.

Requirements

All the codes are tested in the following environment:

Linux (tested on Ubuntu 16.04)
Python 3.6+
PyTorch 1.5 or higher (tested on PyTorch 1.5, 6, 7)
CUDA 10.1

To install pybind11:

git clone [email protected]:pybind/pybind11.git
cd pybind11
mkdir build && cd build
cmake .. && make -j 
sudo make install

To install requirements:

pip install -r requirements.txt
apt-get install ninja-build libeigen3-dev

Install LiDAR_RCNN library:

python setup.py develop --user

Cuda Extensions:

# Rotated IOU
cd src/LiDAR_RCNN/ops/iou3d/
python setup.py build_ext --inplace

Preparing Data

Please refer to data processer to generate the proposal data.

Training

After preparing WOD data, we can train the vehicle only model in the paper, run this command:

python -m torch.distributed.launch --nproc_per_node=4 tools/train.py --cfg config/lidar_rcnn.yaml --name lidar_rcnn

For 3 class in WOD:

python -m torch.distributed.launch --nproc_per_node=8 tools/train.py --cfg config/lidar_rcnn_all_cls.yaml --name lidar_rcnn_all

The models and logs will be saved to work_dirs/outputs.

NOTE: for multi-frame training, please set MODEL.Frame = n in config.

Evaluation

To evaluate, run distributed testing with 4 gpus:

python -m torch.distributed.launch --nproc_per_node=4 tools/test.py --cfg config/lidar_rcnn.yaml --checkpoint outputs/lidar_rcnn/checkpoint_lidar_rcnn_59.pth.tar
python tools/create_results.py --cfg config/lidar_rcnn.yaml

Note that, you should keep the nGPUS in config equal to nproc_per_node .This will generate a val.bin file in the work_dir/results. You can create submission to Waymo server using waymo-open-dataset code by following the instructions here.

Results

Our model achieves the following performance on:

Waymo Open Dataset Challenges (3D Detection)

Proposals from	Class	Frame/Channel	3D AP L1 Vehicle	3D AP L1 Pedestrian	3D AP L1 Cyclist
PointPillars	Vehicle	1 / 1x	75.6	-	-
PointPillars	Vehicle	1 / 2x	75.6	-	-
PointPillars	Vehicle	3 / 2x	77.8	-	-
SST	Vehicle	3 / 2x	78.6	-	-
PointPillars	3 Class	1 / 1x	73.4	70.7	67.4
PointPillars	3 Class	1 / 2x	73.8	71.9	69.4

Proposals from	Class	Frame/Channel	3D AP L2 Vehicle	3D AP L2 Pedestrian	3D AP L2 Cyclist
PointPillars	Vehicle	1 / 1x	66.8	-	-
PointPillars	Vehicle	1 / 2x	67.9	-	-
PointPillars	Vehicle	3 / 2x	69.1	-	-
SST	Vehicle	3 / 2x	69.9	-	-
PointPillars	3 Class	1 / 1x	64.8	62.4	64.8
PointPillars	3 Class	1 / 2x	65.1	63.5	66.8

Note: The proposals provided by PointPillars are detected on 1 frame points cloud.

Citation

If you find our paper or repository useful, please consider citing

@article{li2021lidar,
  title={LiDAR R-CNN: An Efficient and Universal 3D Object Detector},
  author={Li, Zhichao and Wang, Feng and Wang, Naiyan},
  journal={CVPR},
  year={2021},
}

Acknowledgement

This project draws on the following codebases.

tusen-ai/LiDAR_RCNN

tusen-ai

Reviews

Repository Details