• Stars
    star
    100
  • Rank 340,703 (Top 7 %)
  • Language
    Python
  • Created about 4 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A Large Dataset of Object Scans (redwood-3dscan)

Abstract

We have created a dataset of more than ten thousand 3D scans of real objects. To create the dataset, we recruited 70 operators, equipped them with consumer-grade mobile 3D scanning setups, and paid them to scan objects in their environments. The operators scanned objects of their choosing, outside the laboratory and without direct supervision by computer vision professionals. The result is a large and diverse collection of object scans: from shoes, mugs, and toys to grand pianos, construction vehicles, and large outdoor sculptures. We worked with an attorney to ensure that data acquisition did not violate privacy constraints. The acquired data was irrevocably placed in the public domain and is available freely.

You can preview and explore the dataset at http://redwood-data.org/3dscan/.

Dataset organization

If you use our download scripts, the downloaded dataset will be stored in data/.

data
β”œβ”€β”€ rgbd (10,933 RGBD scans, each contains multiple RGB and depth images)
β”‚Β Β  β”œβ”€β”€ 00001.zip
β”‚Β Β  β”œβ”€β”€ 00002.zip
β”‚Β Β  β”œβ”€β”€ ...
β”‚Β Β  β”œβ”€β”€ 11097.zip
β”‚Β Β  └── 11098.zip
β”œβ”€β”€ mesh (441 reconstructed models)
β”‚Β Β  β”œβ”€β”€ 00004.ply
β”‚Β Β  β”œβ”€β”€ 00033.ply
β”‚Β Β  β”œβ”€β”€ ...
β”‚Β Β  β”œβ”€β”€ 10548.ply
β”‚Β Β  └── 10664.ply
└── video (10,933 videos created from images)
    β”œβ”€β”€ 00001.mp4
    β”œβ”€β”€ 00002.mp4
    β”œβ”€β”€ ...
    β”œβ”€β”€ 11097.mp4
    └── 11098.mp4
  • Dataset size: The entire dataset is ~4TB in size.
  • RGBD scans: There are 10,933 RGBD scans. Each scan contains multiple RGBD images.
  • Meshes: There are 441 reconstructed mesh models.
  • Videos: There are 10,933 videos, created from RGBD images.
  • Categories: 9,131 RGBD scans are categorized into 320 categories. Each scan has at most one category.

Example usage

import redwood_3dscan as rws

# Print RGBD scans (list)
print(rws.rgbds)

# Print meshes (list)
print(rws.meshes)

# Print categories (dict: string->list)
print(rws.categories)

# Print all scan_id of the "sofa" category
print(rws.categories["sofa"])

# Download by scan_id "00033"
# Download will be skipped if the resource is unavailable
# e.g. some RGBD images may not come with mesh.
rws.download_rgbd("00033")  # Save to data/rgbd/00033.zip, if available
rws.download_mesh("00033")  # Save to data/mesh/00033.ply, if available
rws.download_video("00033") # Save to data/video/00033.mp4, if available
rws.download_all("00033")   # Downloads rgbd, mesh, and video together

# Download by category "sofa"
rws.download_category("sofa")

You can use Open3D to load and visualize the models.

# For installation guides, see:
# http://www.open3d.org/docs/release/getting_started.html
pip install open3d
import redwood_3dscan as rws
import open3d as o3d

rws.download_mesh("00033")
mesh = o3d.io.read_triangle_mesh("data/mesh/00033.ply")
mesh.compute_vertex_normals()
o3d.visualization.draw_geometries([mesh])

Open3D Visualizer

License

The entire dataset, including both RGB-D scans and reconstructed models, is in the public domain. Any part of the dataset can be used for any purpose with proper attribution. If you use any of the data, please cite our technical report.

@article{Choi2016,
          author    = {Sungjoon Choi and Qian-Yi Zhou and Stephen Miller and Vladlen Koltun},
          title     = {A Large Dataset of Object Scans},
          journal   = {arXiv:1602.02481},
          year      = {2016},
        }

File Format

The RGB-D sequences were acquired with PrimeSense Carmine cameras. The resolution is 640Γ—480, the frame rate is 30Hz. Each scan is packaged in a zip archive that contains consecutive color images stored as JPG and depth images stored as 16-bit PNG, where pixel values represent depth in millimeters. The first part of a file name indicates the frame number and the second part provides a time stamp in microseconds.

The focal length is 525 for both axes and the principal point is (319.5, 239.5). The depth images are registered to the color images.

The reconstructed models are all in PLY file format.

More Repositories

1

Open3D

Open3D: A Modern Library for 3D Data Processing
C++
11,405
star
2

MiDaS

Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
Python
4,418
star
3

OpenBot

OpenBot leverages smartphones as brains for low-cost robots. We have designed a small electric vehicle that costs about $50 and serves as a robot body. Our software stack for Android smartphones supports advanced robotics workloads such as person following and real-time autonomous navigation.
Swift
2,819
star
4

ZoeDepth

Metric depth estimation from a single image
Jupyter Notebook
2,167
star
5

DPT

Dense Prediction Transformers
Python
1,866
star
6

Open3D-ML

An extension of Open3D to address 3D Machine Learning tasks
Python
1,821
star
7

PhotorealismEnhancement

Code & Data for Enhancing Photorealism Enhancement
Python
1,237
star
8

MultiObjectiveOptimization

Source code for Neural Information Processing Systems (NeurIPS) 2018 paper "Multi-Task Learning as Multi-Objective Optimization"
Python
753
star
9

lang-seg

Language-Driven Semantic Segmentation
Jupyter Notebook
704
star
10

FastGlobalRegistration

Fast Global Registration
C++
500
star
11

Open3D-PointNet2-Semantic3D

Semantic3D segmentation with Open3D and PointNet++
Python
461
star
12

FreeViewSynthesis

Code repository for "Free View Synthesis", ECCV 2020.
Python
262
star
13

spear

SPEAR: A Simulator for Photorealistic Embodied AI Research
C++
219
star
14

StableViewSynthesis

Python
212
star
15

DeepLagrangianFluids

Code repository for "Lagrangian Fluid Simulation with Continuous Convolutions", ICLR 2020.
Python
207
star
16

DirectFuturePrediction

Code for the paper "Learning to Act by Predicting the Future", Alexey Dosovitskiy and Vladlen Koltun, ICLR 2017
Python
152
star
17

VI-Depth

Code for Monocular Visual-Inertial Depth Estimation (ICRA 2023)
Python
147
star
18

NPHard

Combinatorial Optimization with Graph Convolutional Networks and Guided Tree Search
Python
139
star
19

Intseg

Interactive Image Segmentation with Latent Diversity
Python
78
star
20

TanksAndTemples

Toolbox for the TanksAndTemples benchmark website
Python
58
star
21

dcflow

Code for the paper "Accurate Optical Flow via Direct Cost Volume Processing. Jia Xu, RenΓ© Ranftl, and Vladlen Koltun. CVPR 2017"
C++
52
star
22

adaptive-surface-reconstruction

Adaptive Surface Reconstruction for 3D Data Processing
Python
49
star
23

open3d-cmake-find-package

Find pre-installed Open3D package in CMake
C++
48
star
24

DFE

Python
43
star
25

vision-for-action

Code to accompany "Does computer vision matter for action?"
Python
41
star
26

LMRS

Source code for ICLR 2020 paper: "Learning to Guide Random Search"
Python
39
star
27

objects-with-lighting

Repository for the Objects With Lighting Dataset
Python
36
star
28

open3d_downloads

Hosting Open3D test data for development use
23
star
29

Open3D-3rdparty

C
20
star
30

open3d-cmake-external-project

Use Open3D as a CMake external project
CMake
18
star
31

0shot-object-insertion

Simulation and robot code for contact-rich household object insertion (ICRA 2023).
Python
11
star
32

Open3D-Viewer

C++
7
star
33

generalized-smoothing

Companion code for the ICML 2022 paper "Generalizing Gaussian Smoothing for Random Search"
Python
7
star
34

Open3D-Python-CI

Testing Open3D Python package from PyPI and Conda
4
star
35

MetaLearningTradeoffs

Source code for the NeurIPS 2020 Paper: Modeling and Optimization Trade-off in Meta-learning.
Python
4
star
36

hello-world-docker-action

Dockerfile
1
star
37

mshadow

Forked from https://github.com/dmlc/mshadow
C++
1
star