Awesome Scene Understanding
A curated list of awesome scene understanding papers, inspired by awesome-computer-vision.
📷 Multi-view images🎲 Point cloud
Related Resources
Workshops and Tutorials
Survey
Papers | Venue | Links |
---|---|---|
State-of-the-art in Automatic 3D Reconstruction of Structured Indoor Environments | CGF 2020 | [project] |
Indoor Scene Understanding in 2.5/3D for Autonomous Agents: A Survey | IEEE Access 2019 | - |
RGBD Datasets: Past, Present and Future | CVPR Workshop 2016 | [project] |
Dataset
Realistic Dataset
Synthetic Dataset
Holistic Scene Understanding
Perspective Image
Panoramic Image
Room Layout Estimation
Perspective Image
(AW: Atlanta-world, SS: single-floor and single-ceiling, PP: Piece-wise Planarity.)
Dataset | Year | Modality | #Frames | Prior | Source |
---|---|---|---|---|---|
CAD-Estate | 2023 | RGB Video | Generic | RealEstate-10K | |
Matterport3D-Layout | 2020 | RGB-D | 7360 | PP | Matterport |
ScanNet-Layout | 2020 | RGB-D | 293 | PP | ScanNet |
Structured3D | 2020 | RGB-D | 82027 | AW+SS | Structured3D |
LSUN Room Layout | 2016 | RGB | 5394 | Cuboid | SUN |
SUN RGB-D | 2015 | RGB-D | 10335 | AW+SS | NYUv2, Berkeley B3DO, and SUN3D |
NYUv2 303 | 2013 | RGB-D | 303 | Cuboid | NYUv2 |
Hedau | 2009 | RGB | 366 | Cuboid | - |
Panoramic Image
(MW: Manhattan world, AW: Atlanta world, SS: single-floor and single-ceiling.)
Dataset | Year | Modality | #Frames | Prior | Source |
---|---|---|---|---|---|
ZInD | 2021 | RGB | 71474 | AW+SS | ZinD |
MatterportLayout | 2020 | RGB-D | 2295 | MW+SS | Matterport |
Structured3D | 2020 | RGB-D | 196515 | AW+SS | Structured3D |
LayoutMP3D | 2020 | RGB-D | 2505 | MW+SS | Matterport |
2D-3D-S | 2018 | RGB-D | 571 | Cuboid | 2D-3D-S |
PanoContext | 2014 | RGB | 500 | Cuboid | SUN360 |
Floorplan
Floorplan Vectorization
Papers | Venue | Links |
---|---|---|
Parsing Line Segments of Floor Plan Images Using Graph Neural Networks | CoRR 2023 | - |
Residential floor plan recognition and reconstruction | CVPR 2021 | - |
Versailles-FP dataset: Wall Detection in Ancient Floor Plans | CoRR 2021 | - |
Deep Floor Plan Recognition using a Multi-task Network with Room-boundary-Guided Attention | ICCV 2019 | [project] |
CubiCasa5K: A Dataset and an Improved Multi-Task Model for Floorplan Image Analysis | Scandinavian Conference on Image Analysis 2019 | [code] |
Raster-to-Vector: Revisiting Floorplan Transformation | ICCV 2017 | [project] [code] |
Visual Localization
Papers | Venue | Links |
---|---|---|
LaLaLoc++: Global Floor Plan Comprehension for Layout Localisation in Unvisited Environments | ECCV 2022 | [code] |
LASER: LAtent SpacE Rendering for 2D Visual Localization | CVPR 2022 | - |
LaLaLoc: Latent Layout Localisation in Dynamic, Unvisited Environments | ICCV 2021 | - |
Primitive
Junction
Papers | Venue | Links |
---|---|---|
Manhattan Junction Catalogue for Spatial Reasoning of Indoor Scenes | CVPR 2013 | - |