• Stars
    star
    137
  • Rank 258,500 (Top 6 %)
  • Language
  • Created almost 4 years ago
  • Updated almost 3 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Paper list of monocular 3D human pose and shape estimation

Table of Contents

Single-view recovery

  • [CVPR2021] Frank Yu et al., PCLs: Geometry-aware Neural Reconstruction of 3D Pose with Perspective Crop Layers

  • [ECCV2020],I2L-MeshNet: Image-to-Lixel Prediction Network for Accurate 3D Human Pose and Mesh Estimation from a Single RGB Image

  • [CVPR2020],3D Human Mesh Regression with Dense Correspondence

  • [ICCV2019],Georgios Pavlakos et al,TexturePose: Supervising Human Mesh Estimation with Texture Consistency

  • [ICCV2019], Nikos Kolotouros et al., Learning to Reconstruct 3D Human Pose and Shape via Model-Fitting in the Loop

  • [ICCV2019], Saurabh Sharma et al., Monocular 3D Human Pose Estimation by Generation and Ordinal Ranking

  • [ICCV2019], Kun Zhou et al., HEMlets Pose: Learning Part-Centric Heatmap Triplets for Accurate 3D Human Pose Estimation

  • [CVPR2019], Long Zhao et al., Semantic Graph Convolutional Networks for 3D Human Pose Regression

  • [CVPR2019], Chen Li et al., Generating Multiple Hypotheses for 3D Human Pose Estimation With Mixture Density Network

  • [CVPR2019], Riza Alp Guler et al., HoloPose: Holistic 3D Human Reconstruction In-The-Wild

  • [CVPR2019], Xipeng Chen et al., Weakly-Supervised Discovery of Geometry-Aware Representation for 3D Human Pose Estimation

  • [CVPR2019], Ikhsanul Habibie et al., In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations

2D pose to 3D pose

  • [CVPR2021] Xiaoxuan Ma et al., Context Modeling in 3D Human Pose Estimation: A Unified Perspective

  • [CVPR2021] Tianhan Xu et al., Graph Stacked Hourglass Networks for 3D Human Pose Estimation

  • [ICCV2019], Hai Ci et al., Optimizing Network Structure for 3D Human Pose Estimation

  • [CVPR2019], Dario Pavllo et al., 3D Human Pose Estimation in Video With Temporal Convolutions and Semi-Supervised Training

Video

  • [CVPR2021] Hongsuk Choi et al., Beyond Static Features for Temporally Consistent 3D Human Pose and Shape from a Video

  • [CVPR2021] Ye Yuan et al., SimPoE: Simulated Character Control for 3D Human Pose Estimation

  • [CVPR2020] Muhammed Kocabas et al., VIBE: Video Inference for Human Body Pose and Shape Estimation

  • [CVPR2019] Angjoo Kanazawa et al., Learning 3D Human Dynamics from Video

  • [ICCV2019], Jason Y. Zhang et al., Predicting 3D Human Dynamics from Video

  • [ICCV2019], Yu Sun et al., Human Mesh Recovery from Monocular Images via a Skeleton-disentangled Representation

  • [CVPR2019], Anurag Arnab et al., Exploiting temporal context for 3D human pose estimation in the wild

Multi-view recovery

  • [ECCV2020] Junting Dong et al., Motion Capture from Internet Videos
  • [ICCV2019], Nitin Saini et al., Markerless Outdoor Human Motion Capture Using Multiple Autonomous Micro Aerial Vehicles
  • [ICCV2019], Haibo Qiu et al., Cross View Fusion for 3D Human Pose Estimation
  • [ICCV2019], Junbang Liang et al., Shape-Aware Human Pose and Shape Reconstruction Using Multi-View Images
  • [ICCV2019], Karim Iskakov et al., Learnable Triangulation of Human Pose
  • [CVPR2020], Yuxiang Zhang et al., 4D Association Graph for Realtime Multi-Person Motion Capture Using Multiple Video Cameras
  • [CVPR2019], Junting Dong et al., Fast and Robust Multi-Person 3D Pose Estimation From Multiple Views

Multi-person

  • [CVPR2021] Yu Cheng et al., Monocular 3D Multi-Person Pose Estimation by Integrating Top-Down and Bottom-Up Networks
  • [AAAI2021] Yu Cheng et al., Graph and Temporal Convolutional Networks for 3D Multi-person Pose Estimation in Monocular Videos
  • [ECCV2020] Jianan Zhen et al., SMAP: Single-Shot Multi-Person Absolute 3D Pose Estimation
  • [ECCV2020] Jogendra Nath Kundu et al., Unsupervised Cross-Modal Alignment for Multi-Person 3D Pose Estimation
  • [ECCV2020] Jiefeng Li et al., HMOR: Hierarchical Multi-Person Ordinal Relations for Monocular Multi-Person 3D Pose Estimation
  • [ECCV2020] Jiahao Lin et al., HDNet: Human Depth Estimation for Multi-Person Camera-Space Localization
  • [CVPR2020] Mihai Fieraru et al., Three-dimensional Reconstruction of Human Interactions
  • [(SIGGRAPH) 2020] Dushyant Mehta et al., XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera
  • [CVPR2020] Matteo Fabbri et al., Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation
  • [CVPR2020] Abdallah Benzine et al., PandaNet : Anchor-Based Single-Shot Multi-Person 3D Pose Estimation
  • [CVPR2020] Wen Jiang et al., Coherent Reconstruction of Multiple Humans From a Single Image
  • [ICCV 2019] Gyeongsik Moon et al., Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image

Other input

  • [CVPR2021] Vladimir Guzov et al., Human POSEitioning System (HPS): 3D Human Pose Estimation and Self- localization in Large Scenes from Body-Mounted Sensors

  • [CVPR2021] Qi Fang et al., Reconstructing 3D Human Pose by Watching Humans in the Mirror

  • [CVPR2021] Yiyue Luo et al., Intelligent Carpet: Inferring 3D Human Pose from Tactile Signals

  • [ICCV2019], Haiyong Jiang et al., Skeleton-Aware 3D Human Shape Reconstruction From Point Clouds

  • [ICCV2019], Denis Tome et al., xR-EgoPose: Egocentric 3D Human Pose from an HMD Camera

Detailed geometry

  • [CVPR2021], Yan Zhang et al., We are More than Our Joints: Predicting how 3D Bodies Move

  • [CVPR2021], Jiefeng Li et al., HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation

  • [ECCV2020] CLOTH3D: Clothed 3D Humans

  • [ECCV2020] Combining Implicit Function Learning and Parametric Models for 3D Human Reconstruction

  • [ECCV2020] Luyang Zhu et al., Reconstructing NBA Players

  • [CVPR2020] Shunsuke Saito et al., PIFuHD: Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization

  • [CVPR2020] Marc Habermann et al., DeepCap: Monocular Human Performance Capture Using Weak Supervision

  • [CVPR2020] Shunsuke Saito et al., PIFuHD: Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization

  • [CVPR2020] Feitong Tan et al., Self-Supervised Human Depth Estimation from Monocular Videos

  • [CVPR2020] Zeng Huang et al., ARCH: Animatable Reconstruction of Clothed Humans

  • [CVPR2020] Hayato Onizuka et al., TetraTSDF: 3D human reconstruction from a single image with a tetrahedral outer shell

  • [CVPR2020] Yong-Lu Li et al., Detailed 2D-3D Joint Representation for Human-Object Interaction

  • [ICCV2019] Shunsuke Saito et al., PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization

  • [ICCV2019], Zerong Zheng et al., DeepHuman: 3D Human Reconstruction From a Single Image

  • [ICCV2019], Albert Pumarola et al., 3DPeople: Modeling the Geometry of Dressed Humans

  • [ICCV19] Thiemo Alldieck et al., Tex2Shape: Detailed Full Human Body Geometry From a Single Image

  • [ICCV19] Sicong Tang et al., A Neural Network for Detailed Human Depth Estimation From a Single Image

  • [CVPR19 Oral] Ryota Natsume et al., SiCloPe: Silhouette-Based Clothed People

  • [CVPR19 Oral] Nikos Kolotouros et al.,Convolutional Mesh Regression for Single-Image Human Shape Reconstruction

  • [CVPR19 Oral] Hao Zhu et al., Detailed Human Shape Estimation from a Single Image by Hierarchical mesh deformation

  • [CVPR2019], Thiemo Alldieck et al., Learning to Reconstruct People in Clothing from a Single RGB Camera

  • [CVPR19] Thiemo Alldieck et al., Learning to Reconstruct People in Clothing from a Single RGB Camera

  • [CVPR19] Tao Yu et al., SimulCap: Single-View Human Performance Capture with Cloth Simulation

  • [TOG 2019] Marc Habermann et al., LiveCap: Real-Time Human Performance Capture From Monocular Video

  • [ICCV2019] Valentin Gabeur et al., Moulding Humans: Non-parametric 3D Human Shape Estimation from Single Images

Self-supervise/ weak supervise

  • [CVPR2021] Bastian Wandt et al., CanonPose: Self-Supervised Monocular 3D Human Pose Estimation in the Wild

  • [CVPR2019] Muhammed Kocabas et al., Self-Supervised Learning of 3D Human Pose using Multi-view Geometry

  • [CVPR2019], Bastian Wandt et al., RepNet: Weakly Supervised Training of an Adversarial Reprojection Network for 3D Human Pose Estimation

Interaction with scene

  • [CVPR 2020] Yong-Lu Li et al., Detailed 2D-3D Joint Representation for Human-Object Interaction

  • [ICCV 2019] Mohamed Hassan et al., Resolving 3D Human Pose Ambiguities with 3D Scene Constraints

  • [ICCV 2019] Yixin Chen et al., Holistic++ Scene Understanding: Single-view 3D Holistic Scene Parsing and Human Pose Estimation with Human-Object Interaction and Physical Commonsense

Occlusion

  • [CVPR2020] Object-Occluded Human Shape and Pose Estimation from a Single Color Image
  • [ICCV2019] Yu Cheng et al., Occlusion-Aware Networks for 3D Human Pose Estimation in Video

Total capture

  • [CVPR2019] Donglai Xiang et al., Monocular Total Capture: Posing Face, Body and Hands in the Wild
  • [CVPR2019] Georgios Pavlakos et al., Expressive Body Capture: 3D Hands, Face, and Body from a Single Image

Future prediction

  • [ECCV2020] Long-term Human Motion Prediction with Scene Context
  • [ICCV2019] Zhi Li et al., On Boosting Single-Frame 3D Human Pose Estimation via Monocular Videos
  • [ICCV2019] A. Hernandez Ruiz et al., Human Motion Prediction via Spatio-Temporal Inpainting
  • [ICCV2019] Emre Aksan et al., Structured Prediction Helps 3D Human Motion Modelling
  • [ICCV2019] Wei Mao et al., Learning Trajectory Dependencies for Human Motion Prediction
  • [CVPR2019] Zhenguang Liu et al., Towards Natural and Accurate Future Motion Prediction of Humans and Animals
  • [CVPR2019] Anand Gopalakrishnan et al., A Neural Temporal Model for Human Motion Prediction

Others

  • [CVPR2021] Kehong Gong et al., PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation

  • [ICCV19] Yu Rong et al., Delving Deep Into Hybrid Annotations for 3D Human Recovery in the Wild

  • [CVPR19] Chung-Yi Weng et al., Photo Wake-Up: 3D Character Animation from a Single Phot

  • [CVPR2019] Rohit Pandey et al. Volumetric Capture of Humans with a Single RGBD Camera via Semi-Parametric Learning

3D-Hands

  • [CVPR2020],Weakly-Supervised Mesh-Convolutional Hand Reconstruction in the Wild

More Repositories

1

EasyMocap

Make human motion capture easier.
Python
3,279
star
2

LoFTR

Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
Jupyter Notebook
2,054
star
3

NeuralRecon

Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral
Python
1,913
star
4

4K4D

[CVPR 2024] 4K4D: Real-Time 4D View Synthesis at 4K Resolution
Python
1,417
star
5

snake

Code for "Deep Snake for Real-Time Instance Segmentation" CVPR 2020 oral
Jupyter Notebook
1,147
star
6

OnePose

Code for "OnePose: One-Shot Object Pose Estimation without CAD Models", CVPR 2022
Python
903
star
7

neuralbody

Code for "Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans" CVPR 2021 best paper candidate
Python
897
star
8

pvnet

Code for "PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation" CVPR 2019 oral
Jupyter Notebook
788
star
9

NeuralRecon-W

Code for "Neural 3D Reconstruction in the Wild", SIGGRAPH 2022 (Conference Proceedings)
Python
681
star
10

street_gaussians

Code for "Street Gaussians for Modeling Dynamic Urban Scenes"
576
star
11

mvpose

Code for "Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views" (CVPR 2019, T-PAMI 2021)
Jupyter Notebook
508
star
12

animatable_nerf

Code for "Animatable Implicit Neural Representations for Creating Realistic Avatars from Videos" TPAMI 2024, ICCV 2021
Python
488
star
13

manhattan_sdf

Code for "Neural 3D Scene Reconstruction with the Manhattan-world Assumption" CVPR 2022 Oral
Python
482
star
14

EasyVolcap

[SIGGRAPH Asia 2023 (Technical Communications)] EasyVolcap: Accelerating Neural Volumetric Video Research
Python
461
star
15

ENeRF

SIGGRAPH Asia 2022: Code for "Efficient Neural Radiance Fields for Interactive Free-viewpoint Video"
Python
400
star
16

DetectorFreeSfM

Code for "Detector-Free Structure from Motion", Arxiv Preprint
393
star
17

clean-pvnet

Code for "PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation" CVPR 2019 oral
C++
384
star
18

NeuMesh

Code for "MeuMesh: Learning Disentangled Neural Mesh-based Implicit Field for Geometry and Texture Editing", ECCV 2022 Oral
Python
374
star
19

AutoRecon

Code for "AutoRecon: Automated 3D Object Discovery and Reconstruction" CVPR 2023 (Highlight)
Python
341
star
20

OnePose_Plus_Plus

Code for "OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models" NeurIPS 2022
Python
329
star
21

object_nerf

Code for "Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering", ICCV 2021
Python
306
star
22

PVIO

Robust and Efficient Visual-Inertial Odometry with Multi-plane Priors
C++
298
star
23

Vox-Fusion

Code for "Dense Tracking and Mapping with Voxel-based Neural Implicit Representation", ISMAR 2022
Python
257
star
24

EfficientLoFTR

Jupyter Notebook
251
star
25

ENFT-SfM

This source code provides a reference implementation for ENFT-SfM.
C++
250
star
26

Wis3D

A web-based 3D visualization tool for 3D computer vision.
TypeScript
248
star
27

SMAP

Code for "SMAP: Single-Shot Multi-Person Absolute 3D Pose Estimation" (ECCV 2020)
Python
237
star
28

mlp_maps

Code for "Representing Volumetric Videos as Dynamic MLP Maps" CVPR 2023
Cuda
230
star
29

im4d

SIGGRAPH Asia 2023: Code for "Im4D: High-Fidelity and Real-Time Novel View Synthesis for Dynamic Scenes"
Python
226
star
30

disprcnn

Code release for Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation (CVPR 2020, TPAMI 2021)
Jupyter Notebook
209
star
31

PVO

code for "PVO: Panoptic Visual Odometry", CVPR 2023
Python
198
star
32

GIFT

Code for "GIFT: Learning Transformation-Invariant Dense Visual Descriptors via Group CNNs" NeurIPS 2019
Python
191
star
33

Mirrored-Human

Code for "Reconstructing 3D Human Pose by Watching Humans in the Mirror" (CVPR 2021 Oral)
184
star
34

pvnet-rendering

render images for pvnet training
Python
177
star
35

IntrinsicNeRF

code for "IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis", ICCV 2023
Python
174
star
36

InvRender

Code for "Modeling Indirect Illumination for Inverse Rendering", CVPR 2022
Python
165
star
37

EIBA

Efficient Incremental BA
C++
161
star
38

instant-nvr

[CVPR 2023] Code for "Learning Neural Volumetric Representations of Dynamic Humans in Minutes"
Python
144
star
39

eval-vislam

Toolkit for VI-SLAM evaluation.
C++
137
star
40

SINE

Code for "Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field", CVPR 2023
Python
123
star
41

rnin-vio

Python
116
star
42

deltar

Code for "DELTAR: Depth Estimation from a Light-weight ToF Sensor And RGB Image", ECCV 2022
Python
112
star
43

NeuSC

A Temporal Voyage: Code for "Neural Scene Chronology" [CVPR 2023]
Python
111
star
44

DeFlowSLAM

code for "DeFlowSLAM: Self-Supervised Scene Motion Decomposition for Dynamic Dense SLAM"
109
star
45

SegmentBA

Segment based Bundle Adjustment
C++
108
star
46

CoLi-BA

C++
107
star
47

iMoCap

dataset for ECCV 2020 "Motion Capture from Internet Videos"
Python
104
star
48

VS-Net

VS-Net: Voting with Segmentation for Visual Localization
Python
86
star
49

UDOLO

Python
84
star
50

pats

Code for "PATS: Patch Area Transportation with Subdivision for Local Feature Matching", CVPR 2023
C++
84
star
51

SA-HMR

Code for "Learning Human Mesh Recovery in 3D Scenes" CVPR 2023
Python
79
star
52

ENFT

Efficient Non-Consecutive Feature Tracking for Robust SfM http://www.zjucvg.net/ls-acts/ls-acts.html
C++
76
star
53

TotalSelfScan

Code for "TotalSelfScan: Learning Full-body Avatars from Self-Portrait Videos of Faces, Hands, and Bodies" (NeurIPS 2022)
Python
73
star
54

SAM-Graph

Code for "SAM-guided Graph Cut for 3D Instance Segmentation"
69
star
55

gcasp

[CoRL 2022] Generative Category-Level Shape and Pose Estimation with Semantic Primitives
Python
66
star
56

GeneAvatar

Code for "GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image", CVPR 2024
59
star
57

zju3dv.github.io

HTML
57
star
58

vig-init

Rapid and Robust Monocular Visual-Inertial Initialization with Gravity Estimation via Vertical Edges
C++
56
star
59

coxgraph

Code for "Coxgraph: Multi-Robot Collaborative, Globally Consistent, Online Dense Reconstruction System", IROS 2021 Best Paper Award Finalist on Safety, Security, and Rescue Robotics in memory of Motohiro Kisoi
C++
54
star
60

RVL-Dynamic

Code for "Prior Guided Dropout for Robust Visual Localization in Dynamic Environments" in ICCV 2019
Python
47
star
61

Vox-Surf

Code for "Vox-Surf: Voxel-based Implicit Surface Representation", TVCG 2022
Python
46
star
62

NIID-Net

Code for "NIID-Net: Adapting Surface Normal Knowledge for Intrinsic Image Decomposition in Indoor Scenes" TVCG
Python
43
star
63

hghoi

ICCV 2023, Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models
C++
43
star
64

RLP_VIO

Code for "RLP-VIO: Robust and lightweight plane-based visual-inertial odometry for augmented reality, CAVW 2022
C++
42
star
65

Mirror-NeRF

Code for "Mirror-NeRF: Learning Neural Radiance Fields for Mirrors with Whitted-Style Ray Tracing", ACM MM 2023
Python
37
star
66

AutoDecomp

3D object discovery from casual object captures
HTML
36
star
67

RelightableAvatar

[CVPR 2024 (Highlight)] Relightable and Animatable Neural Avatar from Sparse-View Video
Python
35
star
68

CloseMoCap

Official implementation of "Reconstructing Close Human Interaction from Multiple Views"
33
star
69

poking_perception

Python
29
star
70

MagLoc-AR

14
star
71

MVN-AFM

Code for "Multi-View Neural 3D Reconstruction of Micro-/Nanostructures with Atomic Force Microscopy"
Python
11
star
72

blink_sim

11
star
73

pvnet-depth-sup

10
star
74

hybrid3d

C++
10
star
75

nr_in_a_room

Code for "Neural Rendering in a Room: Amodal 3D Understanding and Free-Viewpoint Rendering for the Closed Scene Composed of Pre-Captured Objects", ACM ToG
Python
10
star
76

RNNPose

RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization, CVPR 2022
6
star
77

rnin-vio.github.io

CSS
2
star
78

LSFB

1
star