Human Motion Capture

Collecting papers about human motion capture

⭐🔥💪 I will update each day and add more details about every paper ☀️☀️☀️

NOTE : Since motion capture is a field that has been studied for many years, I will collect the up to date papers first and then try to gather the old but classical one.

methods based directly on video

Physically Plausible Monocular 3D Motion Capture [project]
"Neural PhysCap" Neural Monocular 3D Human Motion Capture with Physical Awareness [project]
XNect: real-time multi-person 3D motion capture with a single RGB camera[project]
4D Association Graph for Realtime Multi-person Motion Capture Using Multiple Video Cameras [code]
VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera [project]
TesseTrack: End-to-End Learnable Multi-Person Articulated 3D Pose Tracking [project]
Synergetic Reconstruction from 2D Pose and 3D Motion for Wide-Space Multi-Person Video Motion Capture in the Wild [project]
MonoPerfCap: Human Performance Capture from Monocular Video [project]
Capturing Detailed Deformations of Moving Human Bodies [project]
Lightweight Multi-person Total Motion Capture Using Sparse Multi-view Cameras [project]
Learning Dynamical Human-Joint Affinity for 3D Pose Estimation in Videos [pdf]
Direct Multi-view Multi-person 3D Human Pose Estimation [code]
Human Performance Capture from Monocular Video in the Wild [pdf] | [code]
Camera Distortion-aware 3D Human Pose Estimation in Video with Optimization-based Meta-Learning[pdf] | [code]
Human Dynamics from Monocular Video with Dynamic Camera Movements [code] | [project]
AirPose: Multi-View Fusion Network for Aerial 3D Human Pose and Shape Estimation [code]
Neural MoCon: Neural Motion Control for Physically Plausible Human Motion Capture [project]
Permutation-Invariant Relational Network for Multi-person 3D Pose Estimation [pdf]
PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision [code]
Generalizable Human Pose Triangulation [code]
Differentiable Dynamics for Articulated 3d Human Motion Reconstruction [pdf]
VTP: Volumetric Transformer for Multi-view Multi-person 3D Pose Estimation [pdf]
Trajectory Optimization for Physics-Based Reconstruction of 3d Human Pose from Monocular Video [pdf]
Learning Variational Motion Prior for Video-based Motion Capture [pdf]
Accurate and Efficient Absolute 3D Human Pose Estimation Trained on Dozens of Datasets [project]
GFPose: Learning 3D Human Pose Prior with Gradient Fields [[project]
DiffPose: Toward More Reliable 3D Pose Estimation [pdf] | [project]
Learning 3D Human Pose Estimation From Dozens of Datasets Using a Geometry-Aware Autoencoder To Bridge Between Skeleton Formats [project]
Scene-Aware 3D Multi-Human Motion Capture from a Single Camera [project]
Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene Decomposition [project]
3D Human Pose Estimation via Intuitive Physics [project]
GraMMaR: Ground-aware Motion Model for 3D Human Motion Reconstruction [project]
Real-time Monocular Full-body Capture in World Space via Sequential Proxy-to-Motion Learning [project]
Back to Optimization: Diffusion-based Zero-Shot 3D Human Pose Estimation [pdf]

methods based on body model such as smpl/smpl-x

Note that there are some papers that I don't list here most because I have tested it and the result is not so good(e,g. frankmocap,VIBE and so on)

Contact and Human Dynamics from Monocular Video [code]
DeepMultiCap: Performance Capture of Multiple Characters Using Sparse Multiview Cameras [project]
Monocular, One-stage, Regression of Multiple 3D People [code]
BEV:Putting People in their Place,Monocular Regression of 3D People in Depth [project]
PyMAF: 3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop [project]
End-to-End Human Pose and Mesh Reconstruction with Transformers [code]
Pose2Pose: 3D Positional Pose-Guided 3D Rotational Pose Prediction for Expressive 3D Human Pose and Mesh Estimation[paper] | [code]
AGORA: Avatars in Geography Optimized for Regression Analysis [code]
Real-time RGBD-based Extended Body Pose Estimation [code]
EasyMocap [code]
PanoMan: Sparse Localized Components–based Model for Full Human Motions [paper]
Monocular Total Capture: Posing Face, Body and Hands in the Wild [code]
Full-body motion capture for multiple closely interacting persons [paper]
PARE: Part Attention Regressor for 3D Human Body Estimation [project] | [code]
DSFN: Dynamic Surface Function Networks for Clothed Human Bodies [code]
3D Human Pose and Shape Estimation Through Collaborative Learning and Multi-view Model-fitting [code]
On Self-Contact and Human Pose [project]
KAMA: 3D Keypoint Aware Body Mesh Articulation [paper] | [video]
Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction [code]
Multi-person Implicit Reconstruction from a Single Image [paper] | [video]
Body Meshes as Points [code]
Collaborative Regression of Expressive Bodies using Moderation [project]
Deep3DPose: Realtime Reconstruction of Arbitrarily Posed Human Bodies from Single RGB Images [paper]
RobustFusion: Robust Volumetric Performance Reconstruction under Human-object Interactions from Monocular RGBD Stream [paper]
Monocular Real-time Full Body Capture with Inter-part Correlations [project]
Learning Complex 3D Human Self-Contact [pdf]
Learning Local Recurrent Models for Human Mesh Recovery [pdf]
LASOR: Learning Accurate 3D Human Pose and Shape Via Synthetic Occlusion-Aware Data and Neural Mesh Rendering [pdf]
HuMoR: 3D Human Motion Model for Robust Pose Estimation [code]
Probabilistic Modeling for Human Mesh Recovery [code]
NeMF: Neural Motion Fields for Kinematic Animation [pdf]
Encoder-decoder with Multi-level Attention for 3D Human Shape and PoseEstimation [code]
LEMO: Learning Motion Priors for 4D Human Body Capture in 3D Scenes [ptoject]
Shape-aware Multi-Person Pose Estimation from Multi-View Images [project]
SPEC: Seeing People in the Wild with an Estimated Camera [project]
Learning to Regress Bodies from Images using Differentiable Semantic Rendering [code]
Mesh Graphormer [code]
VoteHMR: Occlusion-Aware Voting Network for Robust 3D Human Mesh Recovery from Partial Point Clouds [pdf] | [code]
Leveraging MoCap Data for Human Mesh Recovery [pdf]
PIXIE: Collaborative Regression of Expressive Bodies [code]
Dynamic Multi-Person Mesh Recovery From Uncalibrated Multi-View Cameras [pdf] }| [code]
Hierarchical Kinematic Probability Distributions for 3D Human Shape and Pose Estimation from Images in the Wild [code]
Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online Adaptation [code]
Tracking People with 3D Representations [code]
LatentHuman: Shape-and-Pose Disentangled Latent Representation for for Human Bodies[project]
PoseBERT [project]
Camera Motion Agnostic 3D Human Pose Estimation [code]
GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras [project]
Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation [project]
ICON: Implicit Clothed humans Obtained from Normals [code]
Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video [project]
Occluded Human Mesh Recovery [pdf]
Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online Adaptation [code]
Accurate 3D Hand Pose Estimation for Whole-Body 3D Human Mesh Estimation [code]
CHOMP: Occluded Human Body Capture with Self-Supervised Spatial-Temporal Motion Prior [code]
PyMAF-X: Towards Well-aligned Full-body Model Regression from Monocular Images [project]
Benchmarking 3D Pose and Shape Estimation Beyond Algorithms [code]
Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers [code]
CLIFF: Carrying Location Information in Full Frames into Human Pose and Shape Estimation [code]
Learning Visibility for Robust Dense Human Body Estimation [code]
XRMocap - multi-view motion capture [code]
CLIFF: Carrying Location Information in Full Frames into Human Pose and Shape Estimation [code]
PLIKS: A Pseudo-Linear Inverse Kinematic Solver for 3D Human Body Estimation [pdf]
FuRPE: Learning Full-body Reconstruction from Part Experts [code]
NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action [project]
IKOL: Inverse kinematics optimization layer for 3D human pose and shape estimation via Gauss-Newton differentiation [project]
Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers [project]
Decoupling Human and Camera Motion from Videos in the Wild [project]
Capturing the motion of every joint: 3D human pose and shape estimation with independent tokens [project]
PoseExaminer: Automated Testing of Out-of-Distribution Robustness in Human Pose and Shape Estimation [code]
GATOR: Graph-Aware Transformer with Offset-Disentangled Regression for Human Mesh Reconstruction from a 2D Pose [project]
BoPR: Body-aware Part Regressor for Human Shape and Pose Estimation [pdf]
Contact, Human and Object REconstruction from a single RGB image [code]
One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer [project]
EgoHMR: Probabilistic Human Mesh Recovery in 3D Scenes from Egocentric Views [project]
HybrIK-X: Hybrid Analytical-Neural Inverse Kinematics for Whole-body Mesh Recovery [project]
Learning Analytical Posterior Probability for Human Mesh Recovery [project]
Sampling is Matter: Point-guided 3D Human Mesh Reconstruction [code]
SHOW: Synchronous HOlistic body in the Wild [code]
HuManiFlow: Ancestor-Conditioned Normalising Flows on SO(3) Manifolds for Human Pose and Shape Distribution Estimation [code]
NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D Human Pose and Shape Estimation [code]
XFormer: Fast and Accurate Monocular 3D Body Capture [pdf]
IKOL: Inverse kinematics optimization layer for 3D human pose and shape estimation via Gauss-Newton differentiation [code]
Learning Analytical Posterior Probability for Human Mesh Recovery [code]
Humans in 4D: Reconstructing and Tracking Humans with Transformers [code]
NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D HPS [code]
Implicit 3D Human Mesh Recovery using Consistency with Pose and Shape from Unseen-view [pdf]
BEDLAM: Bodies Exhibiting Detailed Lifelike Animated Motion [project]
MPM: A Unified 2D-3D Human Pose Representation via Masked Pose Modeling [code]
Tracking People by Predicting 3D Appearance, Location & Pose [code]
Learning 3D Human Shape and Pose from Dense Body Parts [code]
Body Knowledge and Uncertainty Modeling for Monocular 3D Human Body Reconstruction [pdf]
JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery [code]
Semantify:Simplifying the Control of 3D Morphable Models using CLIP [code]

human 3d pose estimation

Freemocap [code]
MeTRAbs: Metric-Scale Truncation-Robust Heatmaps for Absolute 3D Human Pose Estimation [code]
Part-aware Measurement for Robust Multi-View Multi-Human 3D Pose Estimation and Tracking [code]
Multi-View Multi-Person 3D Pose Estimation with Plane Sweep Stereo [code]
3D Multi-Person Pose Estimation by Integrating Top-Down and Bottom-Up Networks [code]
CanonPose: Self-Supervised Monocular 3D Human Pose Estimation in the Wild [paper]
3D Human Pose Estimation with Spatial and Temporal Transformers [code]
Weakly-Supervised 3D Human Pose Learning via Multi-view Images in the Wild [paper]
AdaFuse: Adaptive Multiview Fusion for Accurate Human Pose Estimation in the Wild [code]
Improving Robustness and Accuracy via Relative Information Encoding in 3D Human Pose Estimation [code]
Probabilistic-Monocular-3D-Human-Pose-Estimation-with-Normalizing-Flows [code]
VoxelTrack: Multi-Person 3D Human Pose Estimation and Tracking in the Wild [pdf]
MetaPose: Fast 3D Pose from Multiple Views without 3D Supervision [pdf]
CrossFormer: Cross Spatio-Temporal Transformer for 3D Human Pose Estimation[pdf]
Uncertainty-Aware Adaptation for Self-Supervised 3D Human Pose Estimation [project]
Ray3D: ray-based 3D human pose estimation for monocular absolute 3D localization [code]
PedRecNet: Multi-task deep neural network for full 3D human pose and orientation estimation [code]
Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation [code]
PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision [code]
A Dual-Masked Auto-Encoder for Robust Motion Capture with Spatial-Temporal Skeletal Token Completion [pdf]
DeciWatch: A Simple Baseline for 10× Efficient 2D and 3D Pose Estimation [project]
VirtualPose: Learning Generalizable 3D Human Pose Models from Virtual Data [code]
Faster VoxelPose: Real-time 3D Human Pose Estimation by Orthographic Projection [pdf]
KinePose: A temporally optimized inverse kinematics technique for 6DOF human pose estimation with biomechanical constraint [code]
A Dual-Masked Auto-Encoder for Robust Motion Capture with Spatial-Temporal Skeletal Token Completion [code]
HDFormer: High-order Directed Transformer for 3D Human Pose Estimation [pdf]
HTNet: Human Topology Aware Network for 3D Human Pose Estimation [project]
Proactive Multi-Camera Collaboration for 3D Human Pose Estimation [project]
GFPose: Learning 3D Human Pose Prior with Gradient Fields [project]
Diffusion-Based 3D Human Pose Estimation with Multi-Hypothesis Aggregation [code]
Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation [pdf]
PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation [project]
PoseRAC: Pose Saliency Transformer for Repetitive Action Counting [code]
CrossFormer: Cross Spatio-Temporal Transformer for 3D Human Pose Estimation [pdf]
Motion-DVAE: Unsupervised learning for fast human motion denoising [project]
Iterative Graph Filtering Network for 3D Human Pose Estimation [code]

simplify optical or inertial based motion capture

Real-Time Multi-person Motion Capture from Multi-view Video and IMUs[paper]
TransPose [code]
Physical Inertial Poser (PIP):Physics-aware Real-time Human Motion Tracking from Sparse Inertial Sensors [project]
EM-POSE: 3D Human Pose Estimation from Sparse Electromagnetic Trackers [code]
HybridCap: Inertia-aid Monocular Capture of Challenging Human Motions [pdf]
Transformer Inertial Poser: Attention-based Real-time Human Motion Reconstruction from Sparse IMUs [project]
DeMoCap: Low-cost Marker-based Motion Capture [code]
LiDAR-aid Inertial Poser: Large-scale Human Motion Capture by Sparse Inertial and LiDAR Sensors [pdf]
AvatarPoser: Articulated Full-Body Pose Tracking from Sparse Motion Sensing [code]
HybridTrak: Adding Full-Body Tracking to VR Using an Off-the-Shelf Webcam [video]
FusePose: IMU-Vision Sensor Fusion in Kinematic Space for Parametric Human Pose Estimation [pdf]
Fusion Poser: 3D Human Pose Estimation using Sparse IMUs and Head Tracker in real-time [code]
QuestSim: Human Motion Tracking from Sparse Sensors with Simulated Avatars [pdf]
TIP: Real-time Human Motion Reconstruction from Sparse IMUs with Simultaneous Terrain Generation [pdf]
FLAG: Flow-based 3D Avatar Generation from Sparse Observations [project]
Combining Motion Matching and Orientation Prediction to Animate Avatars for Consumer-Grade VR Devices [project]
Faster Deep Inertial Pose Estimation with Six Inertial Sensors [code]
Neural3Points: Learning to Generate Physically Realistic Full-body Motion for Virtual Reality Users [pdf]

human motion capture in 3d scene

4D Human Body Capture from Egocentric Video via 3D Scene Grounding [project]
LEMO Learning Motion Priors for 4D Human Body Capture in 3D Scenes [project]
Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-localization in Large Scenes from Body-Mounted Sensors [project]
EgoBody Dataset:Human Body Shape, Motion and Social Interactions from Head-Mounted Devices[project]
GOAL: Generating 4D Whole-Body Motion for Hand-Object Grasping [project]
HSC4D: Human-centered 4D Scene Capture in Large-scale Indoor-outdoor Space Using Wearable IMUs and LiDAR [project]
Human-Aware Object Placement for Visual Environment Reconstruction [project]
COAP: Compositional Articulated Occupancy of People [project]
BEHAVE: Dataset and Method for Tracking Human Object Interactions [project]
Synthesizing Long-Term 3D Human Motion and Interaction in 3D [project]
Visually plausible human-object interaction capture from wearable sensors [project]
HULC: 3D HUman Motion Capture with Pose Manifold Sampling and Dense Contact Guidance [project]
Towards Diverse and Natural Scene-aware 3D Human Motion Synthesis [pdf]
MoCapDeform: Monocular 3D Human Motion Capture in Deformable Scenes [project]
The One Where They Reconstructed 3D Humans and Environments in TV Shows [project]
Stochastic Scene-Aware Motion Prediction [code]
COINS : Compositional Human-Scene Interaction Synthesis with Semantic Control [projects]
Placing Human Animations into 3D Scenes by Learning Interaction- and Geometry-Driven Keyframes [[project]
InterCap: Joint Markerless 3D Tracking of Humans and Objects in Interaction [project]
HUMANISE: Language-conditioned Human Motion Generation in 3D Scenes [project]
SLOPER4D: A Scene-Aware Dataset for Global 4D Human Pose Estimation in Urban Environments [code]
EgoLocate:Real-time Motion Capture, Localization, and Mapping with Sparse Body-mounted Sensors [project]
Total-Recon: Deformable Scene Reconstruction for Embodied View Synthesis [project]
Learning Human Mesh Recovery in 3D Scenes [project]
MIME: Human-Aware 3D Scene Generation [project]
TRACE: 5D Temporal Regression of Avatars with Dynamic Cameras in 3D Environments [project]
CIMI4D: A Large Multimodal Climbing Motion Dataset under Human-scene Interactions [project]
IMoS: Intent-Driven Full-Body Motion Synthesis for Human-Object Interactions [code]

visonpon/human-motion-capture

visonpon

Reviews

Repository Details