Human Motion Capture
Collecting papers about human motion capture
โญ๐ฅ๐ช I will update each day and add more details about every paper โ๏ธโ๏ธโ๏ธ
NOTE : Since motion capture is a field that has been studied for many years, I will collect the up to date papers first and then try to gather the old but classical one.
methods based directly on video
- Physically Plausible Monocular 3D Motion Capture [project]
- "Neural PhysCap" Neural Monocular 3D Human Motion Capture with Physical Awareness [project]
- XNect: real-time multi-person 3D motion capture with a single RGB camera[project]
- 4D Association Graph for Realtime Multi-person Motion Capture Using Multiple Video Cameras [code]
- VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera [project]
- TesseTrack: End-to-End Learnable Multi-Person Articulated 3D Pose Tracking [project]
- Synergetic Reconstruction from 2D Pose and 3D Motion for Wide-Space Multi-Person Video Motion Capture in the Wild [project]
- MonoPerfCap: Human Performance Capture from Monocular Video [project]
- Capturing Detailed Deformations of Moving Human Bodies [project]
- Lightweight Multi-person Total Motion Capture Using Sparse Multi-view Cameras [project]
- Learning Dynamical Human-Joint Affinity for 3D Pose Estimation in Videos [pdf]
- Direct Multi-view Multi-person 3D Human Pose Estimation [code]
- Human Performance Capture from Monocular Video in the Wild [pdf] | [code]
- Camera Distortion-aware 3D Human Pose Estimation in Video with Optimization-based Meta-Learning[pdf] | [code]
- Human Dynamics from Monocular Video with Dynamic Camera Movements [code] | [project]
- AirPose: Multi-View Fusion Network for Aerial 3D Human Pose and Shape Estimation [code]
- Neural MoCon: Neural Motion Control for Physically Plausible Human Motion Capture [project]
- Permutation-Invariant Relational Network for Multi-person 3D Pose Estimation [pdf]
- PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision [code]
- Generalizable Human Pose Triangulation [code]
- Differentiable Dynamics for Articulated 3d Human Motion Reconstruction [pdf]
- VTP: Volumetric Transformer for Multi-view Multi-person 3D Pose Estimation [pdf]
- Trajectory Optimization for Physics-Based Reconstruction of 3d Human Pose from Monocular Video [pdf]
- Learning Variational Motion Prior for Video-based Motion Capture [pdf]
- Accurate and Efficient Absolute 3D Human Pose Estimation Trained on Dozens of Datasets [project]
- GFPose: Learning 3D Human Pose Prior with Gradient Fields [[project]
- DiffPose: Toward More Reliable 3D Pose Estimation [pdf] | [project]
- Learning 3D Human Pose Estimation From Dozens of Datasets Using a Geometry-Aware Autoencoder To Bridge Between Skeleton Formats [project]
- Scene-Aware 3D Multi-Human Motion Capture from a Single Camera [project]
- Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene Decomposition [project]
- 3D Human Pose Estimation via Intuitive Physics [project]
- GraMMaR: Ground-aware Motion Model for 3D Human Motion Reconstruction [project]
- Real-time Monocular Full-body Capture in World Space via Sequential Proxy-to-Motion Learning [project]
- Back to Optimization: Diffusion-based Zero-Shot 3D Human Pose Estimation [pdf]
methods based on body model such as smpl/smpl-x
Note that there are some papers that I don't list here most because I have tested it and the result is not so good(e,g. frankmocap,VIBE and so on)
- Contact and Human Dynamics from Monocular Video [code]
- DeepMultiCap: Performance Capture of Multiple Characters Using Sparse Multiview Cameras [project]
- Monocular, One-stage, Regression of Multiple 3D People [code]
- BEV:Putting People in their Place,Monocular Regression of 3D People in Depth [project]
- PyMAF: 3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop [project]
- End-to-End Human Pose and Mesh Reconstruction with Transformers [code]
- Pose2Pose: 3D Positional Pose-Guided 3D Rotational Pose Prediction for Expressive 3D Human Pose and Mesh Estimation[paper] | [code]
- AGORA: Avatars in Geography Optimized for Regression Analysis [code]
- Real-time RGBD-based Extended Body Pose Estimation [code]
- EasyMocap [code]
- PanoMan: Sparse Localized Componentsโbased Model for Full Human Motions [paper]
- Monocular Total Capture: Posing Face, Body and Hands in the Wild [code]
- Full-body motion capture for multiple closely interacting persons [paper]
- PARE: Part Attention Regressor for 3D Human Body Estimation [project] | [code]
- DSFN: Dynamic Surface Function Networks for Clothed Human Bodies [code]
- 3D Human Pose and Shape Estimation Through Collaborative Learning and Multi-view Model-fitting [code]
- On Self-Contact and Human Pose [project]
- KAMA: 3D Keypoint Aware Body Mesh Articulation [paper] | [video]
- Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction [code]
- Multi-person Implicit Reconstruction from a Single Image [paper] | [video]
- Body Meshes as Points [code]
- Collaborative Regression of Expressive Bodies using Moderation [project]
- Deep3DPose: Realtime Reconstruction of Arbitrarily Posed Human Bodies from Single RGB Images [paper]
- RobustFusion: Robust Volumetric Performance Reconstruction under Human-object Interactions from Monocular RGBD Stream [paper]
- Monocular Real-time Full Body Capture with Inter-part Correlations [project]
- Learning Complex 3D Human Self-Contact [pdf]
- Learning Local Recurrent Models for Human Mesh Recovery [pdf]
- LASOR: Learning Accurate 3D Human Pose and Shape Via Synthetic Occlusion-Aware Data and Neural Mesh Rendering [pdf]
- HuMoR: 3D Human Motion Model for Robust Pose Estimation [code]
- Probabilistic Modeling for Human Mesh Recovery [code]
- NeMF: Neural Motion Fields for Kinematic Animation [pdf]
- Encoder-decoder with Multi-level Attention for 3D Human Shape and PoseEstimation [code]
- LEMO: Learning Motion Priors for 4D Human Body Capture in 3D Scenes [ptoject]
- Shape-aware Multi-Person Pose Estimation from Multi-View Images [project]
- SPEC: Seeing People in the Wild with an Estimated Camera [project]
- Learning to Regress Bodies from Images using Differentiable Semantic Rendering [code]
- Mesh Graphormer [code]
- VoteHMR: Occlusion-Aware Voting Network for Robust 3D Human Mesh Recovery from Partial Point Clouds [pdf] | [code]
- Leveraging MoCap Data for Human Mesh Recovery [pdf]
- PIXIE: Collaborative Regression of Expressive Bodies [code]
- Dynamic Multi-Person Mesh Recovery From Uncalibrated Multi-View Cameras [pdf] }| [code]
- Hierarchical Kinematic Probability Distributions for 3D Human Shape and Pose Estimation from Images in the Wild [code]
- Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online Adaptation [code]
- Tracking People with 3D Representations [code]
- LatentHuman: Shape-and-Pose Disentangled Latent Representation for for Human Bodies[project]
- PoseBERT [project]
- Camera Motion Agnostic 3D Human Pose Estimation [code]
- GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras [project]
- Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation [project]
- ICON: Implicit Clothed humans Obtained from Normals [code]
- Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video [project]
- Occluded Human Mesh Recovery [pdf]
- Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online Adaptation [code]
- Accurate 3D Hand Pose Estimation for Whole-Body 3D Human Mesh Estimation [code]
- CHOMP: Occluded Human Body Capture with Self-Supervised Spatial-Temporal Motion Prior [code]
- PyMAF-X: Towards Well-aligned Full-body Model Regression from Monocular Images [project]
- Benchmarking 3D Pose and Shape Estimation Beyond Algorithms [code]
- Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers [code]
- CLIFF: Carrying Location Information in Full Frames into Human Pose and Shape Estimation [code]
- Learning Visibility for Robust Dense Human Body Estimation [code]
- XRMocap - multi-view motion capture [code]
- CLIFF: Carrying Location Information in Full Frames into Human Pose and Shape Estimation [code]
- PLIKS: A Pseudo-Linear Inverse Kinematic Solver for 3D Human Body Estimation [pdf]
- FuRPE: Learning Full-body Reconstruction from Part Experts [code]
- NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action [project]
- IKOL: Inverse kinematics optimization layer for 3D human pose and shape estimation via Gauss-Newton differentiation [project]
- Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers [project]
- Decoupling Human and Camera Motion from Videos in the Wild [project]
- Capturing the motion of every joint: 3D human pose and shape estimation with independent tokens [project]
- PoseExaminer: Automated Testing of Out-of-Distribution Robustness in Human Pose and Shape Estimation [code]
- GATOR: Graph-Aware Transformer with Offset-Disentangled Regression for Human Mesh Reconstruction from a 2D Pose [project]
- BoPR: Body-aware Part Regressor for Human Shape and Pose Estimation [pdf]
- Contact, Human and Object REconstruction from a single RGB image [code]
- One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer [project]
- EgoHMR: Probabilistic Human Mesh Recovery in 3D Scenes from Egocentric Views [project]
- HybrIK-X: Hybrid Analytical-Neural Inverse Kinematics for Whole-body Mesh Recovery [project]
- Learning Analytical Posterior Probability for Human Mesh Recovery [project]
- Sampling is Matter: Point-guided 3D Human Mesh Reconstruction [code]
- SHOW: Synchronous HOlistic body in the Wild [code]
- HuManiFlow: Ancestor-Conditioned Normalising Flows on SO(3) Manifolds for Human Pose and Shape Distribution Estimation [code]
- NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D Human Pose and Shape Estimation [code]
- XFormer: Fast and Accurate Monocular 3D Body Capture [pdf]
- IKOL: Inverse kinematics optimization layer for 3D human pose and shape estimation via Gauss-Newton differentiation [code]
- Learning Analytical Posterior Probability for Human Mesh Recovery [code]
- Humans in 4D: Reconstructing and Tracking Humans with Transformers [code]
- NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D HPS [code]
- Implicit 3D Human Mesh Recovery using Consistency with Pose and Shape from Unseen-view [pdf]
- BEDLAM: Bodies Exhibiting Detailed Lifelike Animated Motion [project]
- MPM: A Unified 2D-3D Human Pose Representation via Masked Pose Modeling [code]
- Tracking People by Predicting 3D Appearance, Location & Pose [code]
- Learning 3D Human Shape and Pose from Dense Body Parts [code]
- Body Knowledge and Uncertainty Modeling for Monocular 3D Human Body Reconstruction [pdf]
- JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery [code]
- Semantify:Simplifying the Control of 3D Morphable Models using CLIP [code]
human 3d pose estimation
- Freemocap [code]
- MeTRAbs: Metric-Scale Truncation-Robust Heatmaps for Absolute 3D Human Pose Estimation [code]
- Part-aware Measurement for Robust Multi-View Multi-Human 3D Pose Estimation and Tracking [code]
- Multi-View Multi-Person 3D Pose Estimation with Plane Sweep Stereo [code]
- 3D Multi-Person Pose Estimation by Integrating Top-Down and Bottom-Up Networks [code]
- CanonPose: Self-Supervised Monocular 3D Human Pose Estimation in the Wild [paper]
- 3D Human Pose Estimation with Spatial and Temporal Transformers [code]
- Weakly-Supervised 3D Human Pose Learning via Multi-view Images in the Wild [paper]
- AdaFuse: Adaptive Multiview Fusion for Accurate Human Pose Estimation in the Wild [code]
- Improving Robustness and Accuracy via Relative Information Encoding in 3D Human Pose Estimation [code]
- Probabilistic-Monocular-3D-Human-Pose-Estimation-with-Normalizing-Flows [code]
- VoxelTrack: Multi-Person 3D Human Pose Estimation and Tracking in the Wild [pdf]
- MetaPose: Fast 3D Pose from Multiple Views without 3D Supervision [pdf]
- CrossFormer: Cross Spatio-Temporal Transformer for 3D Human Pose Estimation[pdf]
- Uncertainty-Aware Adaptation for Self-Supervised 3D Human Pose Estimation [project]
- Ray3D: ray-based 3D human pose estimation for monocular absolute 3D localization [code]
- PedRecNet: Multi-task deep neural network for full 3D human pose and orientation estimation [code]
- Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation [code]
- PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision [code]
- A Dual-Masked Auto-Encoder for Robust Motion Capture with Spatial-Temporal Skeletal Token Completion [pdf]
- DeciWatch: A Simple Baseline for 10ร Efficient 2D and 3D Pose Estimation [project]
- VirtualPose: Learning Generalizable 3D Human Pose Models from Virtual Data [code]
- Faster VoxelPose: Real-time 3D Human Pose Estimation by Orthographic Projection [pdf]
- KinePose: A temporally optimized inverse kinematics technique for 6DOF human pose estimation with biomechanical constraint [code]
- A Dual-Masked Auto-Encoder for Robust Motion Capture with Spatial-Temporal Skeletal Token Completion [code]
- HDFormer: High-order Directed Transformer for 3D Human Pose Estimation [pdf]
- HTNet: Human Topology Aware Network for 3D Human Pose Estimation [project]
- Proactive Multi-Camera Collaboration for 3D Human Pose Estimation [project]
- GFPose: Learning 3D Human Pose Prior with Gradient Fields [project]
- Diffusion-Based 3D Human Pose Estimation with Multi-Hypothesis Aggregation [code]
- Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation [pdf]
- PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation [project]
- PoseRAC: Pose Saliency Transformer for Repetitive Action Counting [code]
- CrossFormer: Cross Spatio-Temporal Transformer for 3D Human Pose Estimation [pdf]
- Motion-DVAE: Unsupervised learning for fast human motion denoising [project]
- Iterative Graph Filtering Network for 3D Human Pose Estimation [code]
simplify optical or inertial based motion capture
- Real-Time Multi-person Motion Capture from Multi-view Video and IMUs[paper]
- TransPose [code]
- Physical Inertial Poser (PIP):Physics-aware Real-time Human Motion Tracking from Sparse Inertial Sensors [project]
- EM-POSE: 3D Human Pose Estimation from Sparse Electromagnetic Trackers [code]
- HybridCap: Inertia-aid Monocular Capture of Challenging Human Motions [pdf]
- Transformer Inertial Poser: Attention-based Real-time Human Motion Reconstruction from Sparse IMUs [project]
- DeMoCap: Low-cost Marker-based Motion Capture [code]
- LiDAR-aid Inertial Poser: Large-scale Human Motion Capture by Sparse Inertial and LiDAR Sensors [pdf]
- AvatarPoser: Articulated Full-Body Pose Tracking from Sparse Motion Sensing [code]
- HybridTrak: Adding Full-Body Tracking to VR Using an Off-the-Shelf Webcam [video]
- FusePose: IMU-Vision Sensor Fusion in Kinematic Space for Parametric Human Pose Estimation [pdf]
- Fusion Poser: 3D Human Pose Estimation using Sparse IMUs and Head Tracker in real-time [code]
- QuestSim: Human Motion Tracking from Sparse Sensors with Simulated Avatars [pdf]
- TIP: Real-time Human Motion Reconstruction from Sparse IMUs with Simultaneous Terrain Generation [pdf]
- FLAG: Flow-based 3D Avatar Generation from Sparse Observations [project]
- Combining Motion Matching and Orientation Prediction to Animate Avatars for Consumer-Grade VR Devices [project]
- Faster Deep Inertial Pose Estimation with Six Inertial Sensors [code]
- Neural3Points: Learning to Generate Physically Realistic Full-body Motion for Virtual Reality Users [pdf]
human motion capture in 3d scene
- 4D Human Body Capture from Egocentric Video via 3D Scene Grounding [project]
- LEMO Learning Motion Priors for 4D Human Body Capture in 3D Scenes [project]
- Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-localization in Large Scenes from Body-Mounted Sensors [project]
- EgoBody Dataset:Human Body Shape, Motion and Social Interactions from Head-Mounted Devices[project]
- GOAL: Generating 4D Whole-Body Motion for Hand-Object Grasping [project]
- HSC4D: Human-centered 4D Scene Capture in Large-scale Indoor-outdoor Space Using Wearable IMUs and LiDAR [project]
- Human-Aware Object Placement for Visual Environment Reconstruction [project]
- COAP: Compositional Articulated Occupancy of People [project]
- BEHAVE: Dataset and Method for Tracking Human Object Interactions [project]
- Synthesizing Long-Term 3D Human Motion and Interaction in 3D [project]
- Visually plausible human-object interaction capture from wearable sensors [project]
- HULC: 3D HUman Motion Capture with Pose Manifold Sampling and Dense Contact Guidance [project]
- Towards Diverse and Natural Scene-aware 3D Human Motion Synthesis [pdf]
- MoCapDeform: Monocular 3D Human Motion Capture in Deformable Scenes [project]
- The One Where They Reconstructed 3D Humans and Environments in TV Shows [project]
- Stochastic Scene-Aware Motion Prediction [code]
- COINS : Compositional Human-Scene Interaction Synthesis with Semantic Control [projects]
- Placing Human Animations into 3D Scenes by Learning Interaction- and Geometry-Driven Keyframes [[project]
- InterCap: Joint Markerless 3D Tracking of Humans and Objects in Interaction [project]
- HUMANISE: Language-conditioned Human Motion Generation in 3D Scenes [project]
- SLOPER4D: A Scene-Aware Dataset for Global 4D Human Pose Estimation in Urban Environments [code]
- EgoLocate:Real-time Motion Capture, Localization, and Mapping with Sparse Body-mounted Sensors [project]
- Total-Recon: Deformable Scene Reconstruction for Embodied View Synthesis [project]
- Learning Human Mesh Recovery in 3D Scenes [project]
- MIME: Human-Aware 3D Scene Generation [project]
- TRACE: 5D Temporal Regression of Avatars with Dynamic Cameras in 3D Environments [project]
- CIMI4D: A Large Multimodal Climbing Motion Dataset under Human-scene Interactions [project]
- IMoS: Intent-Driven Full-Body Motion Synthesis for Human-Object Interactions [code]