4D Human Body Capture from Egocentric Video via 3D Scene Grounding

Miao Liu, Dexin Yang, Yan Zhang, Zhaopeng Cui, James M. Rehg, Siyu Tang

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

We introduce a novel task of reconstructing a time series of second-person 3D human body meshes from monocular egocentric videos. The unique viewpoint and rapid embodied camera motion of egocentric videos raise additional technical barriers for human body capture. To address these challenges, we propose a simple yet effective optimization-based approach that leverages 2D observations of the entire video sequence and human-scene interaction constraints to estimate second-person human poses, shapes, and global motion that are grounded on the 3D environment captured from the egocentric view. We conduct detailed ablation studies to validate our design choices. Moreover, we compare our method with the previous state-of-the-art method on human motion capture from monocular video, and show that our method estimates more accurate human-body poses and shapes under the challenging egocentric setting. In addition, we demonstrate that our approach produces more realistic human-scene interaction.

Original language: English (US)
Title of host publication: Proceedings - 2021 International Conference on 3D Vision, 3DV 2021
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 930-939
Number of pages: 10
ISBN (Electronic): 9781665426886
State: Published - 2021
Externally published: Yes
Event: 9th International Conference on 3D Vision, 3DV 2021 - Virtual, Online, United Kingdom
Duration: Dec 1 2021 to Dec 3 2021

Publication series

Name: Proceedings - 2021 International Conference on 3D Vision, 3DV 2021

Conference

Conference: 9th International Conference on 3D Vision, 3DV 2021
Country/Territory: United Kingdom
City: Virtual, Online
Period: 12/1/21 to 12/3/21

Keywords

  • 3D Human Reconstruction
  • Egocentric Vision
  • Human Scene Interaction

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Vision and Pattern Recognition
