Improving the Robustness of 3D Human Pose Estimation: A Benchmark Dataset and Learning from Noisy Input

Trung Hieu Hoang, Mona Zehni, Huy Phan, Duc Minh Vo, Minh N. Do

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Despite the promising performance of current 3D human pose estimation techniques, understanding and enhancing their robustness on challenging in-the-wild videos remain an open problem. In this work, we focus on building robust 2D-to-3D pose lifters. To this end, we develop two benchmark datasets, namely Human3.6M-C and HumanEva-I-C, to examine the resilience of video-based 3D pose lifters to a wide range of common video corruptions including temporary occlusion, motion blur, and pixel-level noise. We demonstrate the poor generalization of state-of-the-art 3D pose lifters in the presence of corruption and establish two techniques to tackle this issue. First, we introduce Temporal Additive Gaussian Noise (TAGN) as a simple yet effective 2D input pose data augmentation. Additionally, to incorporate the confidence scores output by the 2D pose detectors, we design a confidence-aware convolution (CA-Conv) block. Extensively tested on corrupted videos, the proposed strategies consistently boost the robustness of 3D pose lifters and serve as new baselines for future research.

Original languageEnglish (US)
Title of host publicationProceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024
PublisherIEEE Computer Society
Pages113-123
Number of pages11
ISBN (Electronic)9798350365474
DOIs
StatePublished - 2024
Event2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024 - Seattle, United States
Duration: Jun 16 2024Jun 22 2024

Publication series

NameIEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
ISSN (Print)2160-7508
ISSN (Electronic)2160-7516

Conference

Conference2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024
Country/TerritoryUnited States
CitySeattle
Period6/16/246/22/24

Keywords

  • 3D human pose estimation
  • adversarial attacks
  • confidence-aware
  • data augmentation
  • robustness of AI models
  • synthetic dataset
  • video corruptions

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Improving the Robustness of 3D Human Pose Estimation: A Benchmark Dataset and Learning from Noisy Input'. Together they form a unique fingerprint.

Cite this