Joint face and head tracking inside multi-camera smart rooms

Zhenqiu Zhang, Gerasimos Potamianos, Andrew W. Senior, Thomas S. Huang

Research output: Contribution to journalArticlepeer-review

Abstract

The paper introduces a novel detection and tracking system that provides both frame-view and world-coordinate human location information, based on video from multiple synchronized and calibrated cameras with overlapping fields of view. The system is developed and evaluated for the specific scenario of a seminar lecturer presenting in front of an audience inside a "smart room", its aim being to track the lecturer's head centroid in the three-dimensional (3D) space and also yield two-dimensional (2D) face information in the available camera views. The proposed approach is primarily based on a statistical appearance model of human faces by means of well-known AdaBoost-like face detectors, extended to address the head pose variation observed in the smart room scenario of interest. The appearance module is complemented by two novel components and assisted by a simple tracking drift detection mechanism. The first component of interest is the initialization module, which employs a spatio-temporal dynamic programming approach with appropriate penalty functions to obtain optimal 3D location hypotheses. The second is an adaptive subspace learning based 2D tracking scheme with a novel forgetting mechanism, introduced to reduce tracking drift and increase robustness. System performance is benchmarked on an extensive database of realistic human interaction in the lecture smart room scenario, collected as part of the European integrated project "CHIL". The system consistently achieves excellent tracking precision, with a 3D mean tracking error of less than 16 cm, and is demonstrated to outperform four alternative tracking schemes. Furthermore, the proposed system performs relatively well in detecting frontal and near-frontal faces in the available frame views.

Original languageEnglish (US)
Pages (from-to)163-178
Number of pages16
JournalSignal, Image and Video Processing
Volume1
Issue number2
DOIs
StatePublished - Jun 2007

Keywords

  • AdaBoost
  • Adaptive subspace tracking
  • Dynamic programming
  • Face detection
  • Lecture data
  • Mean-shift tracking
  • Multi-camera tracking
  • Person tracking
  • Smart rooms

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Joint face and head tracking inside multi-camera smart rooms'. Together they form a unique fingerprint.

Cite this