TY - GEN
T1 - Variational transform invariant mixture of probabilistic PCA
AU - Tu, Jilin
AU - Fu, Yun
AU - Ivanovic, Alexandar
AU - Huang, Thomas S.
AU - Li, Fei Fei
PY - 2008
Y1 - 2008
N2 - In many video-based object recognition applications, the object appearances are acquired by visual tracking or detection and are inconsistent due to misalignments. We believe the misalignments can be removed if we can reduce the inconsistency in the object appearances caused by mis-alignments through clustering the objects in appearance, space and time domain simultaneously. We therefore propose to learn Transform Invariant Mixtures of Probabilistic PCA (TIMPPCA) model from the data while at the same time eliminating the misalignments. The model is formulated in a generative framework, and the misalignments are considered as hidden variables in the model. Variational EM update rules are then derived based on Variational Message Passing (VMP) techniques. The proposed TIMP-PCA is applied to improve head pose estimation performance and to detect the change of attention focus in meeting room video for meeting room video indexing/retrieval and achieves promising performance.
AB - In many video-based object recognition applications, the object appearances are acquired by visual tracking or detection and are inconsistent due to misalignments. We believe the misalignments can be removed if we can reduce the inconsistency in the object appearances caused by mis-alignments through clustering the objects in appearance, space and time domain simultaneously. We therefore propose to learn Transform Invariant Mixtures of Probabilistic PCA (TIMPPCA) model from the data while at the same time eliminating the misalignments. The model is formulated in a generative framework, and the misalignments are considered as hidden variables in the model. Variational EM update rules are then derived based on Variational Message Passing (VMP) techniques. The proposed TIMP-PCA is applied to improve head pose estimation performance and to detect the change of attention focus in meeting room video for meeting room video indexing/retrieval and achieves promising performance.
UR - http://www.scopus.com/inward/record.url?scp=50849135945&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=50849135945&partnerID=8YFLogxK
U2 - 10.1109/WACV.2008.4543995
DO - 10.1109/WACV.2008.4543995
M3 - Conference contribution
AN - SCOPUS:50849135945
SN - 1424419131
SN - 9781424419135
T3 - 2008 IEEE Workshop on Applications of Computer Vision, WACV
BT - 2008 IEEE Workshop on Applications of Computer Vision, WACV
T2 - 2008 IEEE Workshop on Applications of Computer Vision, WACV
Y2 - 7 January 2008 through 9 January 2008
ER -