TY - GEN
T1 - Audio-visual affect recognition through multi-stream fused HMM for HCI
AU - Zeng, Zhihong
AU - Tu, Jilin
AU - Pianfetti, Brian
AU - Liu, Ming
AU - Zhang, Tong
AU - Zhang, Zhenqiu
AU - Huang, Thomas S
AU - Levinson, Stephen
PY - 2005
Y1 - 2005
N2 - Advances in computer processing power and emerging algorithms are allowing new ways of envisioning Human Computer Interaction. This paper focuses on the development of a computing algorithm that uses audio and visual sensors to detect and track a user's affective state to aid computer decision making. Using our Multi-stream Fused Hidden Markov Model (MFHMM), we analyzed coupled audio and visual streams to detect 11 cognitive/emotive states. The MFHMM allows the building of an optimal connection among multiple streams according to the maximum entropy principle and the maximum mutual information criterion. Person-independent experimental results from 20 subjects in 660 sequences show that the MFHMM approach performs with an accuracy of 80.61% which outperforms face-only HMM, pitch-only HMM, energy-only HMM, and independent HMM fusion.
AB - Advances in computer processing power and emerging algorithms are allowing new ways of envisioning Human Computer Interaction. This paper focuses on the development of a computing algorithm that uses audio and visual sensors to detect and track a user's affective state to aid computer decision making. Using our Multi-stream Fused Hidden Markov Model (MFHMM), we analyzed coupled audio and visual streams to detect 11 cognitive/emotive states. The MFHMM allows the building of an optimal connection among multiple streams according to the maximum entropy principle and the maximum mutual information criterion. Person-independent experimental results from 20 subjects in 660 sequences show that the MFHMM approach performs with an accuracy of 80.61% which outperforms face-only HMM, pitch-only HMM, energy-only HMM, and independent HMM fusion.
UR - http://www.scopus.com/inward/record.url?scp=24644432083&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=24644432083&partnerID=8YFLogxK
U2 - 10.1109/CVPR.2005.77
DO - 10.1109/CVPR.2005.77
M3 - Conference contribution
AN - SCOPUS:24644432083
SN - 0769523722
SN - 9780769523729
T3 - Proceedings - 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005
SP - 967
EP - 972
BT - Proceedings - 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005
PB - IEEE Computer Society
T2 - 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005
Y2 - 20 June 2005 through 25 June 2005
ER -