TY - GEN
T1 - Prosody-dependent acoustic modeling using variable-parameter hidden markov models
AU - Huang, Jui Ting
AU - Huang, Po Sen
AU - Mo, Yoonsook
AU - Hasegawa-Johnson, Mark
AU - Cole, Jennifer
N1 - Publisher Copyright:
© 2010 Proceedings of the International Conference on Speech Prosody.
PY - 2010
Y1 - 2010
N2 - As an effort to make prosody useful in spontaneous speech recognition, we adopt a quasi-continuous prosodic annotation and accordingly design a prosody-dependent acoustic model to improve ASR performances. We propose a variable-parameter Hidden Markov Models, modeling the mean vector as a function of the prosody variable through a polynomial regression model. The prosodically-adapted acoustic models are used to re-score the N-best output from a standard ASR, according to the prosody variable assigned by an automatic prosody detector. Experiments on the Buckeye corpus demonstrate the effectiveness of our approach.
AB - As an effort to make prosody useful in spontaneous speech recognition, we adopt a quasi-continuous prosodic annotation and accordingly design a prosody-dependent acoustic model to improve ASR performances. We propose a variable-parameter Hidden Markov Models, modeling the mean vector as a function of the prosody variable through a polynomial regression model. The prosodically-adapted acoustic models are used to re-score the N-best output from a standard ASR, according to the prosody variable assigned by an automatic prosody detector. Experiments on the Buckeye corpus demonstrate the effectiveness of our approach.
KW - Prosody-dependent ASR
KW - Re-scoring
KW - Variable parameter HMM
UR - http://www.scopus.com/inward/record.url?scp=84902969600&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84902969600&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84902969600
T3 - Proceedings of the International Conference on Speech Prosody
BT - 5th International Conference on Speech Prosody 2010
PB - International Speech Communication Association
T2 - 5th International Conference on Speech Prosody: Every Language, Every Style, SP 2010
Y2 - 10 May 2010 through 14 May 2010
ER -