TY - GEN
T1 - An auditory-based feature for robust speech recognition
AU - Shao, Yang
AU - Jin, Zhaozhang
AU - Wang, Deliang
AU - Srinivasan, Soundararajan
N1 - Copyright:
Copyright 2009 Elsevier B.V., All rights reserved.
PY - 2009
Y1 - 2009
N2 - A conventional automatic speech recognizer does not perform well in the presence of noise, while human listeners are able to segregate and recognize speech in noisy conditions. We study a novel feature based on an auditory periphery model for robust speech recognition. Specifically, gammatone frequency cepstral coefficients are derived by applying a cepstral analysis on gammatone filterbank responses. Our evaluations show that the proposed feature performs considerably better than conventional acoustic features. We further demonstrate that integrating the proposed feature with a computational auditory scene analysis system yields promising recognition performance.
AB - A conventional automatic speech recognizer does not perform well in the presence of noise, while human listeners are able to segregate and recognize speech in noisy conditions. We study a novel feature based on an auditory periphery model for robust speech recognition. Specifically, gammatone frequency cepstral coefficients are derived by applying a cepstral analysis on gammatone filterbank responses. Our evaluations show that the proposed feature performs considerably better than conventional acoustic features. We further demonstrate that integrating the proposed feature with a computational auditory scene analysis system yields promising recognition performance.
KW - Auditory feature
KW - Computational auditory scene analysis
KW - Gammatone frequency cepstral coefficients
KW - Robust speech recognition
UR - http://www.scopus.com/inward/record.url?scp=70349223037&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70349223037&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2009.4960661
DO - 10.1109/ICASSP.2009.4960661
M3 - Conference contribution
AN - SCOPUS:70349223037
SN - 9781424423545
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 4625
EP - 4628
BT - 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings, ICASSP 2009
T2 - 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2009
Y2 - 19 April 2009 through 24 April 2009
ER -