TY - GEN
T1 - Robust speaker identification using auditory features and computational auditory scene analysis
AU - Shao, Yang
AU - Wang, De Liang
PY - 2008
Y1 - 2008
N2 - The performance of speaker recognition systems drop significantly under noisy conditions. To improve robustness, we have recently proposed novel auditory features and a robust speaker recognition system using a front-end based on computational auditory scene analysis. In this paper, we further study the auditory features by exploring different feature dimensions and incorporating dynamic features. In addition, we evaluate the features and robust recognition in a speaker identification task in a number of noisy conditions. We find that one of the auditory features performs substantially better than a conventional speaker feature. Furthermore, our recognition system achieves significant performance improvements compared with an advanced front-end in a wide range of signal-to-noise conditions.
AB - The performance of speaker recognition systems drop significantly under noisy conditions. To improve robustness, we have recently proposed novel auditory features and a robust speaker recognition system using a front-end based on computational auditory scene analysis. In this paper, we further study the auditory features by exploring different feature dimensions and incorporating dynamic features. In addition, we evaluate the features and robust recognition in a speaker identification task in a number of noisy conditions. We find that one of the auditory features performs substantially better than a conventional speaker feature. Furthermore, our recognition system achieves significant performance improvements compared with an advanced front-end in a wide range of signal-to-noise conditions.
KW - Auditory feature
KW - Computational auditory scene analysis
KW - Gammatone feature
KW - Gammatone frequency cepstral coefficient
KW - Robust speaker recognition
UR - http://www.scopus.com/inward/record.url?scp=51449101666&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=51449101666&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2008.4517928
DO - 10.1109/ICASSP.2008.4517928
M3 - Conference contribution
AN - SCOPUS:51449101666
SN - 1424414849
SN - 9781424414840
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 1589
EP - 1592
BT - 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP
T2 - 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP
Y2 - 31 March 2008 through 4 April 2008
ER -