TY - GEN
T1 - Accurate speech segmentation by mimicking human auditory processing
AU - King, Sarah
AU - Hasegawa-Johnson, Mark
PY - 2013/10/18
Y1 - 2013/10/18
N2 - This paper addresses the problem of locating phone boundaries without prior knowledge of the text of an utterance. A biomimetic model of human auditory processing is used to calculate the neural features of frequency synchrony and average signal level. Frequency synchrony and average signal level are used as input to a two-layered support vector machine (SVM)-based system to detect phone boundaries. Phone boundaries are detected with 87.0% precision and 84.8% recall when the automatic segmentation system has no prior knowledge of the phone sequence in the utterance.
AB - This paper addresses the problem of locating phone boundaries without prior knowledge of the text of an utterance. A biomimetic model of human auditory processing is used to calculate the neural features of frequency synchrony and average signal level. Frequency synchrony and average signal level are used as input to a two-layered support vector machine (SVM)-based system to detect phone boundaries. Phone boundaries are detected with 87.0% precision and 84.8% recall when the automatic segmentation system has no prior knowledge of the phone sequence in the utterance.
KW - Automatic segmentation
KW - auditory modeling
KW - average signal level
KW - frequency synchrony
UR - http://www.scopus.com/inward/record.url?scp=84890508656&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84890508656&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2013.6639242
DO - 10.1109/ICASSP.2013.6639242
M3 - Conference contribution
AN - SCOPUS:84890508656
SN - 9781479903566
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 8096
EP - 8100
BT - 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings
T2 - 2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013
Y2 - 26 May 2013 through 31 May 2013
ER -