TY - JOUR
T1 - On the Use of Hidden Markov Models for Speaker‐Independent Recognition of Isolated Words From a Medium‐Size Vocabulary
AU - Rabiner, L. R.
AU - Levinson, S. E.
AU - Sondhi, M. M.
PY - 1984/4
Y1 - 1984/4
N2 - Recent work at AT&T Bell Laboratories has shown how the theories of Vector Quantization (VQ) and Hidden Markov Modeling (HMM) can be applied to the recognition of isolated word vocabularies. The initial experiments with an HMM word recognizer were restricted to a vocabulary of 10 digits. For this simple vocabulary with dialed‐up telephone recordings, we found that a high‐performance, speaker‐independent word recognizer could be implemented, and that the performance was, for the most part, insensitive to parameters of both the HMM and the VQ. In this paper we extend our investigations of the HMM recognizer to the recognition of isolated words from a medium‐size vocabulary (129 words), as used in the AT&T Bell Laboratories airlines reservation and information system. For this moderately complex word vocabulary, we have found that recognition accuracy is indeed a function of the HMM parameters (i.e., the number of states in the model and the number of symbols per state). We have also found that a VQ that includes energy information gives better performance than a conventional VQ of the same size (i.e., same number of code‐book entries).
AB - Recent work at AT&T Bell Laboratories has shown how the theories of Vector Quantization (VQ) and Hidden Markov Modeling (HMM) can be applied to the recognition of isolated word vocabularies. The initial experiments with an HMM word recognizer were restricted to a vocabulary of 10 digits. For this simple vocabulary with dialed‐up telephone recordings, we found that a high‐performance, speaker‐independent word recognizer could be implemented, and that the performance was, for the most part, insensitive to parameters of both the HMM and the VQ. In this paper we extend our investigations of the HMM recognizer to the recognition of isolated words from a medium‐size vocabulary (129 words), as used in the AT&T Bell Laboratories airlines reservation and information system. For this moderately complex word vocabulary, we have found that recognition accuracy is indeed a function of the HMM parameters (i.e., the number of states in the model and the number of symbols per state). We have also found that a VQ that includes energy information gives better performance than a conventional VQ of the same size (i.e., same number of code‐book entries).
UR - http://www.scopus.com/inward/record.url?scp=0021407797&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0021407797&partnerID=8YFLogxK
U2 - 10.1002/j.1538-7305.1984.tb00023.x
DO - 10.1002/j.1538-7305.1984.tb00023.x
M3 - Article
AN - SCOPUS:0021407797
SN - 0748-612X
VL - 63
SP - 627
EP - 642
JO - AT&T Bell Laboratories Technical Journal
JF - AT&T Bell Laboratories Technical Journal
IS - 4
ER -