In the last several years, a wide variety of techniques have been developed which make practical the implementation and development of large networks for recognizing connected sequences of words. Included among these techniques are efficient and accurate speech modeling methods (e. g. , vector quantization, hidden Markov models) and efficient, optimal network search procedures (i. e. , level building). It is shown how to integrate these techniques to give a speaker-independent, syntax-directed, connected word recognition system which requires only a modest amount of computation and has a performance comparable to that of previous recognizers requiring an order of magnitude more computation. In particular, the recognizer studied was an airline information and reservation system using a 129-word vocabulary and a deterministic syntax (grammar) with 144 states, 450 state transitions, and 21 final states, generating more than 6 multiplied by 10**9 sentences. An evaluation of the system, using six talkers each speaking 51 test sentences, yielded a sentence accuracy of about 75% resulting from a word accuracy of about 93%, for an average speaking rate of about 210 words per minute.
|Original language||English (US)|
|Number of pages||13|
|Journal||IEEE Transactions on Acoustics, Speech, and Signal Processing|
|State||Published - 1985|
ASJC Scopus subject areas
- Signal Processing