Speaker independent phonetic transcription of fluent speech for large vocabulary speech recognition

S. E. Levinson, M. Y. Liberman, A. Ljolje, L. G. Miller

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Speaker independent phonetic Iranscription of fluent speech is performed using an ergodic continuously variable duration hidden Markov model (CVDHMM) to represent the acoustic, phonetic and phonotactic structure of speech. An important property of the model is that each of its fifty-one states is uniquely identified with a single phonetic unit. Thus, for any spoken utterance, a phonetic transcription is obtained from a dynamic programming (DP) procedure for finding the state sequence of maximum likelihood. A model has been constructed based on 4020 sentences from the TIMIT database. When tested on 180 different sentences from this database, phonetic accuracy was observed to be 56% with 9% insertions. A speaker dependent version of the model was also constructed. The transcription algorithm was then combined with lexical access and parsing routines to form a complete recognition system. When tested on sentences from the DARPA resource management task spoken over the local switched telephone network, phonetic accuracy of 64% with 8% insertions and word accuracy of 87% with 3% insertions was measured. This system is presently operating in an on-line mode over the local switched telephone network in less than ten times real time on an Alliant FX-80.

Original languageEnglish (US)
Title of host publicationSpeech and Natural Language, Proceedings of a Workshop
PublisherAssociation for Computational Linguistics (ACL)
Pages75-80
Number of pages6
ISBN (Electronic)1558600736, 9781558600737
StatePublished - 1989
Externally publishedYes
Event1989 Speech and Natural Language Workshop held at Philadelphia, PA Human Language Technology Conference, HLT 1989 - Philadelphia, United States
Duration: Feb 21 1989Feb 23 1989

Publication series

NameSpeech and Natural Language, Proceedings of a Workshop

Conference

Conference1989 Speech and Natural Language Workshop held at Philadelphia, PA Human Language Technology Conference, HLT 1989
Country/TerritoryUnited States
CityPhiladelphia
Period2/21/892/23/89

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Speaker independent phonetic transcription of fluent speech for large vocabulary speech recognition'. Together they form a unique fingerprint.

Cite this