Improving the robustness of prosody dependent language modeling based on prosody syntax dependence

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper presents a novel approach that improves the robustness of prosody dependent language modeling by leveraging the dependence between prosody and syntax. A prosody dependent language model describes the joint probability distribution of concurrent word and prosody sequences and can be used to provide prior language constraints in a prosody dependent speech recognizer. Robust Maximum Likelihood (ML) estimation of prosody dependent n-gram language models requires a large amount of prosodically transcribed data. In this paper, we show that the prosody-syntax dependence can be used to reduce the data sparseness introduced by prosody dependent modeling. Experiments on the Radio News Corpus show that the prosody dependent language model estimated using our approach reduces the joint perplexity by up to 34% compared with the standard ML-estimated prosody dependent language model; the word perplexity can be reduced by up to 84% compared with the standard ML-estimated prosody independent language model. In recognition experiments, the language model estimated by our approach yields an improvement of 1% in word recognition accuracy, 0.7% in accent recognition accuracy, and 1.5% in intonational phrase boundary (IPB) recognition accuracy over a baseline prosody dependent language model.
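To make the terms in the abstract concrete, the following is a minimal sketch of a standard ML-estimated prosody dependent bigram model and its joint perplexity. It is not the estimation approach proposed in the paper, only the baseline idea of treating each (word, prosody) pair as one token. The function names, the toy corpus, and the labels `acc` / `unacc` are all assumptions made for illustration.

```python
import math
from collections import Counter

# Hypothetical start-of-sentence token; not taken from the paper.
START = ("<s>", "none")


def train_bigram(sequences):
    """ML-estimate bigram probabilities over (word, prosody) tokens."""
    context_counts = Counter()
    bigram_counts = Counter()
    for seq in sequences:
        tokens = [START] + list(seq)
        for prev, cur in zip(tokens, tokens[1:]):
            context_counts[prev] += 1
            bigram_counts[(prev, cur)] += 1
    # Relative frequency: count(prev, cur) / count(prev)
    return {bg: c / context_counts[bg[0]] for bg, c in bigram_counts.items()}


def joint_perplexity(model, seq):
    """Perplexity of a (word, prosody) sequence; infinite if a bigram is unseen."""
    tokens = [START] + list(seq)
    log_prob = 0.0
    for prev, cur in zip(tokens, tokens[1:]):
        p = model.get((prev, cur), 0.0)
        if p == 0.0:
            return math.inf  # unsmoothed ML assigns zero mass to unseen events
        log_prob += math.log2(p)
    return 2 ** (-log_prob / len(seq))


# Toy corpus (invented): the same words with two different accent patterns.
toy_corpus = [
    [("the", "unacc"), ("news", "acc")],
    [("the", "unacc"), ("news", "unacc")],
]
model = train_bigram(toy_corpus)

seen = joint_perplexity(model, [("the", "unacc"), ("news", "acc")])
unseen = joint_perplexity(model, [("news", "acc")])
print(seen, unseen)
```

Because every word is split into several prosody-tagged variants, the same amount of training text covers fewer of the possible bigrams. In this sketch that shows up as an infinite perplexity for the unseen sequence, which is the data sparseness the abstract refers to.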

Original language: English (US)
Title of host publication: 2003 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 435-440
Number of pages: 6
ISBN (Electronic): 0780379802, 9780780379800
DOIs: https://doi.org/10.1109/ASRU.2003.1318480
State: Published - Jan 1 2003
Event: IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003 - St. Thomas, United States
Duration: Nov 30 2003 to Dec 4 2003

Publication series

Name: 2003 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003

Other

Other: IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003
Country: United States
City: St. Thomas
Period: 11/30/03 to 12/4/03

ASJC Scopus subject areas

  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Computer Science Applications

Cite this

Chen, K., & Hasegawa-Johnson, M. (2003). Improving the robustness of prosody dependent language modeling based on prosody syntax dependence. In 2003 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003 (pp. 435-440). [1318480] (2003 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ASRU.2003.1318480