Efficient Adaptation of TTS Duration Model to New Speakers

Chilin Shih, Wentao Gu, Jan P.H. van Santen

Research output: Contribution to conference › Paper › peer-review

Abstract

This paper describes a methodology that uses a minimal set of sentences to adapt an existing TTS duration model to a new speaker, capturing inter-speaker variation. The assumption is that the original duration database contains information about both language-specific and speaker-specific duration characteristics. When training a duration model for a new speaker, only the speaker-specific information needs to be modeled, so the amount of training data can be reduced drastically. Results from several experiments are compared and discussed.
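The abstract does not specify the form of the duration model or the adaptation procedure, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes a hypothetical base model with additive effects in the log-duration domain (`BASE_EFFECTS`, with made-up values) that stands in for the language-specific knowledge learned from the original database, and it assumes that a new speaker can be approximated by a two-parameter linear transform of the base model's log predictions. With only two parameters to estimate, a handful of observed segments is enough.

```python
import math

# Hypothetical base model (an assumption for illustration): additive effects
# in the log-duration domain, standing in for a duration model trained on a
# large corpus from the original speaker. All values are made up.
BASE_EFFECTS = {
    "phone": {"a": math.log(90.0), "t": math.log(60.0), "s": math.log(100.0)},
    "stressed": {True: 0.15, False: 0.0},
    "phrase_final": {True: 0.30, False: 0.0},
}


def base_log_duration(phone, stressed, phrase_final):
    """Log duration (log ms) predicted by the base model for one segment."""
    return (
        BASE_EFFECTS["phone"][phone]
        + BASE_EFFECTS["stressed"][stressed]
        + BASE_EFFECTS["phrase_final"][phrase_final]
    )


def fit_speaker_transform(samples):
    """Fit log d_new = a + b * log d_base by ordinary least squares.

    samples: list of ((phone, stressed, phrase_final), observed_ms) pairs
    taken from a small set of sentences read by the new speaker.
    Returns the speaker-specific parameters (a, b).
    """
    xs = [base_log_duration(*features) for features, _ in samples]
    ys = [math.log(observed_ms) for _, observed_ms in samples]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0.0:
        # All base predictions are identical: the slope cannot be estimated,
        # so fall back to a pure offset (global speaking-rate change).
        b = 1.0
    else:
        sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
        b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b


def adapted_duration_ms(features, transform):
    """Predict a duration in ms for the new speaker from the base model."""
    a, b = transform
    return math.exp(a + b * base_log_duration(*features))
```

In this sketch, the factor effects learned from the large database are reused unchanged and only `a` and `b` are estimated for the new speaker, which is why so little adaptation data is needed. A real system would likely need a richer set of speaker-specific parameters than this.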

Original language: English (US)
State: Published - 1998
Externally published: Yes
Event: 5th International Conference on Spoken Language Processing, ICSLP 1998 - Sydney, Australia
Duration: Nov 30 1998 → Dec 4 1998

Conference

Conference: 5th International Conference on Spoken Language Processing, ICSLP 1998
Country/Territory: Australia
City: Sydney
Period: 11/30/98 → 12/4/98

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language
