Efficient Adaptation of TTS Duration Model to New Speakers

Chilin Shih, Wentao Gu, Jan P.H. van Santen

Research output: Contribution to conferencePaperpeer-review

Abstract

This paper discusses a methodology using a minimal set of sentences to adapt an existing TTS duration model to capture inter-speaker variations. The assumption is that the original duration database contains information of both language-specific and speaker-specific duration characteristics. In training a duration model for a new speaker, only the speaker-specific information needs to be modeled, therefore the size of the training data can be reduced drastically. Results from several experiments are compared and discussed.

Original languageEnglish (US)
Pages81-86
Number of pages6
StatePublished - 1998
Externally publishedYes
Event3rd ESCA/COCOSDA Workshop on Speech Synthesis, SSW 1998 - Blue Mountains, Australia
Duration: Nov 26 1998Nov 29 1998

Conference

Conference3rd ESCA/COCOSDA Workshop on Speech Synthesis, SSW 1998
Country/TerritoryAustralia
CityBlue Mountains
Period11/26/9811/29/98

ASJC Scopus subject areas

  • Language and Linguistics
  • Cultural Studies

Fingerprint

Dive into the research topics of 'Efficient Adaptation of TTS Duration Model to New Speakers'. Together they form a unique fingerprint.

Cite this