State-transition interpolation and map adaptation for HMM-based dysarthric speech recognition

Harsh Vardhan Sharma, Mark Hasegawa-Johnson

Research output: Contribution to conferencePaperpeer-review

Abstract

This paper describes the results of our experiments in building speaker-adaptive recognizers for talkers with spastic dysarthria. We study two modifications - (a) MAP adaptation of speaker-independent systems trained on normal speech and, (b) using a transition probability matrix that is a linear interpolation between fully ergodic and (exclusively) leftto- right structures, for both speaker-dependent and speaker-adapted systems. The experiments indicate that (1) for speaker-dependent systems, left-to-right HMMs have lower word error rate than transition-interpolated HMMs, (2) adapting all parameters other than transition probabilities results in the highest recognition accuracy compared to adapting any subset of these parameters or adapting all parameters including transition probabilities, (3) performing both transition-interpolation and adaptation gives higher word error rate than performing adaptation alone and, (4) dysarthria severity is not a sufficient indicator of the relative performance of speakerdependent and speaker-adapted systems.

Conference

Conference1st Workshop on Speech and Language Processing for Assistive Technologies, SLPAT 2010 at the 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2010
Country/TerritoryUnited States
CityLos Angeles
Period6/5/10 → …

ASJC Scopus subject areas

  • Artificial Intelligence
  • Language and Linguistics
  • Linguistics and Language
  • Computer Science Applications

Cite this