A factorial HMM approach to simultaneous recognition of isolated digits spoken by multiple talkers on one audio channel

Ameya Nitin Deoras, Mark Hasegawa-Johnson

Research output: Contribution to journal; Conference article; peer-review

Abstract

This paper addresses the novel problem of recognizing digits spoken simultaneously by two different talkers. A factorial hidden Markov model (FHMM) architecture is proposed to model the simultaneous utterance of two digits. Nadas' MIXMAX approximation is extended to a mixture-of-Gaussians observation PDF, which makes the proposed system implementable. The multiple-digit recognizer recognizes pairs of simultaneously spoken digits at 0 dB SNR with up to 89% accuracy.
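As a rough illustration of the observation model the abstract refers to, the sketch below evaluates a MIXMAX-style likelihood for two mixture-of-Gaussians sources. It is not the authors' implementation. It assumes the usual reading of MIXMAX: in the log-spectral domain, each frequency bin of the mixed signal is approximated by the larger of the two talkers' values, the talkers are independent, and every Gaussian has a diagonal covariance. The function names and the data layout (a list of `(weight, [(mean, std), ...])` components) are hypothetical.

```python
import math


def normal_pdf(y, mean, std):
    """Density of a univariate Gaussian at y."""
    z = (y - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def normal_cdf(y, mean, std):
    """Cumulative distribution of a univariate Gaussian at y."""
    return 0.5 * (1.0 + math.erf((y - mean) / (std * math.sqrt(2.0))))


def mixmax_gaussian_pair(y, comp_a, comp_b):
    """Density of y when each dimension is max(a_d, b_d).

    comp_a and comp_b are lists of (mean, std) pairs, one per dimension
    (assumed diagonal covariance, independent sources).
    """
    density = 1.0
    for y_d, (m_a, s_a), (m_b, s_b) in zip(y, comp_a, comp_b):
        # Either source a produced y_d and b lies below it, or vice versa.
        density *= (
            normal_pdf(y_d, m_a, s_a) * normal_cdf(y_d, m_b, s_b)
            + normal_pdf(y_d, m_b, s_b) * normal_cdf(y_d, m_a, s_a)
        )
    return density


def mixmax_gmm_likelihood(y, gmm_a, gmm_b):
    """Sum the pairwise max-model density over all component pairs.

    Each GMM is a list of (weight, components) tuples, where components
    is a list of (mean, std) pairs with one entry per dimension.
    """
    total = 0.0
    for w_a, comp_a in gmm_a:
        for w_b, comp_b in gmm_b:
            total += w_a * w_b * mixmax_gaussian_pair(y, comp_a, comp_b)
    return total
```

In a factorial HMM, a likelihood of this kind would be evaluated for each pair of states (one per talker), which is why the cost grows with the product of the two state spaces and of the two mixture sizes.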

Original language: English (US)
Pages (from-to): I861-I864
Journal: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume: 1
State: Published - 2004
Event: Proceedings - IEEE International Conference on Acoustics, Speech, and Signal Processing - Montreal, Que, Canada
Duration: May 17 2004 to May 21 2004

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering
