A Bayesian predictive method for automatic speech segmentation

Ming Liu, Thomas S. Huang

Research output: Contribution to journal, Conference article

Abstract

Implicit speech segmentation amounts to finding the time instants at which the spectral distortion is large. The Spectral Variation Function (SVF) is a widely used measure of spectral distortion, but it is data-dependent. To make the measurement data-independent, a likelihood ratio is constructed to measure the spectral distortion. This ratio can be computed efficiently with a Bayesian predictive model, whose prior is estimated from unlabeled data with an unsupervised machine learning technique, the Gaussian Mixture Model (GMM). Experimental results show the effectiveness of this novel method, and its performance on the TIMIT corpus indicates potential applications in speech recognition, synthesis, and coding.
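
The general idea in the abstract's first sentence, placing boundaries where a spectral distortion measure peaks, can be sketched as follows. This is only a generic illustration under stated assumptions: the distortion here is a plain Euclidean distance between the mean feature vectors of adjacent windows, which is neither the SVF nor the likelihood ratio of the paper, and the names `distortion_curve`, `pick_boundaries`, `half_window`, and `threshold` are hypothetical.

```python
import numpy as np

def distortion_curve(frames, half_window=2):
    """Distance between the mean feature vectors just before and just after each frame.

    Assumption: Euclidean distance stands in for a spectral distortion measure.
    """
    frames = np.asarray(frames, dtype=float)
    n = len(frames)
    curve = np.zeros(n)
    for t in range(half_window, n - half_window):
        left = frames[t - half_window:t].mean(axis=0)
        right = frames[t:t + half_window].mean(axis=0)
        curve[t] = np.linalg.norm(right - left)
    return curve

def pick_boundaries(curve, threshold):
    """Return indices of local maxima of the curve that exceed the threshold."""
    return [
        t for t in range(1, len(curve) - 1)
        if curve[t] > threshold
        and curve[t] >= curve[t - 1]
        and curve[t] > curve[t + 1]
    ]

# Toy input: 10 frames of one "sound" followed by 10 frames of another.
frames = np.vstack([np.zeros((10, 3)), np.ones((10, 3))])
boundaries = pick_boundaries(distortion_curve(frames), threshold=1.0)
print(boundaries)  # [10]
```

The fixed `threshold` in this sketch has to be tuned to the scale of the features, which is one simple way to see why a data-dependent distortion measure is inconvenient.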

Original language: English (US)
Article number: 1699837
Pages (from-to): 290-293
Number of pages: 4
Journal: Proceedings - International Conference on Pattern Recognition
Volume: 4
State: Published - Dec 1 2006
Event: 18th International Conference on Pattern Recognition, ICPR 2006 - Hong Kong, China
Duration: Aug 20 2006 to Aug 24 2006

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
