Strong-sense class-dependent features for statistical recognition

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In statistical classification and recognition problems with many classes, it is commonly the case that different classes exhibit wildly different properties. In this case it is unreasonable to expect to be able to summarize these properties by using features designed to represent all the classes. In contrast, features should be designed to represent subsets that exhibit common properties without regard to any class outside this subset. The value of these features for classes outside the subset may be meaningless, or simply undefined. The main problem, due to the statistical nature of the recognizer, is how to compare likelihoods conditioned on different sets of features to decode an input pattern. This paper introduces a class-dependent feature design approach that can be integrated with any probabilistic model. This approach avoids the need of having a conditional probabilistic model for each class and feature type pair, and therefore decreases the computational and storage requirements of using heterogeneous features. This paper presents an algorithm to calculate the class-dependent features that minimize an estimate of the relative entropy between the conditional probabilistic model and the actual conditional probability density function (PDF) of the features of each class. An approach to a hidden Markov model (HMM) automatic speech recognition (ASR) system is applied. A nonlinear class-dependent volume-preserving transformation of the features is used to minimize the objective function. Using this approach, 2% improvement in phoneme recognition accuracy is achieved compared to the baseline system. The approach also shows improvement in recognition accuracy compared to previous class-dependent linear features transformation.

Original languageEnglish (US)
Title of host publicationProceedings of the 2003 IEEE Workshop on Statistical Signal Processing, SSP 2003
PublisherIEEE Computer Society
Pages490-493
Number of pages4
ISBN (Electronic)0780379977
DOIs
StatePublished - 2003
EventIEEE Workshop on Statistical Signal Processing, SSP 2003 - St. Louis, United States
Duration: Sep 28 2003Oct 1 2003

Publication series

NameIEEE Workshop on Statistical Signal Processing Proceedings
Volume2003-January

Other

OtherIEEE Workshop on Statistical Signal Processing, SSP 2003
Country/TerritoryUnited States
CitySt. Louis
Period9/28/0310/1/03

Keywords

  • Automatic speech recognition
  • Decoding
  • Entropy
  • Hidden Markov models
  • Pattern recognition
  • Performance loss
  • Probability density function
  • Robustness
  • Speech recognition

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Applied Mathematics
  • Signal Processing
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Strong-sense class-dependent features for statistical recognition'. Together they form a unique fingerprint.

Cite this