Audio segment retrieval using a short duration example query

Atulya Velivelli, Chengxiang Zhai, Thomas S. Huang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper, we propose a general approach to audio segment retrieval using a synthesized HMM. The approach allows a user to query audio data by an example audio segment of a short duration and find similar segments. The basic idea of our approach is to first train a theme HMM using the given example and a general background HMM using all the audio data, and then combine these individual HMMs to form a synthesized "Background-Theme-Background" HMM. This synthesized HMM can then be applied to any audio stream as a parser to detect the most likely theme segment. We overcome the problem of a short duration being used to train a theme HMM, by using the MAP rule with the Background model as a prior model. Evaluation of the proposed retrieval scheme using short duration example audio clips of narration as queries gives quite promising results.

Original languageEnglish (US)
Title of host publication2004 IEEE International Conference on Multimedia and Expo (ICME)
Pages1603-1606
Number of pages4
StatePublished - 2004
Event2004 IEEE International Conference on Multimedia and Expo (ICME) - Taipei, Taiwan, Province of China
Duration: Jun 27 2004Jun 30 2004

Publication series

Name2004 IEEE International Conference on Multimedia and Expo (ICME)
Volume3

Other

Other2004 IEEE International Conference on Multimedia and Expo (ICME)
Country/TerritoryTaiwan, Province of China
CityTaipei
Period6/27/046/30/04

ASJC Scopus subject areas

  • General Engineering

Fingerprint

Dive into the research topics of 'Audio segment retrieval using a short duration example query'. Together they form a unique fingerprint.

Cite this