TY - GEN
T1 - Audio segment retrieval using a short duration example query
AU - Velivelli, Atulya
AU - Zhai, Chengxiang
AU - Huang, Thomas S.
PY - 2004
Y1 - 2004
AB - In this paper, we propose a general approach to audio segment retrieval using a synthesized HMM. The approach lets a user query audio data with a short example audio segment and find similar segments. The basic idea is to first train a theme HMM on the given example and a general background HMM on all the audio data, and then combine the two HMMs into a synthesized "Background-Theme-Background" HMM. This synthesized HMM can then be applied to any audio stream as a parser to detect the most likely theme segment. Because a short example provides little data for training the theme HMM, we estimate it with the MAP rule, using the background model as the prior. Evaluation of the proposed retrieval scheme using short example audio clips of narration as queries gives promising results.
UR - http://www.scopus.com/inward/record.url?scp=11244287370&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=11244287370&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:11244287370
SN - 0780386035
SN - 9780780386037
T3 - 2004 IEEE International Conference on Multimedia and Expo (ICME)
SP - 1603
EP - 1606
BT - 2004 IEEE International Conference on Multimedia and Expo (ICME)
T2 - 2004 IEEE International Conference on Multimedia and Expo (ICME)
Y2 - 27 June 2004 through 30 June 2004
ER -