Interplay between visual and audio scene analysis

Ziyou Xiong, Thomas S. Huang

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

We have argued the necessity of joint audio-visual scene analysis to deal with the difficult problem of CASA. It is argued that the problem of CASA will benefit from computer audio-visual scene analysis (CAVSA). We also propose a generative probabilistic model on correlogram, the video representation of audio signal, to separate the audio sources.

Original languageEnglish (US)
Title of host publicationSpeech Separation by Humans and Machines
PublisherSpringer
Pages283-293
Number of pages11
ISBN (Print)1402080018, 9781402080012
DOIs
StatePublished - 2005
Externally publishedYes

ASJC Scopus subject areas

  • Engineering(all)

Fingerprint

Dive into the research topics of 'Interplay between visual and audio scene analysis'. Together they form a unique fingerprint.

Cite this