Interplay between visual and audio scene analysis

Ziyou Xiong, Thomas S. Huang

Research output: Chapter in Book/Report/Conference proceeding (Chapter)


We argue that joint audio-visual scene analysis is necessary for tackling the difficult problem of computational auditory scene analysis (CASA), and that CASA stands to benefit from computer audio-visual scene analysis (CAVSA). We also propose a generative probabilistic model on the correlogram, the video representation of the audio signal, to separate the audio sources.

Original language: English (US)
Title of host publication: Speech Separation by Humans and Machines
Number of pages: 11
ISBN (Print): 1402080018, 9781402080012
State: Published - 2005
Externally published: Yes

ASJC Scopus subject areas

  • Engineering(all)

