Multichannel source separation and tracking with RANSAC and directional statistics

Johannes Traa, Paris Smaragdis

Research output: Contribution to journalArticlepeer-review

Abstract

We describe multichannel blind source separation and tracking algorithms based on clustering wrapped interchannel phase difference (IPD) features. We pose the clustering problem as one of multimodal circular-linear regression and present its probabilistic formulation. Phase wrapping due to spatial aliasing is explicitly incorporated by modeling the IPD features as circular variables. We present two methods based on Expectation-Maximization (EM) and a sequential variant of RANdom SAmple Consensus (RANSAC). We show that their strengths can be combined by using RANSAC to initialize EM. The IPD clustering algorithm is applied to separate stationary speakers from a multichannel mixture.We then extend it to the case of moving speakers by tracking their directions-of-arrival with the FactorialWrapped Kalman Filter (FWKF) using RANSAC as a data preprocessor. Experimental results demonstrate that the proposed methods perform well in the presence of reverberant babble noise and spatial aliasing. The FWKF successfully tracks and separates moving speakers with separation quality comparable to that for stationary speakers.

Original languageEnglish (US)
Pages (from-to)2233-2243
Number of pages11
JournalIEEE/ACM Transactions on Audio Speech and Language Processing
Volume22
Issue number12
DOIs
StatePublished - Dec 1 2014

Keywords

  • Blind source separation (BSS)
  • Directional statistics
  • Interchannel phase difference (IPD)
  • Wrapped Kalman filter

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Acoustics and Ultrasonics
  • Computational Mathematics
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'Multichannel source separation and tracking with RANSAC and directional statistics'. Together they form a unique fingerprint.

Cite this