Speech separation using partially asynchronous microphone arrays without resampling

Ryan M. Corey, Andrew C. Singer

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We consider the problem of separating speech sources captured by multiple spatially separated devices, each of which has multiple microphones and samples its signals at a slightly different rate. Most asynchronous array processing methods rely on sample rate offset estimation and resampling, but these offsets can be difficult to estimate if the sources or microphones are moving. We propose a source separation method that does not require offset estimation or signal resampling. Instead, we divide the distributed array into several synchronous subarrays. All arrays are used jointly to estimate the time-varying signal statistics, and those statistics are used to design separate time-varying spatial filters in each array. We demonstrate the method for speech mixtures recorded on both stationary and moving microphone arrays.

Original languageEnglish (US)
Title of host publication16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages111-115
Number of pages5
ISBN (Electronic)9781538681510
DOIs
StatePublished - Nov 2 2018
Event16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018 - Tokyo, Japan
Duration: Sep 17 2018Sep 20 2018

Publication series

Name16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018 - Proceedings

Other

Other16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018
Country/TerritoryJapan
CityTokyo
Period9/17/189/20/18

Keywords

  • Ad hoc microphone array
  • Asynchronous microphone array
  • Audio source separation
  • Distributed arrays
  • Sampling rate offset
  • Spatial filtering
  • Speech enhancement

ASJC Scopus subject areas

  • Signal Processing
  • Acoustics and Ultrasonics

Fingerprint

Dive into the research topics of 'Speech separation using partially asynchronous microphone arrays without resampling'. Together they form a unique fingerprint.

Cite this