Communication-cost aware microphone selection for neural speech enhancement with ad-hoc microphone arrays

Jonah Casebeer, Jamshed Kaikaus, Paris Smaragdis

Research output: Contribution to journalConference articlepeer-review

Abstract

In this paper, we present a method for jointly-learning a microphone selection mechanism and a speech enhancement network for multi-channel speech enhancement with an ad-hoc microphone array. The attention-based microphone selection mechanism is trained to reduce communication-costs through a penalty term which represents a task-performance/communication-cost trade-off. While working within the trade-off, our method can intelligently stream from more microphones in lower SNR scenes and fewer microphones in higher SNR scenes. We evaluate the model in complex echoic acoustic scenes with moving sources and show that it matches the performance of models that stream from a fixed number of microphones while reducing communication costs.

Original languageEnglish (US)
Pages (from-to)8438-8442
Number of pages5
JournalICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume2021-June
DOIs
StatePublished - 2021
Event2021 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2021 - Virtual, Toronto, Canada
Duration: Jun 6 2021Jun 11 2021

Keywords

  • Ad-hoc microphone array
  • Beamforming
  • Deep learning
  • Sensor selection
  • Speech enhancement

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Communication-cost aware microphone selection for neural speech enhancement with ad-hoc microphone arrays'. Together they form a unique fingerprint.

Cite this