DNN-based distributed multichannel mask estimation for speech enhancement in microphone arrays

  • Nicolas Furnon
  • Romain Serizel
  • Irina Illina
  • Slim Essid

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Multichannel processing is widely used for speech enhancement, but several limitations appear when deploying these solutions in the real world. Distributed sensor arrays, which combine several devices each carrying a few microphones, are a viable solution that exploits the many microphone-equipped devices we use in everyday life. In this context, we propose to extend the distributed adaptive node-specific signal estimation (DANSE) approach to a neural network framework. At each node, local filtering produces a single signal that is sent to the other nodes, where a neural network estimates a mask used to compute a global multichannel Wiener filter. In an array of two nodes, we show that this additional signal can be leveraged to predict the masks and leads to better speech enhancement performance than when the mask estimation relies only on the local signals.
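The two-step scheme described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names (`masked_covariances`, `mwf`, `energy_mask`, `two_node_step`), the array shapes, the diagonal loading, and in particular `energy_mask` (a crude energy-based stand-in for the neural network mask estimator) are all assumptions made for illustration. The sketch only shows how a time-frequency mask yields speech and noise covariance estimates, how these give a multichannel Wiener filter, and how each of two nodes could append the single signal received from the other node to its local channels.

```python
import numpy as np


def masked_covariances(Y, mask, eps=1e-8):
    """Mask-weighted speech/noise covariances.

    Y: complex STFT, shape (channels, freqs, frames).
    mask: speech mask in [0, 1], shape (freqs, frames).
    Returns two arrays of shape (freqs, channels, channels).
    """
    # Outer product y y^H for every time-frequency bin: (F, T, C, C)
    outer = np.einsum("cft,dft->ftcd", Y, Y.conj())
    speech_w = mask[..., None, None]
    noise_w = (1.0 - mask)[..., None, None]
    R_ss = (speech_w * outer).sum(axis=1) / (mask.sum(axis=1)[:, None, None] + eps)
    R_nn = (noise_w * outer).sum(axis=1) / ((1.0 - mask).sum(axis=1)[:, None, None] + eps)
    return R_ss, R_nn


def mwf(Y, mask, ref=0, diag_load=1e-6):
    """Multichannel Wiener filter output for reference channel `ref`."""
    R_ss, R_nn = masked_covariances(Y, mask)
    n_ch = Y.shape[0]
    # Diagonal loading (an assumption) keeps the solve well conditioned.
    R_yy = R_ss + R_nn + diag_load * np.eye(n_ch)
    # W[f] = R_yy[f]^{-1} R_ss[f] e_ref, shape (F, C)
    W = np.linalg.solve(R_yy, R_ss[:, :, ref:ref + 1])[..., 0]
    # Filter output w^H y per bin, shape (F, T)
    return np.einsum("fc,cft->ft", W.conj(), Y)


def energy_mask(Y):
    """Hypothetical placeholder for the DNN: energy-based soft mask."""
    power = np.abs(Y[0]) ** 2
    return power / (power + np.median(power) + 1e-12)


def two_node_step(Y1, Y2, mask_fn):
    """One exchange between two nodes (assumed simplification)."""
    # Step 1: each node filters its local channels into one signal to send.
    z1 = mwf(Y1, mask_fn(Y1))
    z2 = mwf(Y2, mask_fn(Y2))
    # Step 2: each node stacks its local channels with the received signal,
    # estimates a mask from the augmented input, and applies the MWF.
    Y1_aug = np.concatenate([Y1, z2[None]], axis=0)
    Y2_aug = np.concatenate([Y2, z1[None]], axis=0)
    return mwf(Y1_aug, mask_fn(Y1_aug)), mwf(Y2_aug, mask_fn(Y2_aug))
```

In this sketch `mask_fn` is the only place where a trained network would plug in; replacing `energy_mask` with a model that maps the augmented STFT to a mask of shape (freqs, frames) would leave the rest unchanged.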

Original language: English
Title of host publication: 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 4672-4676
Number of pages: 5
ISBN (Electronic): 9781509066315
DOIs
Publication status: Published - 1 May 2020
Event: 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020 - Barcelona, Spain
Duration: 4 May 2020 → 8 May 2020

Publication series

Name: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume: 2020-May
ISSN (Print): 1520-6149

Conference

Conference: 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020
Country/Territory: Spain
City: Barcelona
Period: 4/05/20 → 8/05/20

Keywords

  • Distributed processing
  • Microphone arrays
  • Speech enhancement

