Separating time-frequency sources from time-domain convolutive mixtures using non-negative matrix factorization

Simon Leglaive, Roland Badeau, Gael Richard

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper addresses the problem of under-determined audio source separation in multichannel reverberant mixtures. We target a semiblind scenario assuming that the mixing filters are known. Source separation is performed from the time-domain mixture signals in order to accurately model the convolutive mixing process. The source signals are however modeled as latent variables in a time-frequency domain. In a previous paper we proposed to use the modified discrete cosine transform. The present paper generalizes the method to the use of the odd-frequency short-Time Fourier transform. In this domain, the source coefficients are modeled as centered complex Gaussian random variables whose variances are structured by means of a non-negative matrix factorization model. The inference procedure relies on a variational expectation-maximization algorithm. In the experiments we discuss the choice of the source representation and we show that the proposed approach outperforms two methods from the literature.

Original languageEnglish
Title of host publication2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages264-268
Number of pages5
ISBN (Electronic)9781538616321
DOIs
Publication statusPublished - 7 Dec 2017
Externally publishedYes
Event2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2017 - New Paltz, United States
Duration: 15 Oct 201718 Oct 2017

Publication series

NameIEEE Workshop on Applications of Signal Processing to Audio and Acoustics
Volume2017-October

Conference

Conference2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2017
Country/TerritoryUnited States
CityNew Paltz
Period15/10/1718/10/17

Keywords

  • Audio source separation
  • non-negative matrix factorization
  • reverberant mixtures
  • variational inference

Fingerprint

Dive into the research topics of 'Separating time-frequency sources from time-domain convolutive mixtures using non-negative matrix factorization'. Together they form a unique fingerprint.

Cite this