Student's t Source and Mixing Models for Multichannel Audio Source Separation

Simon Leglaive, Roland Badeau, Gael Richard

Research output: Contribution to journalArticlepeer-review

Abstract

This paper presents a Bayesian framework for under-determined audio source separation in multichannel reverberant mixtures. We model the source signals as Student's t latent random variables in a time-frequency domain. The specific structure of musical signals in this domain is exploited by means of a nonnegative matrix factorization model. Conversely, we design the mixing model in the time domain. In addition to leading to an exact representation of the convolutive mixing process, this approach allows us to develop simple probabilistic priors for the mixing filters. Indeed, as those filters correspond to room responses they exhibit a simple characteristic structure in the time domain that can be used to guide their estimation. We also rely on the Student's t distribution for modeling the impulse response of the mixing filters. From this model, we develop a variational inference algorithm in order to perform source separation. The experimental evaluation demonstrates the potential of this approach for separating multichannel reverberant mixtures.

Original languageEnglish
Pages (from-to)1150-1164
Number of pages15
JournalIEEE/ACM Transactions on Audio Speech and Language Processing
Volume26
Issue number6
DOIs
Publication statusPublished - 1 Jun 2018
Externally publishedYes

Keywords

  • Audio source separation
  • Student's t distribution
  • multichannel reverberant mixtures
  • non-negative matrix factorization
  • statistical room acoustics
  • variational inference

Fingerprint

Dive into the research topics of 'Student's t Source and Mixing Models for Multichannel Audio Source Separation'. Together they form a unique fingerprint.

Cite this