TY - GEN
T1 - Separating time-frequency sources from time-domain convolutive mixtures using non-negative matrix factorization
AU - Leglaive, Simon
AU - Badeau, Roland
AU - Richard, Gael
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/12/7
Y1 - 2017/12/7
N2 - This paper addresses the problem of under-determined audio source separation in multichannel reverberant mixtures. We target a semiblind scenario assuming that the mixing filters are known. Source separation is performed from the time-domain mixture signals in order to accurately model the convolutive mixing process. The source signals are however modeled as latent variables in a time-frequency domain. In a previous paper we proposed to use the modified discrete cosine transform. The present paper generalizes the method to the use of the odd-frequency short-Time Fourier transform. In this domain, the source coefficients are modeled as centered complex Gaussian random variables whose variances are structured by means of a non-negative matrix factorization model. The inference procedure relies on a variational expectation-maximization algorithm. In the experiments we discuss the choice of the source representation and we show that the proposed approach outperforms two methods from the literature.
AB - This paper addresses the problem of under-determined audio source separation in multichannel reverberant mixtures. We target a semiblind scenario assuming that the mixing filters are known. Source separation is performed from the time-domain mixture signals in order to accurately model the convolutive mixing process. The source signals are however modeled as latent variables in a time-frequency domain. In a previous paper we proposed to use the modified discrete cosine transform. The present paper generalizes the method to the use of the odd-frequency short-Time Fourier transform. In this domain, the source coefficients are modeled as centered complex Gaussian random variables whose variances are structured by means of a non-negative matrix factorization model. The inference procedure relies on a variational expectation-maximization algorithm. In the experiments we discuss the choice of the source representation and we show that the proposed approach outperforms two methods from the literature.
KW - Audio source separation
KW - non-negative matrix factorization
KW - reverberant mixtures
KW - variational inference
U2 - 10.1109/WASPAA.2017.8170036
DO - 10.1109/WASPAA.2017.8170036
M3 - Conference contribution
AN - SCOPUS:85042387416
T3 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
SP - 264
EP - 268
BT - 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2017
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2017
Y2 - 15 October 2017 through 18 October 2017
ER -