TY - GEN
T1 - Semi-blind student's t source separation for multichannel audio convolutive mixtures
AU - Leglaive, Simon
AU - Badeau, Roland
AU - Richard, Gaël
N1 - Publisher Copyright:
© EURASIP 2017.
PY - 2017/10/23
Y1 - 2017/10/23
N2 - This paper addresses the problem of multichannel audio source separation in under-determined convolutive mixtures. We target a semi-blind scenario assuming that the mixing filters are known. The convolutive mixing process is exactly modeled using the time-domain impulse responses of the mixing filters. We propose a Student's t time-frequency source model based on non-negative matrix factorization (NMF). The Student's t distribution being heavy-tailed with respect to the Gaussian, it provides some flexibility in the modeling of the sources. We also study a simpler Student's t sparse source model within the same general source separation framework. The inference procedure relies on a variational expectationmaximization algorithm. Experiments show the advantage of using an NMF model compared with the sparse source model. While the Student's t NMF source model leads to slightly better results than our previous Gaussian one, we demonstrate the superiority of our method over two other approaches from the literature.
AB - This paper addresses the problem of multichannel audio source separation in under-determined convolutive mixtures. We target a semi-blind scenario assuming that the mixing filters are known. The convolutive mixing process is exactly modeled using the time-domain impulse responses of the mixing filters. We propose a Student's t time-frequency source model based on non-negative matrix factorization (NMF). The Student's t distribution being heavy-tailed with respect to the Gaussian, it provides some flexibility in the modeling of the sources. We also study a simpler Student's t sparse source model within the same general source separation framework. The inference procedure relies on a variational expectationmaximization algorithm. Experiments show the advantage of using an NMF model compared with the sparse source model. While the Student's t NMF source model leads to slightly better results than our previous Gaussian one, we demonstrate the superiority of our method over two other approaches from the literature.
KW - Multichannel convolutive mixture
KW - Nonnegative matrix factorization
KW - Student's t distribution
KW - Under-determined audio source separation
KW - Variational inference.
U2 - 10.23919/EUSIPCO.2017.8081612
DO - 10.23919/EUSIPCO.2017.8081612
M3 - Conference contribution
AN - SCOPUS:85041491703
T3 - 25th European Signal Processing Conference, EUSIPCO 2017
SP - 2259
EP - 2263
BT - 25th European Signal Processing Conference, EUSIPCO 2017
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 25th European Signal Processing Conference, EUSIPCO 2017
Y2 - 28 August 2017 through 2 September 2017
ER -