Passer à la navigation principale Passer à la recherche Passer au contenu principal

Cauchy multichannel speech enhancement with a deep speech prior

  • Mathieu Fontaine
  • , Aditya Arie Nugraha
  • , Roland Badeau
  • , Kazuyoshi Yoshii
  • , Antoine Liutkus
  • Nancy Université
  • RIKEN AIP
  • Université Paris-Saclay
  • Kyoto University
  • DALI/LIRMM

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

We propose a semi-supervised multichannel speech enhancement system based on a probabilistic model which assumes that both speech and noise follow the heavy-tailed multivariate complex Cauchy distribution. As we advocate, this allows handling strong and adverse noisy conditions. Consequently, the model is parameterized by the source magnitude spectrograms and the source spatial scatter matrices. To deal with the nonadditivity of scatter matrices, our first contribution is to perform the enhancement on a projected space. Then, our second contribution is to combine a latent variable model for speech, which is trained by following the variational autoencoder framework, with a low-rank model for the noise source. At test time, an iterative inference algorithm is applied, which produces estimated parameters to use for separation. The speech latent variables are estimated first from the noisy speech and then updated by a gradient descent method, while a majorization-equalization strategy is used to update both the noise and the spatial parameters of both sources. Our experimental results show that the Cauchy model outperforms the state-of-art methods. The standard deviation scores also reveal that the proposed method is more robust against non-stationary noise.

langue originaleAnglais
titreEUSIPCO 2019 - 27th European Signal Processing Conference
EditeurEuropean Signal Processing Conference, EUSIPCO
ISBN (Electronique)9789082797039
Les DOIs
étatPublié - 1 sept. 2019
Modification externeOui
Evénement27th European Signal Processing Conference, EUSIPCO 2019 - A Coruna, Espagne
Durée: 2 sept. 20196 sept. 2019

Série de publications

NomEuropean Signal Processing Conference
Volume2019-September
ISSN (imprimé)2219-5491

Une conférence

Une conférence27th European Signal Processing Conference, EUSIPCO 2019
Pays/TerritoireEspagne
La villeA Coruna
période2/09/196/09/19

Empreinte digitale

Examiner les sujets de recherche de « Cauchy multichannel speech enhancement with a deep speech prior ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation