Alpha-Stable Low-Rank Plus Residual Decomposition for Speech Enhancement

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this study, we propose a novel probabilistic model for separating clean speech signals from noisy mixtures by decomposing the mixture spectra into a structured speech part and a more flexible residual part. The main novelty in our model is that it uses a family of heavy-tailed distributions, so called the α-stable distributions, for modeling the residual signal. We develop an expectation-maximization algorithm for parameter estimation and a Monte Carlo scheme for posterior estimation of the clean speech. Our experiments show that the proposed method outperforms relevant factorization-based algorithms by a significant margin.

Original languageEnglish
Title of host publication2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages651-655
Number of pages5
ISBN (Print)9781538646588
DOIs
Publication statusPublished - 10 Sept 2018
Externally publishedYes
Event2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018 - Calgary, Canada
Duration: 15 Apr 201820 Apr 2018

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume2018-April
ISSN (Print)1520-6149

Conference

Conference2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018
Country/TerritoryCanada
CityCalgary
Period15/04/1820/04/18

Keywords

  • Alpha-stable distributions
  • Audio source separation
  • Monte Carlo Expectation-Maximization
  • Speech enhancement

Fingerprint

Dive into the research topics of 'Alpha-Stable Low-Rank Plus Residual Decomposition for Speech Enhancement'. Together they form a unique fingerprint.

Cite this