Abstract
Incorporating prior knowledge about the sources and/or the mixture is a way to improve under-determined audio source separation performance. A great number of informed source separation techniques concentrate on taking priors on the sources into account, but fewer works have focused on constraining the mixing model. In this paper, we address the problem of underdetermined multichannel audio source separation in reverberant conditions. We target a semi-informed scenario where some room parameters are known. Two probabilistic priors on the frequency response of the mixing filters are proposed. Early reverberation is characterized by an autoregressive model while according to statistical room acoustics results, late reverberation is represented by an autoregressive moving average model. Both reverberation models are defined in the frequency domain. They aim to transcribe the temporal characteristics of the mixing filters into frequency-domain correlations. Our approach leads to a maximum a posteriori estimation of the mixing filters which is achieved thanks to the expectation-maximization algorithm. We experimentally show the superiority of this approach compared with a maximum likelihood estimation of the mixing filters.
| Original language | English |
|---|---|
| Pages (from-to) | 2453-2465 |
| Number of pages | 13 |
| Journal | IEEE/ACM Transactions on Audio Speech and Language Processing |
| Volume | 24 |
| Issue number | 12 |
| DOIs | |
| Publication status | Published - 1 Dec 2016 |
| Externally published | Yes |
Keywords
- EM algorithm
- MAP estimation
- Multichannel audio source separation
- mixing model
- probabilistic priors
Fingerprint
Dive into the research topics of 'Multichannel Audio Source Separation with Probabilistic Reverberation Priors'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver