Passer à la navigation principale Passer à la recherche Passer au contenu principal

Model-Based STFT Phase Recovery for Audio Source Separation

  • CNRS LTCI
  • University of Tamper

Résultats de recherche: Contribution à un journalArticleRevue par des pairs

Résumé

For audio source separation applications, it is common to estimate the magnitude of the short-time Fourier transform (STFT) of each source. In order to further synthesize time-domain signals, it is necessary to recover the phase of the corresponding complex-valued STFT. Most authors in this field choose a Wiener-like filtering approach, which boils down to use the phase of the original mixture. In this paper, a different standpoint is adopted. Many music events are partially composed of slowly varying sinusoids and the STFT phase increment over time of those frequency components takes a specific form. This allows phase recovery by an unwrapping technique once a short-term frequency estimate has been obtained. Herein, a novel iterative source separation procedure is proposed that builds upon these results. It consists in minimizing the mixing error by means of the auxiliary function method. This procedure is initialized by exploiting the unwrapping technique in order to generate estimates that benefit from a temporal continuity property. Experiments conducted on realistic music pieces show that, given accurate magnitude estimates, this procedure outperforms the state-of-the-art consistent Wiener filter.

langue originaleAnglais
Pages (de - à)1091-1101
Nombre de pages11
journalIEEE/ACM Transactions on Audio Speech and Language Processing
Volume26
Numéro de publication6
Les DOIs
étatPublié - 1 juin 2018
Modification externeOui

Empreinte digitale

Examiner les sujets de recherche de « Model-Based STFT Phase Recovery for Audio Source Separation ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation