Passer à la navigation principale Passer à la recherche Passer au contenu principal

NMF with time-frequency activations to model nonstationary audio events

  • CNRS LTCI

Résultats de recherche: Contribution à un journalArticleRevue par des pairs

Résumé

Real-world sounds often exhibit time-varying spectral shapes, as observed in the spectrogram of a harpsichord tone or that of a transition between two pronounced vowels. Whereas the standard non-negative matrix factorization (NMF) assumes fixed spectral atoms, an extension is proposed where the temporal activations (coefficients of the decomposition on the spectral atom basis) become frequency dependent and follow a time-varying autoregressive moving average (ARMA) modeling. This extension can thus be interpreted with the help of a source/filter paradigm and is referred to as source/filter factorization. This factorization leads to an efficient single-atom decomposition for a single audio event with strong spectral variation (but with constant pitch). The new algorithm is tested on real audio data and shows promising results.

langue originaleAnglais
Numéro d'article5535132
Pages (de - à)744-753
Nombre de pages10
journalIEEE Transactions on Audio, Speech and Language Processing
Volume19
Numéro de publication4
Les DOIs
étatPublié - 21 févr. 2011
Modification externeOui

Empreinte digitale

Examiner les sujets de recherche de « NMF with time-frequency activations to model nonstationary audio events ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation