Résumé
Real-world sounds often exhibit time-varying spectral shapes, as observed in the spectrogram of a harpsichord tone or that of a transition between two pronounced vowels. Whereas the standard non-negative matrix factorization (NMF) assumes fixed spectral atoms, an extension is proposed where the temporal activations (coefficients of the decomposition on the spectral atom basis) become frequency dependent and follow a time-varying autoregressive moving average (ARMA) modeling. This extension can thus be interpreted with the help of a source/filter paradigm and is referred to as source/filter factorization. This factorization leads to an efficient single-atom decomposition for a single audio event with strong spectral variation (but with constant pitch). The new algorithm is tested on real audio data and shows promising results.
| langue originale | Anglais |
|---|---|
| Numéro d'article | 5535132 |
| Pages (de - à) | 744-753 |
| Nombre de pages | 10 |
| journal | IEEE Transactions on Audio, Speech and Language Processing |
| Volume | 19 |
| Numéro de publication | 4 |
| Les DOIs | |
| état | Publié - 21 févr. 2011 |
| Modification externe | Oui |
Empreinte digitale
Examiner les sujets de recherche de « NMF with time-frequency activations to model nonstationary audio events ». Ensemble, ils forment une empreinte digitale unique.Contient cette citation
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver