NMF with time-frequency activations to model non stationary audio events

Romain Hennequin, Roland Badeau, Bertrand David

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Real world sounds often exhibit non-stationary spectral characteristics such as those produced by a harpsichord or a guitar. The classical Non-negative Matrix Factorization (NMF) needs a number of atoms to accurately decompose the spectrogram of such sounds. An extension of NMF is proposed hereafter which includes time-frequency activations based on ARMA modeling. This leads to an efficient single-atom decomposition for a single audio event. The new algorithm is tested on real audio data and shows promising results.

Original languageEnglish
Title of host publication2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages445-448
Number of pages4
ISBN (Print)9781424442966
DOIs
Publication statusPublished - 1 Jan 2010
Externally publishedYes
Event2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 - Dallas, TX, United States
Duration: 14 Mar 201019 Mar 2010

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print)1520-6149

Conference

Conference2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010
Country/TerritoryUnited States
CityDallas, TX
Period14/03/1019/03/10

Keywords

  • Music information retrieval
  • Non-negative matrix factorization
  • Unsupervised machine learning

Fingerprint

Dive into the research topics of 'NMF with time-frequency activations to model non stationary audio events'. Together they form a unique fingerprint.

Cite this