Passer à la navigation principale Passer à la recherche Passer au contenu principal

Hybrid Projective Nonnegative Matrix Factorization with Drum Dictionaries for Harmonic/Percussive Source Separation

  • Clement Laroche
  • , Matthieu Kowalski
  • , Helene Papadopoulos
  • , Gael Richard
  • Université Paris-Saclay
  • L2S, CNRS, Univ Paris-Sud

Résultats de recherche: Contribution à un journalArticleRevue par des pairs

Résumé

One of the most general models of music signals considers that such signals can be represented as a sum of two distinct components: a tonal part that is sparse in frequency and temporally stable and a transient (or percussive) part that is composed of short-term broadband sounds. In this paper, we propose a novel hybrid method built upon nonnegative matrix factorization (NMF) that decomposes the time frequency representation of an audio signal into such two components. The tonal part is estimated by a sparse and orthogonal nonnegative decomposition, and the transient part is estimated by a straightforward NMF decomposition constrained by a pre-learned dictionary of smooth spectra. The optimization problem at the heart of our method remains simple with very few hyperparameters and can be solved thanks to simple multiplicative update rules. The extensive benchmark on a large and varied music database against four state of the art harmonic/percussive source separation algorithms demonstrate the merit of the proposed approach.

langue originaleAnglais
Pages (de - à)1499-1511
Nombre de pages13
journalIEEE/ACM Transactions on Audio Speech and Language Processing
Volume26
Numéro de publication9
Les DOIs
étatPublié - 1 sept. 2018
Modification externeOui

Empreinte digitale

Examiner les sujets de recherche de « Hybrid Projective Nonnegative Matrix Factorization with Drum Dictionaries for Harmonic/Percussive Source Separation ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation