Passer à la navigation principale Passer à la recherche Passer au contenu principal

Leveraging deep neural networks with nonnegative representations for improved environmental sound classification

  • Université Paris-Saclay
  • Nancy Université
  • LORIA and INRIA Lorraine
  • LORIA Laboratoire Lorrain de Recherche en Informatique et ses Applications

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

This paper introduces the use of representations based on nonnegative matrix factorization (NMF) to train deep neural networks with applications to environmental sound classification. Deep learning systems for sound classification usually rely on the network to learn meaningful representations from spectrograms or hand-crafted features. Instead, we introduce a NMF-based feature learning stage before training deep networks, whose usefulness is highlighted in this paper, especially for multi-source acoustic environments such as sound scenes. We rely on two established unsupervised and supervised NMF techniques to learn better input representations for deep neural networks. This will allow us, with simple architectures, to reach competitive performance with more complex systems such as convolutional networks for acoustic scene classification. The proposed systems outperform neural networks trained on time-frequency representations on two acoustic scene classification datasets as well as the best systems from the 2016 DCASE challenge.

langue originaleAnglais
titre2017 IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2017 - Proceedings
rédacteurs en chefNaonori Ueda, Jen-Tzung Chien, Tomoko Matsui, Jan Larsen, Shinji Watanabe
EditeurIEEE Computer Society
Pages1-6
Nombre de pages6
ISBN (Electronique)9781509063413
Les DOIs
étatPublié - 5 déc. 2017
Modification externeOui
Evénement2017 IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2017 - Tokyo, Japon
Durée: 25 sept. 201728 sept. 2017

Série de publications

NomIEEE International Workshop on Machine Learning for Signal Processing, MLSP
Volume2017-September
ISSN (imprimé)2161-0363
ISSN (Electronique)2161-0371

Une conférence

Une conférence2017 IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2017
Pays/TerritoireJapon
La villeTokyo
période25/09/1728/09/17

Empreinte digitale

Examiner les sujets de recherche de « Leveraging deep neural networks with nonnegative representations for improved environmental sound classification ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation