The Deep Learning Revolution in MIR: The Pros and Cons, the Needs and the Challenges

  • CNRS LTCI

Research output: Chapter in a book, report, anthology or collection; Conference contribution; Peer-reviewed

Abstract

This paper deals with the deep learning revolution in Music Information Research (MIR), i.e. the switch from knowledge-driven hand-crafted systems to data-driven deep-learning systems. To discuss the pros and cons of this revolution, we first review the basic elements of deep learning and explain how they can be used for audio feature learning or for solving difficult MIR tasks. We then discuss the case of hand-crafted features and demonstrate that, while these were indeed shallow and explainable at the start, they tended to become deep, data-driven and unexplainable over time, even before the reign of deep learning. The development of these data-driven approaches was made possible by increasing access to large annotated datasets. We therefore argue that these annotated datasets are today the central and most sustainable element of any MIR research. We propose new ways to obtain them at scale. Finally, we highlight a set of challenges to be faced by the deep learning revolution in MIR, especially concerning the consideration of music specificities, the explainability of the models (X-AI) and their environmental cost (Green-AI).

Original language: English
Title of host publication: Perception, Representations, Image, Sound, Music - 14th International Symposium, CMMR 2019, Revised Selected Papers
Editors: Richard Kronland-Martinet, Sølvi Ystad, Mitsuko Aramaki
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 3-30
Number of pages: 28
ISBN (print): 9783030702090
Status: Published - 1 Jan 2021
Externally published: Yes
Event: 14th International Symposium on Perception, Representations, Image, Sound, Music, CMMR 2019 - Marseille, France
Duration: 14 Oct 2019 to 18 Oct 2019

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 12631 LNCS
ISSN (print): 0302-9743
ISSN (electronic): 1611-3349

Conference

Conference: 14th International Symposium on Perception, Representations, Image, Sound, Music, CMMR 2019
Country/Territory: France
City: Marseille
Period: 14/10/19 to 18/10/19
