Passer à la navigation principale Passer à la recherche Passer au contenu principal

Learning Multi-Pitch Estimation from Weakly Aligned Score-Audio Pairs Using a Multi-Label CTC Loss

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Detecting the simultaneous activity of pitches in music audio recordings is a central task within music processing, commonly known as multi-pitch estimation or frame-wise polyphonic music transcription. Deep-learning approaches recently achieved major improvements for this task, but the lack of annotated, large-size datasets beyond the piano solo scenario is still a limitation for fully exploiting their potential. In this paper, we propose a strategy for training a CNN-based multi-pitch estimator on weakly aligned score-audio pairs of pieces in different instrumentations. To this end, we make use of a multi-label variant of the connectionist temporal classification loss (MCTC), recently proposed for image recognition tasks. We re-formalize the MCTC loss to be applicable for multi-pitch estimation and perform several systematic experiments to analyze its behavior and robustness to training conditions. Finally, we report on multi-pitch estimation results for common datasets using weakly aligned training with MCTC, which performs similar than systems trained on strongly aligned scores.

langue originaleAnglais
titre2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2021
EditeurInstitute of Electrical and Electronics Engineers Inc.
Pages121-125
Nombre de pages5
ISBN (Electronique)9781665448703
Les DOIs
étatPublié - 1 janv. 2021
Evénement2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2021 - New Paltz, États-Unis
Durée: 17 oct. 202120 oct. 2021

Série de publications

NomIEEE Workshop on Applications of Signal Processing to Audio and Acoustics
Volume2021-October
ISSN (imprimé)1931-1168
ISSN (Electronique)1947-1629

Une conférence

Une conférence2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2021
Pays/TerritoireÉtats-Unis
La villeNew Paltz
période17/10/2120/10/21

Empreinte digitale

Examiner les sujets de recherche de « Learning Multi-Pitch Estimation from Weakly Aligned Score-Audio Pairs Using a Multi-Label CTC Loss ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation