Don't hide in the frames: Note-and pattern-based evaluation of automated melody extraction algorithms

  • Klaus Frieler
  • , Doǧaç Basaran
  • , Frank Höger
  • , Hélène Camille Crayencour
  • , Geoffroy Peeters
  • , Simon Dixon

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper, we address how to evaluate and improve the performance of automatic dominant melody extraction systems from a pattern mining perspective with a focus on jazz improvisations. Traditionally, dominant melody extraction systems estimate the melody on the frame-level, but for real-world musicological applications note-level representations are needed. For the evaluation of estimated note tracks, the current frame-wise metrics are not fully suitable and provide at most a first approximation. Furthermore, miningmelodic patterns (n-grams) poses another challenge because note-wise errors propagate geometrically with increasing length of the pattern. On the other hand, for certain derived metrics such as pattern commonalities between performers, extraction errors might be less critical if at least qualitative rankings can be reproduced. Finally, while searching for similar patterns in a melody database the number of irrelevant patterns in the result set increases with lower similarity thresholds. For reasons of usability, it would be interesting to know the behavior using imperfect automated melody extractions. We propose three novel evaluation strategies for estimated note-tracks based on three application scenarios: Pattern mining, pattern commonalities, and fuzzy pattern search. We apply the proposed metrics to one general state-of-the-art melody estimation method (Melodia) and to two variants of an algorithm that was optimized for the extraction of jazz solos melodies. A subset of the Weimar Jazz Database with 91 solos was used for evaluation. Results show that the optimized algorithm clearly outperforms the reference algorithm, which quickly degrades and eventually breaks down for longer n-grams. Frame-wise metrics provide indeed an estimate for note-wise metrics, but only for sufficiently good extractions, whereas F1 scores for longer n-grams cannot be predicted from frame-wise F1 scores at all. The ranking of pattern commonalities between performers can be reproduced with the optimized algorithms but not with the reference algorithm. Finally, the size of result sets of pattern similarity searches decreases for automated note extraction and for larger similarity thresholds but the difference levels out for smaller thresholds.

Original languageEnglish
Title of host publicationProceedings of DLfM 2019
Subtitle of host publicationThe 6th International Conference on Digital Libraries for Musicology, a Satellite Event of ISMIR 2019
PublisherAssociation for Computing Machinery
Pages25-32
Number of pages8
ISBN (Electronic)9781450372084
DOIs
Publication statusPublished - 20 Nov 2019
Event6th International Conference on Digital Libraries for Musicology, DLfM 2019, a Satellite Event of ISMIR 2019 - The Hague, Netherlands
Duration: 9 Nov 2019 → …

Publication series

NameACM International Conference Proceeding Series

Conference

Conference6th International Conference on Digital Libraries for Musicology, DLfM 2019, a Satellite Event of ISMIR 2019
Country/TerritoryNetherlands
CityThe Hague
Period9/11/19 → …

Keywords

  • Automatic melody extraction
  • Evaluation
  • Jazz
  • Pattern mining

Fingerprint

Dive into the research topics of 'Don't hide in the frames: Note-and pattern-based evaluation of automated melody extraction algorithms'. Together they form a unique fingerprint.

Cite this