Cover detection using dominant melody embeddings

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Automatic cover detection - the task of finding in an audio database all the covers of one or several query tracks- has long been seen as a challenging theoretical problem in the MIR community and as an acute practical problem for authors and composers societies. Original algorithms proposed for this task have proven their accuracy on small datasets, but are unable to scale up to modern real-life audio corpora. On the other hand, faster approaches designed to process thousands of pairwise comparisons resulted in lower accuracy, making them unsuitable for practical use. In this work, we propose a neural network architecture that is trained to represent each track as a single embedding vector. The computation burden is therefore left to the embedding extraction - that can be conducted offline and stored, while the pairwise comparison task reduces to a simple Euclidean distance computation. We further propose to extract each track's embedding out of its dominant melody representation, obtained by another neural network trained for this task. We then show that this architecture improves state-of-the-art accuracy both on small and large datasets, and is able to scale to query databases of thousands of tracks in a few seconds.

Original languageEnglish
Title of host publicationProceedings of the 20th International Society for Music Information Retrieval Conference, ISMIR 2019
EditorsArthur Flexer, Geoffroy Peeters, Julian Urbano, Anja Volk
PublisherInternational Society for Music Information Retrieval
Pages107-114
Number of pages8
ISBN (Electronic)9781732729919
Publication statusPublished - 1 Jan 2019
Event20th International Society for Music Information Retrieval Conference, ISMIR 2019 - Delft, Netherlands
Duration: 4 Nov 20198 Nov 2019

Publication series

NameProceedings of the 20th International Society for Music Information Retrieval Conference, ISMIR 2019

Conference

Conference20th International Society for Music Information Retrieval Conference, ISMIR 2019
Country/TerritoryNetherlands
CityDelft
Period4/11/198/11/19

Fingerprint

Dive into the research topics of 'Cover detection using dominant melody embeddings'. Together they form a unique fingerprint.

Cite this