TY - GEN
T1 - THE WORDS REMAIN THE SAME
T2 - 22nd International Conference on Music Information Retrieval, ISMIR 2021
AU - Vaglio, Andrea
AU - Hennequin, Romain
AU - Moussallam, Manuel
AU - Richard, Gaël
N1 - Publisher Copyright:
© 2021 Proceedings of the 22nd International Conference on Music Information Retrieval, ISMIR 2021. All Rights Reserved.
PY - 2021/1/1
Y1 - 2021/1/1
N2 - Cover detection has gained sustained interest in the scientific community and has recently made significant progress both in terms of scalability and accuracy. However, most approaches are based on the estimation of harmonic and melodic features and neglect lyrics information although it is an important invariant across covers. In this work, we propose a novel approach leveraging lyrics without requiring access to full texts though the use of lyrics recognition on audio. Our approach relies on the fusion of a singing voice recognition framework and a more classic tonal-based cover detection method. To the best of our knowledge, this is the first time that lyrics estimation from audio has been explicitly used for cover detection. Furthermore, we exploit efficient string matching and an approximated nearest neighbors search algorithm which lead to a scalable system which is able to operate on very large databases. Extensive experiments on the largest publicly available cover detection dataset demonstrate the validity of using lyrics information for this task.
AB - Cover detection has gained sustained interest in the scientific community and has recently made significant progress both in terms of scalability and accuracy. However, most approaches are based on the estimation of harmonic and melodic features and neglect lyrics information although it is an important invariant across covers. In this work, we propose a novel approach leveraging lyrics without requiring access to full texts though the use of lyrics recognition on audio. Our approach relies on the fusion of a singing voice recognition framework and a more classic tonal-based cover detection method. To the best of our knowledge, this is the first time that lyrics estimation from audio has been explicitly used for cover detection. Furthermore, we exploit efficient string matching and an approximated nearest neighbors search algorithm which lead to a scalable system which is able to operate on very large databases. Extensive experiments on the largest publicly available cover detection dataset demonstrate the validity of using lyrics information for this task.
M3 - Conference contribution
AN - SCOPUS:85140867937
T3 - Proceedings of the 22nd International Conference on Music Information Retrieval, ISMIR 2021
SP - 714
EP - 721
BT - Proceedings of the 22nd International Conference on Music Information Retrieval, ISMIR 2021
PB - International Society for Music Information Retrieval
Y2 - 7 November 2021 through 12 November 2021
ER -