Cross-Modal Music-Video Recommendation: A Study of Design Choices

Research output: Chapter in book, report, anthology or collection › Conference contribution › Peer-reviewed

Abstract

In this work, we study music/video cross-modal recommendation, i.e. recommending a music track for a video or vice versa. We rely on a self-supervised learning paradigm to learn from a large amount of unlabelled data. More precisely, we jointly learn audio and video embeddings by using their co-occurrence in music-video clips. We build upon a recent video-music retrieval system (the VM-NET), which originally relies on an audio representation obtained from a set of statistics computed over handcrafted features. We demonstrate here that using learned audio representations, such as the audio embeddings provided by the pre-trained MuSimNet, OpenL3, MusicCNN or by AudioSet, largely improves recommendations. We also validate the use of the cross-modal triplet loss originally proposed in the VM-NET, comparing it with the binary cross-entropy loss commonly used in self-supervised learning. We perform all our experiments using the Music Video Dataset (MVD).
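
The contrast between the two training objectives named in the abstract can be sketched as follows. This is a minimal NumPy illustration under stated assumptions, not the VM-NET code or the paper's implementation: the function names, the use of cosine similarity, the margin of 0.2, the logit scale of 5.0 and the in-batch negative sampling are all hypothetical choices made for the example.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit Euclidean norm."""
    return x / np.maximum(np.linalg.norm(x, axis=1, keepdims=True), eps)


def cross_modal_triplet_loss(video, audio, margin=0.2):
    """Hinge-based triplet loss over both retrieval directions.

    video, audio: (batch, dim) arrays; row i of each comes from the same clip.
    Every other row in the batch serves as a negative (an assumption here).
    """
    v, a = l2_normalize(video), l2_normalize(audio)
    sim = v @ a.T                 # sim[i, j] = cosine(video_i, audio_j)
    pos = np.diag(sim)            # similarity of the co-occurring pairs
    # video anchor i, positive audio i, negative audio j
    v2a = np.maximum(0.0, margin + sim - pos[:, None])
    # audio anchor j, positive video j, negative video i
    a2v = np.maximum(0.0, margin + sim - pos[None, :])
    off_diag = ~np.eye(len(sim), dtype=bool)
    return (v2a[off_diag].mean() + a2v[off_diag].mean()) / 2


def pairwise_bce_loss(video, audio, scale=5.0):
    """Binary cross-entropy on 'do these two items co-occur?' for all pairs."""
    v, a = l2_normalize(video), l2_normalize(audio)
    logits = scale * (v @ a.T)
    labels = np.eye(len(logits))  # 1 on matching pairs, 0 elsewhere
    # Numerically stable BCE with logits: max(x, 0) - x*y + log(1 + exp(-|x|))
    loss = (np.maximum(logits, 0.0) - logits * labels
            + np.log1p(np.exp(-np.abs(logits))))
    return loss.mean()


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    video_emb = rng.normal(size=(8, 16))   # stand-in video embeddings
    audio_emb = rng.normal(size=(8, 16))   # stand-in audio embeddings
    print("triplet:", cross_modal_triplet_loss(video_emb, audio_emb))
    print("bce:    ", pairwise_bce_loss(video_emb, audio_emb))
```

The triplet loss only asks that a matching pair score higher than a mismatched one by the margin, whereas the cross-entropy loss asks each pair to be classified as co-occurring or not, which is one way to read the difference between the two objectives the abstract compares.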

Original language: English
Title: IJCNN 2021 - International Joint Conference on Neural Networks, Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9780738133669
DOIs
State: Published - 18 Jul 2021
Event: 2021 International Joint Conference on Neural Networks, IJCNN 2021 - Virtual, Online, China
Duration: 18 Jul 2021 to 22 Jul 2021

Publication series

Name: Proceedings of the International Joint Conference on Neural Networks
Volume: 2021-July
ISSN (Print): 2161-4393
ISSN (Electronic): 2161-4407

Conference

Conference: 2021 International Joint Conference on Neural Networks, IJCNN 2021
Country/Territory: China
City: Virtual, Online
Period: 18/07/21 to 22/07/21
