Passer à la navigation principale Passer à la recherche Passer au contenu principal

SINGER IDENTITY REPRESENTATION LEARNING USING SELF-SUPERVISED TECHNIQUES

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Significant strides have been made in creating voice identity representations using speech data. However, the same level of progress has not been achieved for singing voices. To bridge this gap, we suggest a framework for training singer identity encoders to extract representations suitable for various singing-related tasks, such as singing voice similarity and synthesis. We explore different selfsupervised learning techniques on a large collection of isolated vocal tracks and apply data augmentations during training to ensure that the representations are invariant to pitch and content variations. We evaluate the quality of the resulting representations on singer similarity and identification tasks across multiple datasets, with a particular emphasis on out-of-domain generalization. Our proposed framework produces high-quality embeddings that outperform both speaker verification and wav2vec 2.0 pre-trained baselines on singing voice while operating at 44.1 kHz. We release our code and trained models to facilitate further research on singing voice and related areas.

langue originaleAnglais
titre24th International Society for Music Information Retrieval Conference, ISMIR 2023 - Proceedings
rédacteurs en chefAugusto Sarti, Fabio Antonacci, Mark Sandler, Paolo Bestagini, Simon Dixon, Beici Liang, Gael Richard, Johan Pauwels
EditeurInternational Society for Music Information Retrieval
Pages448-456
Nombre de pages9
ISBN (Electronique)9781732729933
étatPublié - 1 janv. 2023
Evénement24th International Society for Music Information Retrieval Conference, ISMIR 2023 - Milan, Italie
Durée: 5 nov. 20239 nov. 2023

Série de publications

Nom24th International Society for Music Information Retrieval Conference, ISMIR 2023 - Proceedings

Une conférence

Une conférence24th International Society for Music Information Retrieval Conference, ISMIR 2023
Pays/TerritoireItalie
La villeMilan
période5/11/239/11/23

Empreinte digitale

Examiner les sujets de recherche de « SINGER IDENTITY REPRESENTATION LEARNING USING SELF-SUPERVISED TECHNIQUES ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation