Passer à la navigation principale Passer à la recherche Passer au contenu principal

Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification

  • Université Paris-Saclay

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

This paper presents a feature learning approach for speaker identification that is based on nonnegative matrix factorisation. Recent studies have shown that with such models, the dictionary atoms can represent well the speaker identity. The approaches proposed so far focused only on speaker variability and not on session variability. However, this later point is a crucial aspect in the success of the I-vector approach that is now the state-of-the-art in speaker identification. This paper proposes a method that relies on group nonnegative matrix factorisation and that is inspired by the I-vector training procedure. By doing so the proposed approach intends to capture both the speaker variability and the session variability. Results on a small corpus prove that the proposed approach can be competitive with I-vectors.

langue originaleAnglais
titre2016 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016 - Proceedings
EditeurInstitute of Electrical and Electronics Engineers Inc.
Pages5470-5474
Nombre de pages5
ISBN (Electronique)9781479999880
Les DOIs
étatPublié - 18 mai 2016
Modification externeOui
Evénement41st IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016 - Shanghai, Chine
Durée: 20 mars 201625 mars 2016

Série de publications

NomICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume2016-May
ISSN (imprimé)1520-6149

Une conférence

Une conférence41st IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016
Pays/TerritoireChine
La villeShanghai
période20/03/1625/03/16

Empreinte digitale

Examiner les sujets de recherche de « Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation