Passer à la navigation principale Passer à la recherche Passer au contenu principal

Learning word embeddings: Unsupervised methods for fixed-size representations of variable-length speech segments

  • Johns Hopkins University
  • Université PSL
  • INSERM U869

Résultats de recherche: Contribution à un journalArticle de conférenceRevue par des pairs

Résumé

Fixed-length embeddings of words are very useful for a variety of tasks in speech and language processing. Here we systematically explore two methods of computing fixed-length embeddings for variable-length sequences. We evaluate their susceptibility to phonetic and speaker-specific variability on English, a high resource language, and Xitsonga, a low resource language, using two evaluation metrics: ABX word discrimination and ROC-AUC on same-different phoneme n-grams. We show that a simple downsampling method supplemented with length information can be competitive with the variable-length input feature representation on both evaluations. Recurrent autoencoders trained without supervision can yield even better results at the expense of increased computational complexity.

langue originaleAnglais
Pages (de - à)2683-2687
Nombre de pages5
journalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume2018-September
Les DOIs
étatPublié - 1 janv. 2018
Modification externeOui
Evénement19th Annual Conference of the International Speech Communication, INTERSPEECH 2018 - Hyderabad, Inde
Durée: 2 sept. 20186 sept. 2018

Empreinte digitale

Examiner les sujets de recherche de « Learning word embeddings: Unsupervised methods for fixed-size representations of variable-length speech segments ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation