
Searching through a Speech Memory for Text-Independent Speaker Verification

  • Dijana Petrovska-Delacrétaz
  • Asmaa El Hannani
  • Gérard Chollet
  • University of Fribourg
  • Telecom Paris

Research output: Chapter in a book, report, anthology, or collection · Chapter · Peer-reviewed

Abstract

Current state-of-the-art speaker verification algorithms use Gaussian Mixture Models (GMM) to estimate the probability density function of the acoustic feature vectors. Previous studies have shown that phonemes differ in their discriminant power for the speaker verification task. To better exploit these differences, it seems reasonable to segment the speech into distinct speech classes and carry out the speaker modeling for each class separately. Because transcribing databases is a tedious task, we prefer to use data-driven segmentation methods. If the number of automatic classes is comparable to the number of phonetic units, we can hypothesize that these classes correspond roughly to the phonetic units. We have decided to use the well-known Dynamic Time Warping (DTW) method to evaluate the distance between two sequences of speech feature vectors. If the two speech segments belong to the same speech class, we can expect the DTW distortion measure to capture the speaker-specific characteristics. The novelty of the proposed method is the combination of the DTW distortion measure with data-driven segmentation tools. The first experimental results of the proposed method, in terms of Detection Error Tradeoff (DET) curves, are comparable to current state-of-the-art speaker verification results, as obtained in NIST speaker recognition evaluations.
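
The DTW distortion between two speech segments can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `dtw_distortion`, the Euclidean local cost, the symmetric step pattern, and the normalisation by the summed segment lengths are all assumptions chosen for illustration, and each segment is assumed to be an array with one feature vector per frame.

```python
import numpy as np


def dtw_distortion(a, b):
    """Length-normalised DTW distortion between two feature-vector sequences.

    a has shape (n, d) and b has shape (m, d): one row per frame.
    Hypothetical helper for illustration only.
    """
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    n, m = len(a), len(b)

    # Local cost: Euclidean distance between every pair of frames (assumption).
    local = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=2)

    # acc[i, j] is the cost of the best warping path aligning a[:i] with b[:j].
    acc = np.full((n + 1, m + 1), np.inf)
    acc[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            acc[i, j] = local[i - 1, j - 1] + min(
                acc[i - 1, j],      # advance in a only
                acc[i, j - 1],      # advance in b only
                acc[i - 1, j - 1],  # advance in both
            )

    # Divide by n + m so segments of different lengths stay comparable (assumption).
    return acc[n, m] / (n + m)
```

Under these assumptions, a low distortion between a test segment and a stored segment of the same speech class would count as evidence for the same speaker; how the per-segment distortions are combined into a verification score is not specified in the abstract.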

Original language: English
Title of host publication: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Editors: Josef Kittler, Mark S. Nixon
Publisher: Springer Verlag
Pages: 95-103
Number of pages: 9
ISBN (Electronic): 9783540403029
Publication status: Published - 1 Jan 2003
Externally published: Yes

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 2688
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349
