Using data-driven and phonetic units for speaker verification

Asmaa El Hannani, Doroteo T. Toledano, Dijana Petrovska-Delacrétaz, Alberto Montero-Asenjo, Jean Hennebert

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Recognition of speaker identity based on modeling the streams produced by phonetic decoders (phonetic speaker recognition) has gained popularity during the past few years. Two of the major problems that arise when phone based systems are being developed are the possible mismatches between the development and evaluation data and the lack of transcribed databases. Data-driven segmentation techniques provide a potential solution to these problems because they do not use transcribed data and can easily be applied on development data minimizing the mismatches. In this paper we compare speaker recognition results using phonetic and data-driven decoders. To this end, we have compared the results obtained with a speaker recognition system based on data-driven acoustic units and phonetic speaker recognition systems trained on Spanish and English data. Results obtained on the NIST 2005 Speaker Recognition Evaluation data show that the data-driven approach outperforms the phonetic one and that further improvements can be achieved by combining both approaches.

Original languageEnglish
Title of host publicationIEEE Odyssey 2006
Subtitle of host publicationWorkshop on Speaker and Language Recognition
DOIs
Publication statusPublished - 1 Dec 2006
Externally publishedYes
EventIEEE Odyssey 2006: Workshop on Speaker and Language Recognition - San Juan, Puerto Rico
Duration: 28 Jun 200630 Jun 2006

Publication series

NameIEEE Odyssey 2006: Workshop on Speaker and Language Recognition

Conference

ConferenceIEEE Odyssey 2006: Workshop on Speaker and Language Recognition
Country/TerritoryPuerto Rico
CitySan Juan
Period28/06/0630/06/06

Fingerprint

Dive into the research topics of 'Using data-driven and phonetic units for speaker verification'. Together they form a unique fingerprint.

Cite this