Exploiting high-level information provided by ALISP in speaker recognition

Asmaa El Hannani, Dijana Petrovska-Delacrétaz

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The best performing systems in the area of automatic speaker recognition have focused on using short-term, low-level acoustic information, such as cepstral features. Recently, various works have demonstrated that high-level features convey more speaker information and can be added to the low-level features in order to increase the robustness of the system. This paper describes a text-independent speaker recognition system exploiting high-level information provided by ALISP (Automatic Language Independent Speech Processing), a data-driven segmentation. This system, denoted here as ALISP n-gram system, captures the speaker specific information only by analyzing sequences of ALISP units. The ALISP n-gram system was fused with an acoustic ALISP-based Gaussian Mixture Models (GMM) system exploiting the speaker discriminating properties of individual speech classes. The resulting fused system reduced the error rate over the individual systems on the NIST 2004 Speaker Recognition Evaluation data.

Original languageEnglish
Title of host publicationNonlinear Analyses and Algorithms for Speech Processing - International Conference on Non-Linear Speech Processing, NOLISP 2005, Revised Selected Papers
Pages66-71
Number of pages6
Publication statusPublished - 1 Dec 2005
Externally publishedYes
EventInternational Conference on Non-Linear Speech Processing, NOLISP 2005 - Barcelona, Spain
Duration: 19 Apr 200522 Apr 2005

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3817 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceInternational Conference on Non-Linear Speech Processing, NOLISP 2005
Country/TerritorySpain
CityBarcelona
Period19/04/0522/04/05

Fingerprint

Dive into the research topics of 'Exploiting high-level information provided by ALISP in speaker recognition'. Together they form a unique fingerprint.

Cite this