Segmental approaches for automatic speaker verification

Dijana Petrovska-Delacrétaz, Jan Černocký, Jean Hennebert, Gérard Chollet

Research output: Contribution to journalArticlepeer-review

Abstract

Speech is composed of different sounds (acoustic segments). Speakers differ in their pronunciation of these sounds. The segmental approaches described in this paper are meant to exploit these differences for speaker verification purposes. For such approaches, the speech is divided into different classes, and the speaker modeling is done for each class. The speech segmentation applied is based on automatic language independent speech processing tools that provide a segmentation of the speech requiring neither phonetic nor orthographic transcriptions of the speech data. Two different speaker modeling approaches, based on multilayer perceptrons (MLPs) and on Gaussian mixture models (GMMs), are studied. The MLP-based segmental systems have performance comparable to that of the global MLP-based systems, and in the mismatched train-test conditions slightly better results are obtained with the segmental MLP system. The segmental GMM systems gave poorer results than the equivalent global GMM systems.

Original languageEnglish
Pages (from-to)198-212
Number of pages15
JournalDigital Signal Processing: A Review Journal
Volume10
Issue number1
DOIs
Publication statusPublished - 1 Jan 2000
Externally publishedYes

Fingerprint

Dive into the research topics of 'Segmental approaches for automatic speaker verification'. Together they form a unique fingerprint.

Cite this