TY - JOUR
T1 - X-Vectors
T2 - New Quantitative Biomarkers for Early Parkinson's Disease Detection From Speech
AU - Jeancolas, Laetitia
AU - Petrovska-Delacrétaz, Dijana
AU - Mangone, Graziella
AU - Benkelfat, Badr Eddine
AU - Corvol, Jean Christophe
AU - Vidailhet, Marie
AU - Lehéricy, Stéphane
AU - Benali, Habib
N1 - Publisher Copyright:
© Copyright © 2021 Jeancolas, Petrovska-Delacrétaz, Mangone, Benkelfat, Corvol, Vidailhet, Lehéricy and Benali.
PY - 2021/2/19
Y1 - 2021/2/19
N2 - Many articles have used voice analysis to detect Parkinson's disease (PD), but few have focused on the early stages of the disease and the gender effect. In this article, we have adapted the latest speaker recognition system, called x-vectors, in order to detect PD at an early stage using voice analysis. X-vectors are embeddings extracted from Deep Neural Networks (DNNs), which provide robust speaker representations and improve speaker recognition when large amounts of training data are used. Our goal was to assess whether, in the context of early PD detection, this technique would outperform the more standard classifier MFCC-GMM (Mel-Frequency Cepstral Coefficients—Gaussian Mixture Model) and, if so, under which conditions. We recorded 221 French speakers (recently diagnosed PD subjects and healthy controls) with a high-quality microphone and via the telephone network. Men and women were analyzed separately in order to have more precise models and to assess a possible gender effect. Several experimental and methodological aspects were tested in order to analyze their impacts on classification performance. We assessed the impact of the audio segment durations, data augmentation, type of dataset used for the neural network training, kind of speech tasks, and back-end analyses. X-vectors technique provided better classification performances than MFCC-GMM for the text-independent tasks, and seemed to be particularly suited for the early detection of PD in women (7–15% improvement). This result was observed for both recording types (high-quality microphone and telephone).
AB - Many articles have used voice analysis to detect Parkinson's disease (PD), but few have focused on the early stages of the disease and the gender effect. In this article, we have adapted the latest speaker recognition system, called x-vectors, in order to detect PD at an early stage using voice analysis. X-vectors are embeddings extracted from Deep Neural Networks (DNNs), which provide robust speaker representations and improve speaker recognition when large amounts of training data are used. Our goal was to assess whether, in the context of early PD detection, this technique would outperform the more standard classifier MFCC-GMM (Mel-Frequency Cepstral Coefficients—Gaussian Mixture Model) and, if so, under which conditions. We recorded 221 French speakers (recently diagnosed PD subjects and healthy controls) with a high-quality microphone and via the telephone network. Men and women were analyzed separately in order to have more precise models and to assess a possible gender effect. Several experimental and methodological aspects were tested in order to analyze their impacts on classification performance. We assessed the impact of the audio segment durations, data augmentation, type of dataset used for the neural network training, kind of speech tasks, and back-end analyses. X-vectors technique provided better classification performances than MFCC-GMM for the text-independent tasks, and seemed to be particularly suited for the early detection of PD in women (7–15% improvement). This result was observed for both recording types (high-quality microphone and telephone).
KW - MFCC
KW - Parkinson's disease
KW - automatic detection
KW - deep neural networks
KW - early detection
KW - telediagnosis
KW - voice analysis
KW - x-vectors
U2 - 10.3389/fninf.2021.578369
DO - 10.3389/fninf.2021.578369
M3 - Article
AN - SCOPUS:85102275423
SN - 1662-5196
VL - 15
JO - Frontiers in Neuroinformatics
JF - Frontiers in Neuroinformatics
M1 - 578369
ER -