A Mixed Audio-Video SPD Network for Online Classification of Parkinsonian Speech Patterns

John Archila, Antoine Manzanera, Fabio Martínez

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

Parkinson’s disease (PD) is a neurodegenerative disease that produces progressive motor impairments. Dysarthria (speech disorder) and hypomimia (facial rigidity) are two major parkinsonian patterns observed even at the early stages of the disease. Nonetheless, clinical diagnosis is mainly observational and depends on the specialist’s expertise. Moreover, each of these patterns is usually assessed in isolation, which may delay diagnosis and lead to poorly planned treatment. This work introduces a non-invasive multimodal strategy that integrates the video and audio modalities for the online characterization of speech exercises. Subjects were asked to pronounce sustained vowels while video and audio were recorded. A temporal window was then slid along each sequence to build online covariance matrices of synchronized facial landmark positions and characteristic voice frequencies. Riemannian descriptors learned from these temporal covariance matrices discriminate between Parkinson’s and control subjects. In a study with 14 subjects, the proposed approach achieved a mean accuracy of 70% on sustained vowel pronunciation. For online predictions, the approach showed a consistent accuracy of 77% during the pronunciation of close vowels.
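
The windowed covariance step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the window and step sizes, the regularization constant, and the random stand-in features are all hypothetical, and the learned SPD network is replaced here by a plain log-Euclidean mapping (the matrix logarithm of each covariance, flattened into a vector), a common hand-crafted Riemannian descriptor for symmetric positive definite (SPD) matrices.

```python
import numpy as np


def windowed_covariances(features, window, step, eps=1e-6):
    """Slide a temporal window over a (T, d) feature sequence and return
    one regularized covariance matrix per window, shape (n_windows, d, d).

    `features` is assumed to hold time-synchronized audio and video
    measurements, one row per time step (hypothetical layout).
    """
    n_steps, dim = features.shape
    covs = []
    for start in range(0, n_steps - window + 1, step):
        chunk = features[start:start + window]
        cov = np.cov(chunk, rowvar=False)      # columns are variables
        covs.append(cov + eps * np.eye(dim))   # keep the matrix strictly SPD
    return np.stack(covs)


def log_euclidean_descriptor(cov):
    """Map an SPD matrix to a vector through its matrix logarithm."""
    eigvals, eigvecs = np.linalg.eigh(cov)
    log_cov = (eigvecs * np.log(eigvals)) @ eigvecs.T
    rows, cols = np.triu_indices(cov.shape[0])
    # Weight off-diagonal entries by sqrt(2) so the vector norm equals
    # the Frobenius norm of the log-matrix.
    scale = np.where(rows == cols, 1.0, np.sqrt(2.0))
    return log_cov[rows, cols] * scale


# Random stand-in for 200 time steps of 6 synchronized features.
rng = np.random.default_rng(0)
features = rng.standard_normal((200, 6))

covs = windowed_covariances(features, window=50, step=10)
descriptors = np.stack([log_euclidean_descriptor(c) for c in covs])
print(covs.shape, descriptors.shape)
```

Each row of `descriptors` could then be passed to any standard classifier to produce one prediction per window, which is the sense in which such a pipeline is "online". The abstract indicates that the paper instead learns its descriptors from the covariance matrices with an SPD network.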

Original language: English
Title of host publication: Advances in Artificial Intelligence – IBERAMIA 2024 - 18th Ibero-American Conference on AI, Proceedings
Editors: Luís Correia, Aiala Rosá, Francisco Garijo
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 110-121
Number of pages: 12
ISBN (Print): 9783031803659
Publication status: Published - 1 Jan 2025
Externally published: Yes
Event: 18th Ibero-American Conference on Artificial Intelligence, IBERAMIA 2024 - Montevideo, Uruguay
Duration: 13 Nov 2024 to 15 Nov 2024

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 15277 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 18th Ibero-American Conference on Artificial Intelligence, IBERAMIA 2024
Country/Territory: Uruguay
City: Montevideo
Period: 13/11/24 to 15/11/24

Keywords

  • Mixed audio-video SPD networks
  • Online Parkinson’s disease prediction
