Full video processing for mobile audio-visual identity verification

Alexander Usoltsev, Khemiri Houssemeddine, Dijana Petrovska-Delacrétaz

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper describes a bi-modal biometric verification system based on voice and face modalities, which takes advantage of the full video processing instead of using still-images. The bi-modal system is evaluated on the MOBIO corpus and results show a relative improvement of performance by nearly 10% when the whole video is used. The fusion between face and speaker verification systems, using linear logistic regression weights, gives a relative improvement of performance that varies between 30% and 60% comparing to the best uni-modal system. Proof-of-concept iPad application is developed based on the proposed bi-modal system.

Original languageEnglish
Title of host publicationICPRAM 2016 - Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods
EditorsMaria De Marsico, Gabriella Sanniti di Baja, Ana Fred
PublisherSciTePress
Pages552-557
Number of pages6
ISBN (Electronic)9789897581731
DOIs
Publication statusPublished - 1 Jan 2016
Externally publishedYes
Event5th International Conference on Pattern Recognition Applications and Methods, ICPRAM 2016 - Rome, Italy
Duration: 24 Feb 201626 Feb 2016

Publication series

NameICPRAM 2016 - Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods

Conference

Conference5th International Conference on Pattern Recognition Applications and Methods, ICPRAM 2016
Country/TerritoryItaly
CityRome
Period24/02/1626/02/16

Keywords

  • Biometrics
  • Face
  • Full video processing
  • Score fusion
  • Speech

Fingerprint

Dive into the research topics of 'Full video processing for mobile audio-visual identity verification'. Together they form a unique fingerprint.

Cite this