Robust facial alignment with internal denoising auto-encoder

Decky Aspandi, Oriol Martinez, Federico Sukno, Xavier Binefa

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The development of facial alignment models is growing rapidly thanks to the availability of large facial landmarked datasets and powerful deep learning models. However, important challenges still remain for facial alignment models to work on images under extreme conditions, such as severe occlusions or large variations in pose and illumination. Current attempts to overcome this limitation have mainly focused on building robust feature extractors with the assumption that the model will be able to discard the noise and select only the meaningful features. However, such an assumption ignores the importance of understanding the noise that characterizes unconstrained images, which has been shown to benefit computer vision models if used appropriately on the learning strategy. Thus, in this paper we investigate the introduction of specialized modules for noise detection and removal, in combination with our state-of-the-art facial alignment module and show that this leads to improved robustness both to synthesized noise and in-the-wild conditions. The proposed model is built by combining two major subnetworks: internal image denoiser (based on the Auto-Encoder architecture) and facial landmark localiser (based on the inception-resnet architecture). Our results on the 300-W and Menpo datasets show that our model can effectively handle different types of synthetic noise, which also leads to enhanced robustness in real-world unconstrained settings, reaching top state-of-the-art accuracy.

Original languageEnglish
Title of host publicationProceedings - 2019 16th Conference on Computer and Robot Vision, CRV 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages143-150
Number of pages8
ISBN (Electronic)9781728118383
DOIs
Publication statusPublished - 1 May 2019
Externally publishedYes
Event16th Conference on Computer and Robot Vision, CRV 2019 - Kingston, Canada
Duration: 29 May 201931 May 2019

Publication series

NameProceedings - 2019 16th Conference on Computer and Robot Vision, CRV 2019

Conference

Conference16th Conference on Computer and Robot Vision, CRV 2019
Country/TerritoryCanada
CityKingston
Period29/05/1931/05/19

Keywords

  • Auto Encoder
  • Facial Alignment
  • Noise Normalization

Fingerprint

Dive into the research topics of 'Robust facial alignment with internal denoising auto-encoder'. Together they form a unique fingerprint.

Cite this