Skip to main navigation Skip to search Skip to main content

Extended gaze following: Detecting objects in videos beyond the camera field of view

  • Benoit Massé
  • , Stéphane Lathuilière
  • , Pablo Mesejo
  • , Radu Horaud

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper we address the problems of detecting objects of interest in a video and of estimating their locations, solely from the gaze directions of people present in the video. Objects can be indistinctly located inside or outside the camera field of view. We refer to this problem as extended gaze following. The contributions of the paper are the followings. First, we propose a novel spatial representation of the gaze directions adopting a top-view perspective. Second, we develop several convolutional encoder/decoder networks to predict object locations and compare them with heuristics and with classical learning-based approaches. Third, in order to train the proposed models, we generate a very large number of synthetic scenarios employing a probabilistic formulation. Finally, our methodology is empirically validated using a publicly available dataset.

Original languageEnglish
Title of host publicationProceedings - 14th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728100890
DOIs
Publication statusPublished - 1 May 2019
Externally publishedYes
Event14th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2019 - Lille, France
Duration: 14 May 201918 May 2019

Publication series

NameProceedings - 14th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2019

Conference

Conference14th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2019
Country/TerritoryFrance
CityLille
Period14/05/1918/05/19

Fingerprint

Dive into the research topics of 'Extended gaze following: Detecting objects in videos beyond the camera field of view'. Together they form a unique fingerprint.

Cite this