Multiview approaches to event detection and scene analysis

Slim Essid, Sanjeel Parekh, Ngoc Q.K. Duong, Romain Serizel, Alexey Ozerov, Fabio Antonacci, Augusto Sarti

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

Abstract

This chapter addresses sound scene and event classification in multiview settings, that is, settings where the observations are obtained from multiple sensors, each sensor contributing a particular view of the data (e.g., audio microphones, video cameras, etc.). We briefly introduce some of the techniques that can be exploited to effectively combine the data conveyed by the different views under analysis for a better interpretation. We first provide a high-level presentation of generic methods that are particularly relevant in the context of multiview and multimodal sound scene analysis. Then, we more specifically present a selection of techniques used for audiovisual event detection and microphone array-based scene analysis.

Original languageEnglish
Title of host publicationComputational Analysis of Sound Scenes and Events
PublisherSpringer International Publishing
Pages243-276
Number of pages34
ISBN (Electronic)9783319634500
ISBN (Print)9783319634494
DOIs
Publication statusPublished - 21 Sept 2017
Externally publishedYes

Keywords

  • Audio source localization and tracking
  • Audio source separation
  • Beamforming
  • Data fusion
  • Joint audiovisual scene analysis
  • Matrix factorization
  • Multichannel Wiener filtering
  • Multichannel audio
  • Multimodal scene analysis
  • Multiview scene analysis
  • Representation learning
  • Tensor factorization

Fingerprint

Dive into the research topics of 'Multiview approaches to event detection and scene analysis'. Together they form a unique fingerprint.

Cite this