Decomposing the video editing structure of a talk-show using nonnegative matrix factorization

  • S. Essid
  • , C. Févotte

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We introduce a novel video structuring scheme that exploits nonnegative matrix factorization (NMF) on count data (in a bag of features representation of the visual stream) to jointly discover latent structuring patterns and their activations in time. Our NMF variant employs the Kullback-Leibler divergence as a cost function and imposes a temporal smoothness constraint to the activations. It is solved by a majorization-minimization technique. Our method is shown to be successful for decomposing the high-level editing structure of talk-shows. It is evaluated using a challenging database of TV political-debate programs, and found to clearly outperform a reference HMM method.

Original languageEnglish
Title of host publication2012 IEEE International Conference on Image Processing, ICIP 2012 - Proceedings
Pages3105-3108
Number of pages4
DOIs
Publication statusPublished - 1 Dec 2012
Externally publishedYes
Event2012 19th IEEE International Conference on Image Processing, ICIP 2012 - Lake Buena Vista, FL, United States
Duration: 30 Sept 20123 Oct 2012

Publication series

NameProceedings - International Conference on Image Processing, ICIP
ISSN (Print)1522-4880

Conference

Conference2012 19th IEEE International Conference on Image Processing, ICIP 2012
Country/TerritoryUnited States
CityLake Buena Vista, FL
Period30/09/123/10/12

Keywords

  • Video structuring
  • bag of features
  • indexing
  • machine learning
  • matrix factorization
  • unsupervised classification

Fingerprint

Dive into the research topics of 'Decomposing the video editing structure of a talk-show using nonnegative matrix factorization'. Together they form a unique fingerprint.

Cite this