Automatic detection of known advertisements in radio broadcast with data-driven ALISP transcriptions

Houssemeddine Khemiri, Gérard Chollet, Dijana Petrovska-Delacrétaz

Research output: Contribution to journalArticlepeer-review

Abstract

This paper presents an audio indexing system to search for known advertisements in radio broadcast streams, using automatically acquired segmental units. These units, called Automatic Language Independent Speech Processing (ALISP) units, are acquired using temporal decomposition and vector quantization and modeled by Hidden Markov Models (HMMs). To detect commercials, ALISP transcriptions of reference advertisements are compared to the transcriptions of the test radio stream using the Levenshtein distance. The system is described and evaluated on one day broadcast audio streams from 11 French radio stations containing 2070 advertisements. With a set of 2,172 reference advertisements we achieve a mean precision rate of 99% with the corresponding recall value of 96%. Moreover, this system allowed us to detect some annotation errors.

Original languageEnglish
Pages (from-to)35-49
Number of pages15
JournalMultimedia Tools and Applications
Volume62
Issue number1
DOIs
Publication statusPublished - 1 Jan 2013

Keywords

  • ALISP tools
  • Advertisement detection
  • Copy detection
  • Data-driven speech segmentation
  • HMM models

Fingerprint

Dive into the research topics of 'Automatic detection of known advertisements in radio broadcast with data-driven ALISP transcriptions'. Together they form a unique fingerprint.

Cite this