Spectral and Temporal Periodicity Representations of Rhythm for the Automatic Classification of Music Audio Signal

Research output: Contribution to journal › Article › peer-review

Abstract

In this paper, we study the spectral and temporal periodicity representations that can be used to describe the characteristics of the rhythm of a music audio signal. A continuous-valued energy-function representing the onset positions over time is first extracted from the audio signal. From this function we compute at each time a vector which represents the characteristics of the local rhythm. Four feature sets are studied for this vector. They are derived from the amplitude of the discrete Fourier transform (DFT), the auto-correlation function (ACF), the product of the DFT and the ACF interpolated on a hybrid lag/frequency axis and the concatenated DFT and ACF coefficients. Then the vectors are sampled at some specific frequencies, which represent various ratios of the local tempo. The ability of these periodicity representations to describe the rhythm characteristics of an audio item is evaluated through a classification task. In this, we test the use of the periodicity representations alone, combined with tempo information and combined with a proposed set of rhythm features. The evaluation is performed using annotated and estimated tempo. We show that using such simple periodicity representations allows achieving high recognition rates at least comparable to previously published results.
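The pipeline described in the abstract can be illustrated with a minimal sketch for a single analysis frame. It is not the authors' implementation. The function name `periodicity_features`, the Hann window, the half-wave rectification of the ACF, and the default tempo ratios are all assumptions made for illustration, since the abstract does not specify them. The onset-energy function is taken as given. The ACF is mapped onto the frequency axis through lag = 1/frequency, and the product is formed after sampling rather than on a full interpolated hybrid axis.

```python
import numpy as np

def periodicity_features(onset_env, sr_env, tempo_bpm,
                         ratios=(0.25, 0.5, 1.0, 2.0, 3.0, 4.0)):
    """Sketch of four periodicity feature sets for one frame.

    onset_env : 1-D onset-energy function (assumed already extracted)
    sr_env    : sampling rate of onset_env in Hz
    tempo_bpm : local tempo (annotated or estimated)
    ratios    : hypothetical tempo ratios at which to sample
    """
    x = np.asarray(onset_env, dtype=float)
    x = x - x.mean()                      # remove DC before analysis
    n = len(x)

    # Spectral periodicity: amplitude of the DFT (Hann window is an assumption).
    spec = np.abs(np.fft.rfft(x * np.hanning(n)))
    freqs = np.fft.rfftfreq(n, d=1.0 / sr_env)

    # Temporal periodicity: auto-correlation, normalized by its lag-0 value.
    acf = np.correlate(x, x, mode="full")[n - 1:]
    if acf[0] > 0:
        acf = acf / acf[0]
    lags = np.arange(n) / sr_env          # lag axis in seconds

    # Sample both representations at multiples of the tempo frequency.
    targets_hz = (tempo_bpm / 60.0) * np.asarray(ratios, dtype=float)
    dft_s = np.interp(targets_hz, freqs, spec)
    # A frequency f corresponds to the lag 1/f on the ACF axis.
    acf_s = np.interp(1.0 / targets_hz, lags, acf)
    acf_s = np.maximum(acf_s, 0.0)        # assumption: keep positive correlation only

    return {
        "dft": dft_s,                               # DFT amplitude
        "acf": acf_s,                               # ACF
        "product": dft_s * acf_s,                   # DFT x ACF on a shared axis
        "concat": np.concatenate([dft_s, acf_s]),   # concatenated coefficients
    }
```

One motivation for the product can be seen on a regular pulse train: the DFT also peaks at integer multiples of the tempo frequency and the ACF at integer multiples of the beat period, so their product tends to keep only the component at the tempo itself.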

Original language: English
Pages (from-to): 1242-1252
Number of pages: 11
Journal: IEEE Transactions on Audio, Speech and Language Processing
Volume: 19
Issue number: 5
Publication status: Published - 1 Jan 2011
Externally published: Yes

Keywords

  • Audio features
  • automatic indexing
  • rhythm classification
  • rhythm description
