Perceptual tempo estimation using GMM-Regression

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Most current tempo estimation algorithms suffer from the so-called octave estimation problems (estimating twice, thrice, half or one-third of a reference tempo). However, it is difficult to qualify an error as octave error without a clear definition of what is the reference tempo. For this reason, and given that tempo is mostly a perceptual notion, we study here the estimation of perceptual tempo. We consider the perceptual tempo as defined by the results of the largescale experiment made at Last-FM in 2011. We assume that the perception of tempo is related to the rate of variation of four musical attributes: the variation of energy, of harmonic changes, of spectral balance and short-term-eventrepetitions. We then propose the use of GMM-Regression to find the relationship between the perceptual tempo and the four musical attributes. In an experiment, we show that the estimation of the tempo provided by GMM-Regression over these attributes outperforms the one provided by a state-of-the-art tempo estimation algorithm. For this task GMM-Regression also largely outperforms SVM-Regression. We finally study the estimation of three perceptual tempo classes ("Slow", "In Between", "Fast") using both GMM-Regression and SVM-Classification.

Original languageEnglish
Title of host publicationMIRUM 2012 - Proceedings of the 2nd International ACM Workshop on Music Information Retrieval with User-Centered and Multimodal Strategies, Co-located with ACM Multimedia 2012
Pages45-50
Number of pages6
DOIs
Publication statusPublished - 10 Dec 2012
Externally publishedYes
Event2nd International ACM Workshop on Music Information Retrieval with User-Centered and Multimodal Strategies, MIRUM 2012 - Co-located with ACM Multimedia 2012 - Nara, Japan
Duration: 2 Nov 20122 Nov 2012

Publication series

NameMIRUM 2012 - Proceedings of the 2nd International ACM Workshop on Music Information Retrieval with User-Centered and Multimodal Strategies, Co-located with ACM Multimedia 2012

Conference

Conference2nd International ACM Workshop on Music Information Retrieval with User-Centered and Multimodal Strategies, MIRUM 2012 - Co-located with ACM Multimedia 2012
Country/TerritoryJapan
CityNara
Period2/11/122/11/12

Keywords

  • GMM-Regression
  • Perceptual tempo
  • Tempo class

Fingerprint

Dive into the research topics of 'Perceptual tempo estimation using GMM-Regression'. Together they form a unique fingerprint.

Cite this