TY - GEN
T1 - Perceptual tempo estimation using GMM-Regression
AU - Peeters, Geoffroy
AU - Flocon-Cholet, Joachim
PY - 2012/12/10
Y1 - 2012/12/10
N2 - Most current tempo estimation algorithms suffer from the so-called octave estimation problems (estimating twice, thrice, half or one-third of a reference tempo). However, it is difficult to qualify an error as octave error without a clear definition of what is the reference tempo. For this reason, and given that tempo is mostly a perceptual notion, we study here the estimation of perceptual tempo. We consider the perceptual tempo as defined by the results of the largescale experiment made at Last-FM in 2011. We assume that the perception of tempo is related to the rate of variation of four musical attributes: the variation of energy, of harmonic changes, of spectral balance and short-term-eventrepetitions. We then propose the use of GMM-Regression to find the relationship between the perceptual tempo and the four musical attributes. In an experiment, we show that the estimation of the tempo provided by GMM-Regression over these attributes outperforms the one provided by a state-of-the-art tempo estimation algorithm. For this task GMM-Regression also largely outperforms SVM-Regression. We finally study the estimation of three perceptual tempo classes ("Slow", "In Between", "Fast") using both GMM-Regression and SVM-Classification.
AB - Most current tempo estimation algorithms suffer from the so-called octave estimation problems (estimating twice, thrice, half or one-third of a reference tempo). However, it is difficult to qualify an error as octave error without a clear definition of what is the reference tempo. For this reason, and given that tempo is mostly a perceptual notion, we study here the estimation of perceptual tempo. We consider the perceptual tempo as defined by the results of the largescale experiment made at Last-FM in 2011. We assume that the perception of tempo is related to the rate of variation of four musical attributes: the variation of energy, of harmonic changes, of spectral balance and short-term-eventrepetitions. We then propose the use of GMM-Regression to find the relationship between the perceptual tempo and the four musical attributes. In an experiment, we show that the estimation of the tempo provided by GMM-Regression over these attributes outperforms the one provided by a state-of-the-art tempo estimation algorithm. For this task GMM-Regression also largely outperforms SVM-Regression. We finally study the estimation of three perceptual tempo classes ("Slow", "In Between", "Fast") using both GMM-Regression and SVM-Classification.
KW - GMM-Regression
KW - Perceptual tempo
KW - Tempo class
UR - https://www.scopus.com/pages/publications/84870529222
U2 - 10.1145/2390848.2390861
DO - 10.1145/2390848.2390861
M3 - Conference contribution
AN - SCOPUS:84870529222
SN - 9781450315913
T3 - MIRUM 2012 - Proceedings of the 2nd International ACM Workshop on Music Information Retrieval with User-Centered and Multimodal Strategies, Co-located with ACM Multimedia 2012
SP - 45
EP - 50
BT - MIRUM 2012 - Proceedings of the 2nd International ACM Workshop on Music Information Retrieval with User-Centered and Multimodal Strategies, Co-located with ACM Multimedia 2012
T2 - 2nd International ACM Workshop on Music Information Retrieval with User-Centered and Multimodal Strategies, MIRUM 2012 - Co-located with ACM Multimedia 2012
Y2 - 2 November 2012 through 2 November 2012
ER -