TY - GEN
T1 - Hidden discrete tempo model
T2 - 36th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011
AU - Joder, Cyril
AU - Essid, Slim
AU - Richard, Gaël
PY - 2011/8/18
Y1 - 2011/8/18
N2 - In this paper, we present the Hidden Discrete Tempo Model, an effective Dynamic Bayesian Network for audio to score matching. Its main feature is an explicit modeling of tempo, which directly influences the timing model of the musical performance. Thanks to a discretization of the tempo set, it allows for an efficient decoding by the Viterbi algorithm, and facilitates the introduction of features which directly depend on the local tempo. We take advantage of this property by using the cyclic tempogram descriptor in addition to chroma vectors and onset detection features. Experiment run on both classical piano and pop music show the very high accuracy of this model for audio to score alignment, as well as the usefulness of the tempo feature used.
AB - In this paper, we present the Hidden Discrete Tempo Model, an effective Dynamic Bayesian Network for audio to score matching. Its main feature is an explicit modeling of tempo, which directly influences the timing model of the musical performance. Thanks to a discretization of the tempo set, it allows for an efficient decoding by the Viterbi algorithm, and facilitates the introduction of features which directly depend on the local tempo. We take advantage of this property by using the cyclic tempogram descriptor in addition to chroma vectors and onset detection features. Experiment run on both classical piano and pop music show the very high accuracy of this model for audio to score alignment, as well as the usefulness of the tempo feature used.
KW - acoustic features
KW - automatic alignment
KW - dynamic Bayesian networks
KW - music information retrieval
U2 - 10.1109/ICASSP.2011.5946424
DO - 10.1109/ICASSP.2011.5946424
M3 - Conference contribution
AN - SCOPUS:80051645414
SN - 9781457705397
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 397
EP - 400
BT - 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Proceedings
Y2 - 22 May 2011 through 27 May 2011
ER -