TY - GEN
T1 - A conditional random field viewpoint of symbolic audio-to-score matching
AU - Joder, Cyril
AU - Essid, Slim
AU - Richard, Gaël
PY - 2010/12/1
Y1 - 2010/12/1
N2 - We present a new approach of symbolic audio-to-score alignment, with the use of Conditional Random Fields (CRFs). Unlike Hidden Markov Models, these graphical models allow the calculation of state conditional probabilities to be made on the basis of several audio frames. The CRF models that we propose exploit this property to take into account the rhythmic information of the musical score. Assuming that the tempo is locally constant, they confront the neighborhood of each frame with several tempo hypotheses. Experiments on a pop-music database show that this use of contextual information leads to a significant improvement of the alignment accuracy. In particular, the proportion of detected onsets inside a 100-ms tolerance window increases by more than 10% when a 1-s neighborhood is considered.
AB - We present a new approach of symbolic audio-to-score alignment, with the use of Conditional Random Fields (CRFs). Unlike Hidden Markov Models, these graphical models allow the calculation of state conditional probabilities to be made on the basis of several audio frames. The CRF models that we propose exploit this property to take into account the rhythmic information of the musical score. Assuming that the tempo is locally constant, they confront the neighborhood of each frame with several tempo hypotheses. Experiments on a pop-music database show that this use of contextual information leads to a significant improvement of the alignment accuracy. In particular, the proportion of detected onsets inside a 100-ms tolerance window increases by more than 10% when a 1-s neighborhood is considered.
KW - audio/score alignment
KW - conditional random fields
KW - indexing
KW - music information retrieval
U2 - 10.1145/1873951.1874100
DO - 10.1145/1873951.1874100
M3 - Conference contribution
AN - SCOPUS:78650978638
SN - 9781605589336
T3 - MM'10 - Proceedings of the ACM Multimedia 2010 International Conference
SP - 871
EP - 874
BT - MM'10 - Proceedings of the ACM Multimedia 2010 International Conference
T2 - 18th ACM International Conference on Multimedia ACM Multimedia 2010, MM'10
Y2 - 25 October 2010 through 29 October 2010
ER -