TY - GEN
T1 - Supervised and unsupervised sequence modelling for DRUM transcription
AU - Gillet, Olivier
AU - Richard, Gäel
PY - 2007/12/1
Y1 - 2007/12/1
N2 - We discuss in this paper two post-processings for drum transcription systems, which aim to model typical properties of drum sequences. Both methods operate on a symbolic representation of the sequence, which is obtained by quantizing the onsets of drum strokes on an optimal tatum grid, and by fusing the posterior probabilities produced by the drum transcription system. The first proposed method is a generalization of the N-gram model. We discuss several training and recognition strategies (style-dependent models, local models) in order to maximize the reliability and the specificity of the trained models. Alternatively, we introduce a novel unsupervised algorithm based on a complexity criterion, which finds the most regular and wellstructured sequence compatible with the acoustic scores produced by the transcription system. Both approaches are evaluated on a subset of the ENST-drums corpus, and yield performance improvements.
AB - We discuss in this paper two post-processings for drum transcription systems, which aim to model typical properties of drum sequences. Both methods operate on a symbolic representation of the sequence, which is obtained by quantizing the onsets of drum strokes on an optimal tatum grid, and by fusing the posterior probabilities produced by the drum transcription system. The first proposed method is a generalization of the N-gram model. We discuss several training and recognition strategies (style-dependent models, local models) in order to maximize the reliability and the specificity of the trained models. Alternatively, we introduce a novel unsupervised algorithm based on a complexity criterion, which finds the most regular and wellstructured sequence compatible with the acoustic scores produced by the transcription system. Both approaches are evaluated on a subset of the ENST-drums corpus, and yield performance improvements.
M3 - Conference contribution
AN - SCOPUS:78650867380
SN - 9783854032182
T3 - Proceedings of the 8th International Conference on Music Information Retrieval, ISMIR 2007
SP - 219
EP - 224
BT - Proceedings of the 8th International Conference on Music Information Retrieval, ISMIR 2007
T2 - 8th International Conference on Music Information Retrieval, ISMIR 2007
Y2 - 23 September 2007 through 27 September 2007
ER -