TY - GEN
T1 - Visual analysis for drum sequence transcription
AU - McGuinness, Kevin
AU - Gillet, Olivier
AU - O'Connor, Noel E.
AU - Richard, Gaël
PY - 2007/12/1
Y1 - 2007/12/1
N2 - A system is presented for analysing drum performance video sequences. A novel ellipse detection algorithm is introduced that automatically locates drum tops. This algorithm fits ellipses to edge clusters, and ranks them according to various fitness criteria. A background/foreground segmentation method is then used to extract the silhouette of the drummer and drum sticks. Coupled with a motion intensity feature, this allows for the detection of 'hits' in each of the extracted regions. In order to obtain a transcription of the performance, each of these regions is automatically labeled with the corresponding instrument class. A partial audio transcription and color cues are used to measure the compatibility between a region and its label; the Kuhn-Munkres algorithm is then employed to find the optimal labeling. Experimental results demonstrate the ability of visual analysis to enhance the performance of an audio drum transcription system.
AB - A system is presented for analysing drum performance video sequences. A novel ellipse detection algorithm is introduced that automatically locates drum tops. This algorithm fits ellipses to edge clusters, and ranks them according to various fitness criteria. A background/foreground segmentation method is then used to extract the silhouette of the drummer and drum sticks. Coupled with a motion intensity feature, this allows for the detection of 'hits' in each of the extracted regions. In order to obtain a transcription of the performance, each of these regions is automatically labeled with the corresponding instrument class. A partial audio transcription and color cues are used to measure the compatibility between a region and its label; the Kuhn-Munkres algorithm is then employed to find the optimal labeling. Experimental results demonstrate the ability of visual analysis to enhance the performance of an audio drum transcription system.
UR - https://www.scopus.com/pages/publications/78651452857
M3 - Conference contribution
AN - SCOPUS:78651452857
SN - 9788392134022
T3 - European Signal Processing Conference
SP - 312
EP - 316
BT - 15th European Signal Processing Conference, EUSIPCO 2007 - Proceedings
T2 - 15th European Signal Processing Conference, EUSIPCO 2007
Y2 - 3 September 2007 through 7 September 2007
ER -