TY - GEN
T1 - Multi-scale temporal fusion by boosting for music classification
AU - Foucard, Rémi
AU - Essid, Slim
AU - Lagrange, Mathieu
AU - Richard, Gaël
PY - 2011/1/1
Y1 - 2011/1/1
N2 - Short-term and long-term descriptors constitute complementary pieces of information in the analysis of audio signals. However, because they are extracted over different time horizons, it is difficult to exploit them concurrently in a fully effective manner. In this paper we propose a novel temporal fusion method that leverages the effectiveness of a given set of features by efficiently combining multi-scale versions of them. This fusion is achieved using a boosting technique exploiting trees as weak classifiers, which has the advantage of performing an embedded feature selection. We apply our algorithm to two standard classification tasks, namely musical instrument recognition and multi-tag classification. Our experiments indicate that the multi-scale approach is able to select different features at different scales and significantly outperforms the mono-scale systems in terms of classification performance.
UR - https://www.scopus.com/pages/publications/84873587720
M3 - Conference contribution
AN - SCOPUS:84873587720
SN - 9780615548654
T3 - Proceedings of the 12th International Society for Music Information Retrieval Conference, ISMIR 2011
SP - 663
EP - 668
BT - Proceedings of the 12th International Society for Music Information Retrieval Conference, ISMIR 2011
PB - International Society for Music Information Retrieval
T2 - 12th International Society for Music Information Retrieval Conference, ISMIR 2011
Y2 - 24 October 2011 through 28 October 2011
ER -