TY - GEN
T1 - A deep interpretation of classifier chains
AU - Read, Jesse
AU - Hollmén, Jaakko
N1 - Publisher Copyright:
© Springer International Publishing Switzerland 2014
PY - 2014/1/1
Y1 - 2014/1/1
N2 - In the “classifier chains” (CC) approach for multi-label classification, the predictions of binary classifiers are cascaded along a chain as additional features. This method has attained high predictive performance, and is receiving increasing analysis and attention in the recent multi-label literature, although a deep understanding of its performance is still taking shape. In this paper, we show that CC gets predictive power from leveraging labels as additional stochastic features, contrasting with many other methods, such as stacking and error correcting output codes, which use label dependence only as kind of regularization. CC methods can learn a concept which these cannot, even supposing the same base classifier and hypothesis space. This leads us to connections with deep learning (indeed, we show that CC is competitive precisely because it is a deep learner), and we employ deep learning methods – showing that they can supplement or even replace a classifier chain. Results are convincing, and throw new insight into promising future directions.
AB - In the “classifier chains” (CC) approach for multi-label classification, the predictions of binary classifiers are cascaded along a chain as additional features. This method has attained high predictive performance, and is receiving increasing analysis and attention in the recent multi-label literature, although a deep understanding of its performance is still taking shape. In this paper, we show that CC gets predictive power from leveraging labels as additional stochastic features, contrasting with many other methods, such as stacking and error correcting output codes, which use label dependence only as kind of regularization. CC methods can learn a concept which these cannot, even supposing the same base classifier and hypothesis space. This leads us to connections with deep learning (indeed, we show that CC is competitive precisely because it is a deep learner), and we employ deep learning methods – showing that they can supplement or even replace a classifier chain. Results are convincing, and throw new insight into promising future directions.
U2 - 10.1007/978-3-319-12571-8_22
DO - 10.1007/978-3-319-12571-8_22
M3 - Conference contribution
AN - SCOPUS:84910018782
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 251
EP - 262
BT - Advances in Intelligent DataAnalysis XIII - 13th International Symposium, IDA 2014, Proceedings
A2 - Blockeel, Hendrik
A2 - van Leeuwen, Matthijs
A2 - Vinciotti, Veronica
PB - Springer Verlag
T2 - PAKDD 2006 International Workshop on Knowledge Discovery in Life Science Literature, KDLL 2006
Y2 - 9 April 2006 through 9 April 2006
ER -