Abstract
The TreeRank algorithm was recently proposed in [1] and [2] as a scoring-based method based on recursive partitioning of the input space. This tree induction algorithm builds orderings by recursively optimizing the Receiver Operating Characteristic curve through a one-step optimization procedure called LeafRank. One of the aim of this paper is the in-depth analysis of the empirical performance of the variants of TreeRank/LeafRank method. Numerical experiments based on both artificial and real data sets are provided. Further experiments using resampling and randomization, in the spirit of bagging and random forests are developed [3, 4] and we show how they increase both stability and accuracy in bipartite ranking. Moreover, an empirical comparison with other efficient scoring algorithms such as RankBoost and RankSVM is presented on UCI benchmark data sets.
| Original language | English |
|---|---|
| Pages (from-to) | 475-496 |
| Number of pages | 22 |
| Journal | Pattern Analysis and Applications |
| Volume | 16 |
| Issue number | 4 |
| DOIs | |
| Publication status | Published - 1 Nov 2013 |
| Externally published | Yes |
Keywords
- AUC maximization
- Feature randomization
- ROC curve
- Ranking trees
- Resampling
- Scoring rules