TY - GEN
T1 - Adaptive learning from evolving data streams
AU - Bifet, Albert
AU - Gavaldà, Ricard
PY - 2009/10/16
Y1 - 2009/10/16
N2 - We propose and illustrate a method for developing algorithms that can adaptively learn from data streams that drift over time. As an example, we take Hoeffding Tree, an incremental decision tree inducer for data streams, and use as a basis it to build two new methods that can deal with distribution and concept drift: a sliding window-based algorithm, Hoeffding Window Tree, and an adaptive method, Hoeffding Adaptive Tree. Our methods are based on using change detectors and estimator modules at the right places; we choose implementations with theoretical guarantees in order to extend such guarantees to the resulting adaptive learning algorithm. A main advantage of our methods is that they require no guess about how fast or how often the stream will drift; other methods typically have several user-defined parameters to this effect. In our experiments, the new methods never do worse, and in some cases do much better, than CVFDT, a well-known method for tree induction on data streams with drift.
AB - We propose and illustrate a method for developing algorithms that can adaptively learn from data streams that drift over time. As an example, we take Hoeffding Tree, an incremental decision tree inducer for data streams, and use as a basis it to build two new methods that can deal with distribution and concept drift: a sliding window-based algorithm, Hoeffding Window Tree, and an adaptive method, Hoeffding Adaptive Tree. Our methods are based on using change detectors and estimator modules at the right places; we choose implementations with theoretical guarantees in order to extend such guarantees to the resulting adaptive learning algorithm. A main advantage of our methods is that they require no guess about how fast or how often the stream will drift; other methods typically have several user-defined parameters to this effect. In our experiments, the new methods never do worse, and in some cases do much better, than CVFDT, a well-known method for tree induction on data streams with drift.
UR - https://www.scopus.com/pages/publications/70349871603
U2 - 10.1007/978-3-642-03915-7_22
DO - 10.1007/978-3-642-03915-7_22
M3 - Conference contribution
AN - SCOPUS:70349871603
SN - 3642039146
SN - 9783642039140
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 249
EP - 260
BT - Advances in Intelligent Data Analysis VIII - 8th International Symposium on Intelligent Data Analysis, IDA 2009, Proceedings
T2 - 8th International Symposium on Intelligent Data Analysis, IDA 2009
Y2 - 31 August 2009 through 2 September 2009
ER -