Passer à la navigation principale Passer à la recherche Passer au contenu principal

Incremental k-Nearest Neighbors Using Reservoir Sampling for Data Streams

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

The online and potentially infinite nature of data streams leads to the inability to store the flow in its entirety and thus restricts the storage to a part of – and/or synopsis information from – the stream. To process these evolving data, we need efficient and accurate methodologies and systems, such as window models (e.g., sliding windows) and summarization techniques (e.g., sampling, sketching, dimensionality reduction). In this paper, we propose, RW-kNN, a k-Nearest Neighbors (kNN) algorithm that employs a practical way to store information about past instances using the biased reservoir sampling to sample the input instances along with a sliding window to maintain the most recent instances from the stream. We evaluate our proposal on a diverse set of synthetic and real datasets and compare against state-of-the-art algorithms in a traditional test-then-train evaluation. Results show how our proposed RW-kNN approach produces high-predictive performance for both real and synthetic datasets while using a feasible amount of resources.

langue originaleAnglais
titreDiscovery Science - 24th International Conference, DS 2021, Proceedings
rédacteurs en chefCarlos Soares, Luis Torgo
EditeurSpringer Science and Business Media Deutschland GmbH
Pages122-137
Nombre de pages16
ISBN (imprimé)9783030889418
Les DOIs
étatPublié - 1 janv. 2021
Evénement24th International Conference on Discovery Science, DS 2021 - Virtual, Online
Durée: 11 oct. 202113 oct. 2021

Série de publications

NomLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12986 LNAI
ISSN (imprimé)0302-9743
ISSN (Electronique)1611-3349

Une conférence

Une conférence24th International Conference on Discovery Science, DS 2021
La villeVirtual, Online
période11/10/2113/10/21

Empreinte digitale

Examiner les sujets de recherche de « Incremental k-Nearest Neighbors Using Reservoir Sampling for Data Streams ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation