Skip to main navigation Skip to search Skip to main content

AutoML for Stream k-Nearest Neighbors Classification

  • Telecom Paris
  • Ipatimup Diagnósticos
  • University of Waikato

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The last few decades have witnessed a significant evolution of technology in different domains, changing the way the world operates, which leads to an overwhelming amount of data generated in an open-ended way as streams. Over the past years, we observed the development of several machine learning algorithms to process big data streams. However, the accuracy of these algorithms is very sensitive to their hyper-parameters, which requires expertise and extensive trials to tune. Another relevant aspect is the high-dimensionality of data, which can causes degradation to computational performance. To cope with these issues, this paper proposes a stream k-nearest neighbors (kNN) algorithm that applies an internal dimension reduction to the stream in order to reduce the resource usage and uses an automatic monitoring system that tunes dynamically the configuration of the kNN algorithm and the output dimension size with big data streams. Experiments over a wide range of datasets show that the predictive and computational performances of the kNN algorithm are improved.

Original languageEnglish
Title of host publicationProceedings - 2020 IEEE International Conference on Big Data, Big Data 2020
EditorsXintao Wu, Chris Jermaine, Li Xiong, Xiaohua Tony Hu, Olivera Kotevska, Siyuan Lu, Weijia Xu, Srinivas Aluru, Chengxiang Zhai, Eyhab Al-Masri, Zhiyuan Chen, Jeff Saltz
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages597-602
Number of pages6
ISBN (Electronic)9781728162515
DOIs
Publication statusPublished - 10 Dec 2020
Externally publishedYes
Event8th IEEE International Conference on Big Data, Big Data 2020 - Virtual, Online, United States
Duration: 10 Dec 202013 Dec 2020

Publication series

NameProceedings - 2020 IEEE International Conference on Big Data, Big Data 2020

Conference

Conference8th IEEE International Conference on Big Data, Big Data 2020
Country/TerritoryUnited States
CityVirtual, Online
Period10/12/2013/12/20

Keywords

  • AutoML
  • Classification
  • Data Stream
  • Dimensionality Reduction
  • Parameter Tuning
  • k-Nearest Neighbors

Fingerprint

Dive into the research topics of 'AutoML for Stream k-Nearest Neighbors Classification'. Together they form a unique fingerprint.

Cite this