StreamDM: Advanced Data Mining in Spark Streaming

  • Albert Bifet
  • , Silviu Maniu
  • , Jianfeng Qian
  • , Guangjian Tian
  • , Cheng He
  • , Wei Fan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Real-time analytics are becoming increasingly important due to the large amount of data that is being created continuously. Drawing from our experiences at Huawei Noah's Ark Lab, we present and demonstrate here StreamDM, a new open source data mining and machine learning library, designed on top of Spark Streaming, an extension of the core Spark API that enables scalable stream processing of data streams. StreamDM is designed to be easily extended and used, either practitioners, developers, or researchers, and is the first library to contain advanced stream mining algorithms for Spark Streaming.

Original languageEnglish
Title of host publicationProceedings - 15th IEEE International Conference on Data Mining Workshop, ICDMW 2015
EditorsXindong Wu, Alexander Tuzhilin, Hui Xiong, Jennifer G. Dy, Charu Aggarwal, Zhi-Hua Zhou, Peng Cui
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1608-1611
Number of pages4
ISBN (Electronic)9781467384926
DOIs
Publication statusPublished - 29 Jan 2016
Event15th IEEE International Conference on Data Mining Workshop, ICDMW 2015 - Atlantic City, United States
Duration: 14 Nov 201517 Nov 2015

Publication series

NameProceedings - 15th IEEE International Conference on Data Mining Workshop, ICDMW 2015

Conference

Conference15th IEEE International Conference on Data Mining Workshop, ICDMW 2015
Country/TerritoryUnited States
CityAtlantic City
Period14/11/1517/11/15

Keywords

  • Software
  • Spark Streaming
  • open-source
  • stream mining

Fingerprint

Dive into the research topics of 'StreamDM: Advanced Data Mining in Spark Streaming'. Together they form a unique fingerprint.

Cite this