SAMOA: Scalable advanced massive online analysis

Gianmarco De Francisci Morales, Albert Bifet

Research output: Contribution to journal / Article / peer-review

Abstract

SAMOA (Scalable Advanced Massive Online Analysis) is a platform for mining big data streams. It provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms. It features a pluggable architecture that allows it to run on several distributed stream processing engines such as Storm, S4, and Samza. SAMOA is written in Java, is open source, and is available at http://samoa-project.net under the Apache Software License version 2.0.

Original language: English
Pages (from-to): 149-153
Number of pages: 5
Journal: Journal of Machine Learning Research
Volume: 16
Publication status: Published - 1 Jan 2015
Externally published: Yes

Keywords

  • Classification
  • Clustering
  • Data streams
  • Distributed systems
  • Machine learning
  • Regression
  • Toolbox
