Abstract
SAMOA (Scalable Advanced Massive Online Analysis) is a platform for mining big data streams. It provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms. It features a pluggable architecture that allows it to run on several distributed stream processing engines such as Storm, S4, and Samza. SAMOA is written in Java, is open source, and is available at http://samoa-project.net under the Apache Software License version 2.0.
| Original language | English |
|---|---|
| Pages (from-to) | 149-153 |
| Number of pages | 5 |
| Journal | Journal of Machine Learning Research |
| Volume | 16 |
| Publication status | Published - 1 Jan 2015 |
| Externally published | Yes |
Keywords
- Classification
- Clustering
- Data streams
- Distributed systems
- Machine learning
- Regression
- Toolbox