Abstract
Big Data is a new term used to identify datasets that we cannot manage with current methodologies or data mining software tools due to their large size and complexity. Big Data mining is the capability of extracting useful information from these large datasets or streams of data. New mining techniques are necessary due to the volume, variability, and velocity, of such data. moa is a software framework with classification, regression, and frequent pattern methods, and the new Apache Samoa is a distributed streaming software for mining data streams.
| Original language | English |
|---|---|
| Pages (from-to) | 13-14 |
| Number of pages | 2 |
| Journal | CEUR Workshop Proceedings |
| Volume | 1478 |
| Publication status | Published - 1 Jan 2015 |
| Externally published | Yes |
| Event | 2nd Annual International Symposium on Information Management and Big Data, SIMBig 2015 - Cusco, Peru Duration: 2 Sept 2015 → 4 Sept 2015 |