Skip to main navigation Skip to search Skip to main content

GNUsmail: Open framework for on-line email classification

  • José M. Carmona-Cejudo
  • , Manuel Baena-García
  • , José Del Campo-Ávila
  • , Rafael Morales-Bueno
  • , Albert Bifet
  • University of Malaga
  • University of Waikato

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Real-time classification of massive email data is a challenging task that presents its own particular difficulties. Since email data presents an important temporal component, several problems arise: emails arrive continuously, and the criteria used to classify those emails can change, so the learning algorithms have to be able to deal with concept drift. Our problem is more general than spam detection, which has received much more attention in the literature. In this paper we present GNUsmail, an open-source extensible framework for email classification, which structure supports incremental and on-line learning. This framework enables the incorporation of algorithms developed by other researchers, such as those included in WEKA and MOA. We evaluate this framework, characterized by two overlapping phases (pre-processing and learning), using the ENRON dataset, and we compare the results achieved by WEKA and MOA algorithms.

Original languageEnglish
Title of host publicationECAI 2010
PublisherIOS Press
Pages1141-1142
Number of pages2
ISBN (Print)9781607506058
DOIs
Publication statusPublished - 1 Jan 2010
Externally publishedYes
Event2nd Workshop on Knowledge Representation for Health Care, KR4HC 2010, held in conjunction with the 19th European Conference in Artificial Intelligence, ECAI 2010 - Lisbon, Portugal
Duration: 17 Aug 201017 Aug 2010

Publication series

NameFrontiers in Artificial Intelligence and Applications
Volume215
ISSN (Print)0922-6389
ISSN (Electronic)1879-8314

Conference

Conference2nd Workshop on Knowledge Representation for Health Care, KR4HC 2010, held in conjunction with the 19th European Conference in Artificial Intelligence, ECAI 2010
Country/TerritoryPortugal
CityLisbon
Period17/08/1017/08/10

Fingerprint

Dive into the research topics of 'GNUsmail: Open framework for on-line email classification'. Together they form a unique fingerprint.

Cite this