Classifying real-world data with the DDα-procedure

Pavlo Mozharovskyi, Karl Mosler, Tatjana Lange

Research output: Contribution to journalArticlepeer-review

Abstract

The $${ DD}\alpha $$DDα-classifier, a nonparametric fast and very robust procedure, is described and applied to fifty classification problems regarding a broad spectrum of real-world data. The procedure first transforms the data from their original property space into a depth space, which is a low-dimensional unit cube, and then separates them by a projective invariant procedure, called $$\alpha $$α-procedure. To each data point the transformation assigns its depth values with respect to the given classes. Several alternative depth notions (spatial depth, Mahalanobis depth, projection depth, and Tukey depth, the latter two being approximated by univariate projections) are used in the procedure, and compared regarding their average error rates. With the Tukey depth, which fits the distributions’ shape best and is most robust, ‘outsiders’, that is data points having zero depth in all classes, appear. They need an additional treatment for classification. Evidence is also given about the dimension of the extended feature space needed for linear separation. The $${ DD}\alpha $$DDα-procedure is available as an R-package.

Original languageEnglish
Pages (from-to)287-314
Number of pages28
JournalAdvances in Data Analysis and Classification
Volume9
Issue number3
DOIs
Publication statusPublished - 2 Sept 2015
Externally publishedYes

Keywords

  • Alpha-procedure
  • Classification
  • Data depth
  • Features
  • Outsiders
  • Projection depth
  • Random Tukey depth
  • Spatial depth
  • Supervised learning

Fingerprint

Dive into the research topics of 'Classifying real-world data with the DDα-procedure'. Together they form a unique fingerprint.

Cite this