
Quality scheme assessment in the clustering process

  • Athens Univ. of Econ. and Business

Research output: Chapter in a book, report, anthology or collection; Conference contribution; Peer-reviewed

Abstract

Clustering is mostly an unsupervised procedure, and most clustering algorithms depend on assumptions and initial guesses to define the subgroups present in a data set. As a consequence, in most applications the final clusters require some sort of evaluation. The evaluation procedure has to tackle difficult problems, which can be qualitatively expressed as: i. the quality of the clusters, ii. the degree to which a clustering scheme fits a specific data set, iii. the optimal number of clusters in a partitioning. In this paper we present a scheme for finding the optimal partitioning of a data set during the clustering process, regardless of the clustering algorithm used. More specifically, we present an approach for evaluating clustering schemes (partitions) so as to find the best number of clusters for a specific data set. A clustering algorithm produces different partitions for different values of its input parameters. The proposed approach selects the best clustering scheme (i.e., the scheme with the most compact and well-separated clusters) according to a quality index we define. We verified our approach using two popular clustering algorithms on synthetic and real data sets in order to evaluate its reliability. Moreover, we study the influence of different clustering parameters on the proposed quality index.
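
The selection loop described in the abstract (cluster under several parameter values, score each partition, keep the best) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the abstract does not define the quality index, so `toy_index` is a stand-in ratio of compactness to separation and not the index proposed in the paper; k-means with a deterministic farthest-first seeding is just one possible clustering algorithm; and the function names and toy data are hypothetical.

```python
import math


def farthest_first(points, k):
    """Deterministic seeding: start at points[0], then repeatedly add the
    point that is farthest from all centroids chosen so far."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centroids))
        )
    return centroids


def kmeans(points, k, max_iter=100):
    """Plain Lloyd iterations; returns (centroids, labels)."""
    centroids = farthest_first(points, k)
    labels = []
    for _ in range(max_iter):
        # Assign every point to its nearest centroid.
        labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        updated = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            # Keep the old centroid if a cluster ends up empty.
            updated.append(
                tuple(sum(axis) / len(members) for axis in zip(*members))
                if members else centroids[j]
            )
        if updated == centroids:
            break
        centroids = updated
    return centroids, labels


def toy_index(points, centroids, labels):
    """Stand-in index (NOT the paper's index): mean point-to-centroid
    distance divided by the smallest centroid-to-centroid distance.
    Lower means more compact and better separated. Requires k >= 2."""
    compactness = sum(
        math.dist(p, centroids[lab]) for p, lab in zip(points, labels)
    ) / len(points)
    separation = min(
        math.dist(a, b)
        for i, a in enumerate(centroids)
        for b in centroids[i + 1:]
    )
    return compactness / separation if separation > 0 else math.inf


def select_scheme(points, candidate_ks):
    """Cluster once per candidate k and keep the scheme with the lowest index."""
    scores = {}
    for k in candidate_ks:
        centroids, labels = kmeans(points, k)
        scores[k] = toy_index(points, centroids, labels)
    best_k = min(scores, key=scores.get)
    return best_k, scores


# Hypothetical toy data: three tight, well-separated groups of four points.
centers = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]
offsets = [(-0.5, -0.5), (-0.5, 0.5), (0.5, -0.5), (0.5, 0.5)]
points = [(cx + dx, cy + dy) for cx, cy in centers for dx, dy in offsets]

best_k, scores = select_scheme(points, range(2, 6))
print(best_k, scores)
```

Because the loop only needs a function that returns a partition and a function that scores it, a different clustering routine or index could be substituted for `kmeans` and `toy_index` without changing `select_scheme`.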

Original language: English
Title of host publication: Principles of Data Mining and Knowledge Discovery - 4th European Conference, PKDD 2000, Proceedings
Editors: Djamel A. Zighed, Jan Komorowski, Jan Zytkow
Publisher: Springer Verlag
Pages: 265-276
Number of pages: 12
ISBN (print): 9783540410669
Publication status: Published - 1 Jan 2000
Externally published: Yes
Event: 4th European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD 2000 - Lyon, France
Duration: 13 Sep 2000 to 16 Sep 2000

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 1910
ISSN (print): 0302-9743
ISSN (electronic): 1611-3349

Conference

Conference: 4th European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD 2000
Country/Territory: France
City: Lyon
Period: 13/09/00 to 16/09/00
