Passer à la navigation principale Passer à la recherche Passer au contenu principal

Hard and fuzzy diagonal co-clustering for document-term partitioning

  • Université de Paris

Résultats de recherche: Contribution à un journalArticleRevue par des pairs

Résumé

We propose a hard and a fuzzy diagonal co-clustering algorithms built upon the double K-means to address the problem of document-term co-clustering. At each iteration, the proposed algorithms seek a diagonal block structure of the data by minimizing a criterion based on both the variance within the class and the centroid effect. In addition to be easy-to-interpret and effective on sparse binary and continuous data, the proposed algorithms, Hard Diagonal Double K-means (DDKM) and Fuzzy Diagonal Double K-means (F-DDKM), are also faster than other state-of-the-art clustering algorithms. We evaluate our contribution using synthetic data sets, and real data sets commonly used in document clustering.

langue originaleAnglais
Pages (de - à)133-147
Nombre de pages15
journalNeurocomputing
Volume193
Les DOIs
étatPublié - 12 juin 2016
Modification externeOui

Empreinte digitale

Examiner les sujets de recherche de « Hard and fuzzy diagonal co-clustering for document-term partitioning ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation