Résumé
We propose a hard and a fuzzy diagonal co-clustering algorithms built upon the double K-means to address the problem of document-term co-clustering. At each iteration, the proposed algorithms seek a diagonal block structure of the data by minimizing a criterion based on both the variance within the class and the centroid effect. In addition to be easy-to-interpret and effective on sparse binary and continuous data, the proposed algorithms, Hard Diagonal Double K-means (DDKM) and Fuzzy Diagonal Double K-means (F-DDKM), are also faster than other state-of-the-art clustering algorithms. We evaluate our contribution using synthetic data sets, and real data sets commonly used in document clustering.
| langue originale | Anglais |
|---|---|
| Pages (de - à) | 133-147 |
| Nombre de pages | 15 |
| journal | Neurocomputing |
| Volume | 193 |
| Les DOIs | |
| état | Publié - 12 juin 2016 |
| Modification externe | Oui |
Empreinte digitale
Examiner les sujets de recherche de « Hard and fuzzy diagonal co-clustering for document-term partitioning ». Ensemble, ils forment une empreinte digitale unique.Contient cette citation
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver