Abstract
Automatically segmenting text corpora into thematically related groups is a complex exploratory analysis problem. In this article, we outline our multi-stage exploratory analysis process and investigate the performance of a simple statistical model. After a description of this model and of its fitting procedure, we illustrate its performance on the segmentation of a corpus of CKM-related texts in English.
| Original language | English |
|---|---|
| Pages (from-to) | 13-22 |
| Number of pages | 10 |
| Journal | Management Information Systems |
| Volume | 10 |
| Status | Published - 1 Dec 2004 |
| Event | Fifth International Conference on Data Mining, DATA MINING V - Malaga, Spain. Duration: 15 Sept 2004 → 17 Sept 2004 |