Abstract
The present paper develops a probabilistic model to cluster the nodes of a dynamic graph, accounting for the content of textual edges as well as their frequency. Vertices are clustered in groups which are homogeneous both in terms of interaction frequency and discussed topics. The dynamic graph is considered stationary on a latent time interval if the proportions of topics discussed between each pair of node groups do not change in time during that interval. A classification variational expectation–maximization algorithm is adopted to perform inference. A model selection criterion is also derived to select the number of node groups, time clusters and topics. Experiments on simulated data are carried out to assess the proposed methodology. We finally illustrate an application to the Enron dataset.
| Original language | English |
|---|---|
| Pages (from-to) | 677-695 |
| Number of pages | 19 |
| Journal | Statistics and Computing |
| Volume | 29 |
| Issue number | 4 |
| DOIs | |
| Publication status | Published - 15 Jul 2019 |
| Externally published | Yes |
Keywords
- Dynamic random graph
- Latent Dirichlet allocation
- Model based clustering
- Stochastic block model
- Topic modeling
Fingerprint
Dive into the research topics of 'The dynamic stochastic topic block model for dynamic networks with textual edges'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver