TY - GEN
T1 - Unexpected Attributed Subgraphs
T2 - 15th IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2023
AU - Delarue, Simon
AU - Viard, Tiphaine
AU - Dessalles, Jean Louis
N1 - Publisher Copyright:
© 2023 ACM.
PY - 2023/11/6
Y1 - 2023/11/6
N2 - Graphs are ubiquitous in real-world data, ranging from the study of social interactions to bioinformatics or the modelling of physical systems. These real-world graphs are typically sparse, possibly large and frequently contain additional information in the form of attributes, making them a complex object to understand. Graph summarization techniques can help facilitate the discovery of hidden patterns in underlying data by providing an interesting subset of the interactions and available attributes, which we broadly call a pattern. However, determining what is considered interesting in this context is not straightforward. We address this challenge by designing an interestingness measure based on the information-theoretic measure of Unexpectedness, linking the concepts of relevance and Kolmogorov complexity. We design a pattern mining algorithm to provide a summary of the initial data in the form of a set of unexpected patterns, that is, patterns for which there is a drop between their expected complexity and the observed complexity. Experimental results on five real-world datasets with state-of-the-art methods demonstrate that our method exhibits a small number of diversified patterns, providing a human-readable summary of the initial attributed graph; we show that our summaries quantitatively outperforms attribute-only and interaction-only baselines as well as other pattern mining methods, reinforcing the need for methods dealing with attributed graphs. We visualize summaries extracted with our method, in order to qualitatively validate their readability.
AB - Graphs are ubiquitous in real-world data, ranging from the study of social interactions to bioinformatics or the modelling of physical systems. These real-world graphs are typically sparse, possibly large and frequently contain additional information in the form of attributes, making them a complex object to understand. Graph summarization techniques can help facilitate the discovery of hidden patterns in underlying data by providing an interesting subset of the interactions and available attributes, which we broadly call a pattern. However, determining what is considered interesting in this context is not straightforward. We address this challenge by designing an interestingness measure based on the information-theoretic measure of Unexpectedness, linking the concepts of relevance and Kolmogorov complexity. We design a pattern mining algorithm to provide a summary of the initial data in the form of a set of unexpected patterns, that is, patterns for which there is a drop between their expected complexity and the observed complexity. Experimental results on five real-world datasets with state-of-the-art methods demonstrate that our method exhibits a small number of diversified patterns, providing a human-readable summary of the initial attributed graph; we show that our summaries quantitatively outperforms attribute-only and interaction-only baselines as well as other pattern mining methods, reinforcing the need for methods dealing with attributed graphs. We visualize summaries extracted with our method, in order to qualitatively validate their readability.
KW - complex networks
KW - information theory
KW - pattern mining
U2 - 10.1145/3625007.3627314
DO - 10.1145/3625007.3627314
M3 - Conference contribution
AN - SCOPUS:85190626828
T3 - Proceedings of the 2023 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2023
SP - 171
EP - 178
BT - Proceedings of the 2023 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2023
A2 - Aditya Prakash, B.
A2 - Wang, Dong
A2 - Weninger, Tim
PB - Association for Computing Machinery, Inc
Y2 - 6 November 2023 through 9 November 2023
ER -