TY - GEN
T1 - Finding subgraphs with maximum total density and limited overlap
AU - Balalau, Oana Denisa
AU - Bonchi, Francesco
AU - Chany, T. H.Hubert
AU - Gullo, Francesco
AU - Sozioz, Mauro
N1 - Publisher Copyright:
Copyright © 2015 ACM.
PY - 2015/2/2
Y1 - 2015/2/2
N2 - Finding dense subgraphs in large graphs is a key primitive in a variety of real-world application domains, encompass-ing social network analytics, event detection, biology, and finance. In most such applications, one typically aims at finding several (possibly overlapping) dense subgraphs which might correspond to communities in social networks or in-teresting events. While a large amount of work is devoted to finding a single densest subgraph, perhaps surprisingly, the problem of finding several dense subgraphs with limited overlap has not been studied in a principled way, to the best of our knowledge. In this work we define and study a natural generalization of the densest subgraph problem, where the main goal is to find at most k subgraphs with maximum to-tal aggregate density, while satisfying an upper bound on the pairwise Jaccard coefficient between the sets of nodes of the subgraphs. After showing that such a problem is NP-Hard, we devise an efficient algorithm that comes with provable guarantees in some cases of interest, as well as, an efficient practical heuristic. Our extensive evaluation on large real-world graphs confirms the efficiency and effectiveness of our algorithms.
AB - Finding dense subgraphs in large graphs is a key primitive in a variety of real-world application domains, encompass-ing social network analytics, event detection, biology, and finance. In most such applications, one typically aims at finding several (possibly overlapping) dense subgraphs which might correspond to communities in social networks or in-teresting events. While a large amount of work is devoted to finding a single densest subgraph, perhaps surprisingly, the problem of finding several dense subgraphs with limited overlap has not been studied in a principled way, to the best of our knowledge. In this work we define and study a natural generalization of the densest subgraph problem, where the main goal is to find at most k subgraphs with maximum to-tal aggregate density, while satisfying an upper bound on the pairwise Jaccard coefficient between the sets of nodes of the subgraphs. After showing that such a problem is NP-Hard, we devise an efficient algorithm that comes with provable guarantees in some cases of interest, as well as, an efficient practical heuristic. Our extensive evaluation on large real-world graphs confirms the efficiency and effectiveness of our algorithms.
U2 - 10.1145/2684822.2685298
DO - 10.1145/2684822.2685298
M3 - Conference contribution
AN - SCOPUS:84928713623
T3 - WSDM 2015 - Proceedings of the 8th ACM International Conference on Web Search and Data Mining
SP - 379
EP - 388
BT - WSDM 2015 - Proceedings of the 8th ACM International Conference on Web Search and Data Mining
PB - Association for Computing Machinery
T2 - 8th ACM International Conference on Web Search and Data Mining, WSDM 2015
Y2 - 31 January 2015 through 6 February 2015
ER -