skip to main content
10.1145/2512938.2512953acmconferencesArticle/Chapter ViewAbstractPublication PagescosnConference Proceedingsconference-collections
research-article

Hierarchical community decomposition via oblivious routing techniques

Published: 07 October 2013 Publication History

Abstract

The detection of communities in real-world large-scale complex networks is a fundamental step in many applications, such as describing community structure and predicting the dissemination of information. Unfortunately, community detection is a computationally expensive task. Indeed, many approaches with strong theoretic guarantees are infeasible when applied to networks of large scale. Numerous approaches have been designed to scale community detection algorithms, many of which leverage local optimizations or local greedy decisions to iteratively find the communities. Solely relying on local techniques to detect communities, rather than a global objective function, can fail to detect global structure of the network.
In this work, we instead formulate a notion of a hierarchical community decomposition (HCD), which takes a more global view of hierarchical community structure. Our main contributions are as follows. We formally define a (λ, delta)-HCD where λ parametrizes the connectivity within each sub-community at the same hierarchical level and δ parametrizes the relationship between communities across two consecutive levels. Based on a method of Racke originally designed for oblivious routing, we provide an algorithm to construct a HCD and prove that an (O(log n);O(1))-HCD can always be found for any n-node input graph. Further, our algorithm does not rely on a pre-specified number of communities or depth of decomposition. Since the algorithm is of exponential complexity, we also describe a practical efficient, yet heuristic, implementation and perform an experimental validation on synthetic and real-world networks. We experiment first with synthetic networks with well-defined "intended" decompositions, on which we verify the quality of the decompositions produced by our method. Armed with the confidence these positive results provide, we use our implementation to compute the hierarchical community structure of more complex, real-world networks.

References

[1]
Data archive at http://http://snap.stanford.edu/.
[2]
Reid Andersen, Fan Chung, and Kevin Lang. Local graph partitioning using pagerank vectors. In Foundations of Computer Science, 2006. FOCS'06. 47th Annual IEEE Symposium on, pages 475--486. IEEE, 2006.
[3]
Reid Andersen and Kevin J Lang. Communities from seed sets. In Proceedings of the 15th international conference on World Wide Web, pages 223--232. ACM, 2006.
[4]
Sanjeev Arora, Rong Ge, Sushant Sachdeva, and Grant Schoenebeck. Finding overlapping communities in social networks: toward a rigorous approach. In Proceedings of the 13th ACM Conference on Electronic Commerce, pages 37--54. ACM, 2012.
[5]
Sanjeev Arora, Satish Rao, and Umesh Vazirani. Expander flows, geometric embeddings and graph partitioning. In Proceedings of the thirty-sixth annual ACM symposium on Theory of computing, pages 222--231. ACM, 2004.
[6]
Maria Florina Balcan and Yingyu Liang. Modeling and detecting community hierarchies. International Workshop on Similarity Based Pattern Analysis and Recognition., 9(10):29--30, 2013.
[7]
M.F. Balcan, C. Borgs, M. Braverman, J. Chayes, and S.H. Teng. Finding endogenously formed communities. arXiv preprint arXiv:1201.4899, 2012.
[8]
M. Bienkowski, M. Korzeniowski, and H. Racke. A practical algorithm for constructing oblivious routing schemes. In In Proc. of the 15th SPAA, pages 24--33, 2003.
[9]
V.D Blondel, J.-L. Guillaume, R. Lambiotte, and E. Lefebvre. Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment, 10, 2008.
[10]
Ulrik Brandes and Thomas Erlebach. Network analysis: methodological foundations, volume 3418. Springer, 2005.
[11]
J. Cheeger. A lower bound for the smallest eigenvalue of the Laplacian. Probl. Analysis, Sympos. in Honor of Salomon Bochner, Princeton Univ. 1969, 195--199, 1970.
[12]
Fan RK Chung. Spectral graph theory. CBMS Regional Conference Series in Math, No. 92, 1997.
[13]
Aaron Clauset, Cristopher Moore, and Mark EJ Newman. Hierarchical structure and the prediction of missing links in networks. Nature, 453(7191):98--101, 2008.
[14]
Aaron Clauset, Mark EJ Newman, and Cristopher Moore. Finding community structure in very large networks. Physical review E, 70(6):066111, 2004.
[15]
Anne Condon and Richard M Karp. Algorithms for graph partitioning on the planted partition model. Random Structures and Algorithms, 18(2):116--140, 2001.
[16]
M Cosentino Lagomarsino, P Jona, B Bassetti, and H Isambert. Hierarchy and feedback in the evolution of the escherichia coli transcription network. Proceedings of the National Academy of Sciences, 104(13):5516--5520, 2007.
[17]
Facebook. https://www.facebook.com/about/graphsearch.
[18]
Gary William Flake, Steve Lawrence, and C Lee Giles. Efficient identification of web communities. In Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 150--160. ACM, 2000.
[19]
Gary William Flake, Robert E Tarjan, and Kostas Tsioutsiouliklis. Graph clustering and minimum cut trees. Internet Mathematics, 1(4):385--408, 2004.
[20]
Santo Fortunato. Community detection in graphs. Physics Reports, 486(3):75--174, 2010.
[21]
Santo Fortunato and Marc Barthelemy. Resolution limit in community detection. Proceedings of the National Academy of Sciences, 104(1):36--41, 2007.
[22]
Michael R. Garey and David S. Johnson. Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman & Co., New York, NY, USA, 1979.
[23]
Michelle Girvan and Mark EJ Newman. Community structure in social and biological networks. Proceedings of the National Academy of Sciences, 99(12):7821--7826, 2002.
[24]
Pablo M Gleiser and Leon Danon. Community structure in jazz. Advances in complex systems, 6(04):565--573, 2003.
[25]
C. Harrelson, K. Hildrum, and S. Rao. A polynomial-time decomposition to minimize congestion. In In Proc. of the 15th SPAA, 2003.
[26]
Dorit S Hochbaum. Approximation algorithms for NP-hard problems. PWS Publishing Co., 1996.
[27]
Jianbin Huang, Heli Sun, Jiawei Han, Hongbo Deng, Yizhou Sun, and Yaguang Liu. Shrink: a structural clustering algorithm for detecting hierarchical communities in networks. In Proceedings of the 19th ACM international conference on Information and knowledge management, pages 219--228. ACM, 2010.
[28]
R. Kannan, S. Vempala, and A. Vetta. On clusterings: Good, bad and spectral. Journal of the ACM, 51(3):497--515, May 2004.
[29]
Leo Katz. A new status index derived from sociometric analysis. Psychometrika, 18(1):39--43, 1953.
[30]
Jon M Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM (JACM), 46(5):604--632, 1999.
[31]
Andrea Lancichinetti and Santo Fortunato. Community detection algorithms: A comparative analysis. Physical review E, 80(5):056117, 2009.
[32]
Tom Leighton and Satish Rao. Multicommodity max-flow min-cut theorems and their use in designing approximation algorithms. J. ACM, 46(6):787--832, November 1999.
[33]
Jure Leskovec, Jon Kleinberg, and Christos Faloutsos. Graph evolution: Densification and shrinking diameters. ACM Transactions on Knowledge Discovery from Data (TKDD), 1(1):2, 2007.
[34]
Jure Leskovec, Kevin J Lang, and Michael Mahoney. Empirical comparison of algorithms for network community detection. In Proceedings of the 19th international conference on World wide web, pages 631--640. ACM, 2010.
[35]
David Lusseau, Karsten Schneider, Oliver J Boisseau, Patti Haase, Elisabeth Slooten, and Steve M Dawson. The bottlenose dolphin community of doubtful sound features a large proportion of long-lasting associations. Behavioral Ecology and Sociobiology, 54(4):396--405, 2003.
[36]
Julian McAuley and Jure Leskovec. Learning to discover social circles in ego networks. In Advances in Neural Information Processing Systems 25, pages 548--556, 2012.
[37]
Karl Menger. Zur allgemeinen kurventheorie. Fundamenta Mathematicae, 10(1):96--115, 1927.
[38]
Nina Mishra, Robert Schreiber, Isabelle Stanton, and Robert E Tarjan. Clustering social networks. In Algorithms and Models for the Web-Graph, pages 56--67. Springer, 2007.
[39]
H. Racke. Minimizing congestion in general networks. In In Proc. of the 43rd FOCS, pages 43--52, 2002.
[40]
Harald Racke. Minimizing congestion in general networks. In Foundations of Computer Science, 2002. Proceedings. The 43rd Annual IEEE Symposium on, pages 43--52. IEEE, 2002.
[41]
Erzsébet Ravasz, Anna Lisa Somera, Dale A Mongru, Zoltán N Oltvai, and A-L Barabási. Hierarchical organization of modularity in metabolic networks. Science, 297(5586):1551--1555, 2002.
[42]
Satu Elisa Schaeffer. Graph clustering. Computer Science Review, 1(1):27--64, 2007.
[43]
Michael Schweinberger and Tom AB Snijders. Settings in social networks: A measurement model. Sociological Methodology, 33(1):307--341, 2003.
[44]
Jiří Šíma and Satu Schaeffer. On the NP-completeness of some graph cluster measures. SOFSEM 2006: Theory and Practice of Computer Science, pages 530--537, 2006.
[45]
Daniel A Spielman and Shang-Hua Teng. Nearly-linear time algorithms for graph partitioning, graph sparsification, and solving linear systems. In Proceedings of the thirty-sixth annual ACM symposium on Theory of computing, pages 81--90. ACM, 2004.
[46]
Christo Wilson, Alessandra Sala, Krishna P. N. Puttaswamy, and Ben Y. Zhao. Beyond social graphs: User interactions in online social networks and their implications. ACM Trans. Web, pages 17:1--17:31, 2012.

Cited By

View all

Index Terms

  1. Hierarchical community decomposition via oblivious routing techniques

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    COSN '13: Proceedings of the first ACM conference on Online social networks
    October 2013
    254 pages
    ISBN:9781450320849
    DOI:10.1145/2512938
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 07 October 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. approximation algorithms
    2. community detection
    3. hierarchical community decomposition
    4. räcke decompositions

    Qualifiers

    • Research-article

    Conference

    COSN'13
    Sponsor:
    COSN'13: Conference on Online Social Networks
    October 7 - 8, 2013
    Massachusetts, Boston, USA

    Acceptance Rates

    COSN '13 Paper Acceptance Rate 22 of 138 submissions, 16%;
    Overall Acceptance Rate 69 of 307 submissions, 22%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)2
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 18 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media