skip to main content
10.1145/2736277.2741119acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Density-friendly Graph Decomposition

Published: 18 May 2015 Publication History

Abstract

Decomposing a graph into a hierarchical structure via k-core analysis is a standard operation in any modern graph-mining toolkit. k-core decomposition is a simple and efficient method that allows to analyze a graph beyond its mere degree distribution. More specifically, it is used to identify areas in the graph of increasing centrality and connectedness, and it allows to reveal the structural organization of the graph.
Despite the fact that k-core analysis relies on vertex degrees, k-cores do not satisfy a certain, rather natural, density property. Simply put, the most central k-core is not necessarily the densest subgraph. This inconsistency between k-cores and graph density provides the basis of our study.
We start by defining what it means for a subgraph to be locally-dense, and we show that our definition entails a nested chain decomposition of the graph, similar to the one given by k-cores, but in this case the components are arranged in order of increasing density. We show that such a locally-dense decomposition for a graph G = (V, E) can be computed in polynomial time. The running time of the exact decomposition algorithm is O(|V|^2|E|) but is significantly faster in practice. In addition, we develop a linear-time algorithm that provides a factor-2 approximation to the optimal locally-dense decomposition. Furthermore, we show that the k-core decomposition is also a factor-2 approximation, however, as demonstrated by our experimental evaluation, in practice k-cores have different structure than locally-dense subgraphs, and as predicted by the theory, k-cores are not always well-aligned with graph density.

References

[1]
J. Abello, M. Resende, and S. Sudarsky. Massive quasi-clique detection. In LATIN 2002: Theoretical Informatics, pages 598--612, 2002.
[2]
A. Agresti. Analysis of Ordinal Categorical Data. John Wiley & Sons, 2nd edition, 2010.
[3]
J. I. Alvarez-Hamelin, L. Dall'Asta, A. Barrat, and A. Vespignani. k-core decomposition: a tool for the visualization of large scale networks. CoRR, abs/cs/0504107, 2005.
[4]
Y. Asahiro, K. Iwama, H. Tamaki, and T. Tokuyama. Greedily finding a dense subgraph. SWAT, pages 136--148, 1996.
[5]
M. Ayer, H. Brunk, G. Ewing, and W. Reid. An empirical distribution function for sampling with incomplete information. The Annals of Mathematical Statistics, 26(4):641--647, 1955.
[6]
G. Bader and C. Hogue. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinformatics, 4(1), 2003.
[7]
B. Balasundaram, S. Butenko, and I. V. Hicks. Clique relaxations in social network analysis: The maximum k-plex problem. Operations Research, 59(1):133--142, 2011.
[8]
B. Bollobás. The evolution of random graphs. Transactions of the American Mathematical Society, 286(1):257--274, 1984.
[9]
F. Bonchi, F. Gullo, A. Kaltenbrunner, and Y. Volkovich. Core decomposition of uncertain graphs. In Proceedings of the International Conference on Knowledge Discovery and Data Mining (KDD), pages 1316--1325, 2014.
[10]
C. Bron and J. Kerbosch. Algorithm 457: Finding all cliques of an undirected graph. Communications of the ACM, 16(9):575--577, 1973.
[11]
T. Calders, N. Dexters, J. J. M. Gillis, and B. Goethals. Mining frequent itemsets in a stream. Information Systems, 39:233--255, 2014.
[12]
S. Carmi, S. Havlin, S. Kirkpatrick, Y. Shavitt, and E. Shir. A model of internet topology using k-shell decomposition. Proceedings of the National Academy of Sciences, 104(27):11150--11154, 2007.
[13]
M. Charikar. Greedy approximation algorithms for finding dense components in a graph. APPROX, 2000.
[14]
W. Dinkelbach. On nonlinear fractional programming. Management Science, 13(7):492--498, 1967.
[15]
A. V. Goldberg. Finding a maximum density subgraph. University of California Berkeley Technical report, 1984.
[16]
P. Hagmann, L. Cammoun, X. Gigandet, R. Meuli, C. J. Honey, V. J. Wedeen, and O. Sporns. Mapping the structural core of human cerebral cortex. PLoS, Biology, 6(7):888--893, 2008.
[17]
J. Hástad. Clique is hard to approximate within n1 In Proceedings of the Annual Symposium on Foundations of Computer Science (FOCS), pages 627--636, 1996.
[18]
S. Khuller and B. Saha. On finding dense subgraphs. In Automata, Languages and Programming, volume 5555, pages 597--608, 2009.
[19]
M. Kitsak, L. K. Gallos, S. Havlin, F. Liljeros, L. Muchnik, H. E. Stanley, and H. A. Makse. Identification of in uential spreaders in complex networks. Nature physics, 6(11):888--893, 2010.
[20]
D. Matula and L. Beck. Smallest-last ordering and clustering and graph coloring algorithms. Journal of the ACM, 30(3):417--427, 1983.
[21]
R. J. Mokken. Cliques, clubs and clans. Quality and Quantity, 13(2):161--173, 1979.
[22]
J. Orlin. Max flows in O(nm) time, or better. In Proceedings of the Annual ACM Symposium on Theory of Computing (STOC), pages 765--774, 2013.
[23]
S. Seidman. Network structure and minimum degree. Social Networks, 5(3):269--287, 1983.
[24]
S. B. Seidman and B. L. Foster. A graph-theoretic generalization of the clique concept. Journal of Mathematical sociology, 6(1):139--154, 2010.
[25]
N. Tatti and A. Gionis. Discovering nested communities. In Machine Learning and Knowledge Discovery in Databases|European Conference, ECML PKDD 2013, pages 32--47, 2013.
[26]
C. E. Tsourakakis. A novel approach to finding near-cliques: The triangle-densest subgraph problem. CoRR, 2014.
[27]
C. E. Tsourakakis, F. Bonchi, A. Gionis, F. Gullo, and M. A. Tsiarli. Denser than the densest subgraph: extracting optimal quasi-cliques with quality guarantees. In The 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2013, pages 104--112, 2013.
[28]
J. Ugander, L. Backstrom, C. Marlow, and J. Kleinberg. Structural diversity in social contagion. Proceedings of the National Academy of Sciences, 109(16):5962--5966, 2012.
[29]
T. Uno. An efficient algorithm for solving pseudo clique enumeration problem. Algorithmica, 56(1):3--16, 2010.

Cited By

View all

Index Terms

  1. Density-friendly Graph Decomposition

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WWW '15: Proceedings of the 24th International Conference on World Wide Web
    May 2015
    1460 pages
    ISBN:9781450334693

    Sponsors

    • IW3C2: International World Wide Web Conference Committee

    In-Cooperation

    Publisher

    International World Wide Web Conferences Steering Committee

    Republic and Canton of Geneva, Switzerland

    Publication History

    Published: 18 May 2015

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. community detection
    2. dense subgraphs
    3. k-core analysis

    Qualifiers

    • Research-article

    Conference

    WWW '15
    Sponsor:
    • IW3C2

    Acceptance Rates

    WWW '15 Paper Acceptance Rate 131 of 929 submissions, 14%;
    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)38
    • Downloads (Last 6 weeks)4
    Reflects downloads up to 27 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media