skip to main content
10.1145/2755573.2755591acmconferencesArticle/Chapter ViewAbstractPublication PagesspaaConference Proceedingsconference-collections
research-article

Space and Time Efficient Parallel Graph Decomposition, Clustering, and Diameter Approximation

Published: 13 June 2015 Publication History

Abstract

We develop a novel parallel decomposition strategy for unweighted, undirected graphs, based on growing disjoint connected clusters from batches of centers progressively selected from yet uncovered nodes. With respect to similar previous decompositions, our strategy exercises a tighter control on both the number of clusters and their maximum radius. We present two important applications of our parallel graph decomposition: (1) $k$-center clustering approximation; and (2) diameter approximation. In both cases, we obtain algorithms which feature a polylogarithmic approximation factor and are amenable to a distributed implementation that is geared for massive (long-diameter) graphs. The total space needed for the computation is linear in the problem size, and the parallel depth is substantially sublinear in the diameter for graphs with low doubling dimension. To the best of our knowledge, ours are the first parallel approximations for these problems which achieve sub-diameter parallel time, for a relevant class of graphs, using only linear space. Besides the theoretical guarantees, our algorithms allow for a very simple implementation on clustered architectures: we report on extensive experiments which demonstrate their effectiveness and efficiency on large graphs as compared to alternative known approaches.

References

[1]
Project PEGASUS. www.cs.cmu.edu/~pegasus.
[2]
I. Abraham, S. Chechik, C. Gavoille, and D. Peleg. Forbidden-set distance labels for graphs of bounded doubling dimension. In ACM PODC, pages 192--200, 2010.
[3]
D. Ajwani, U. Meyer, and D. Veith. I/O-efficient hierarchical diameter approximation. In ESA, pages 72--83, 2012.
[4]
S. Baswana and S. Sen. A simple and linear time randomized algorithm for computing sparse spanners in weighted graphs. Random Structures & Algorithms, 30(4):532--563, 2007.
[5]
G. E. Blelloch, A. Gupta, I. Koutis, G. L. Miller, R. Peng, and K. Tangwongsan. Near linear-work parallel sdd solvers, low-diameter decomposition, and low-stretch subgraphs. In SPAA, pages 13--22, 2011.
[6]
P. Boldi, M. Rosa, and S. Vigna. HyperANF: approximating the neighbourhood function of very large graphs on a budget. In WWW, pages 625--634, 2011.
[7]
S. Chechik, D. Larkin, L. Roditty, G. Schoenebeck, R. E. Tarjan, and V. V. Williams. Better approximation algorithms for the graph diameter. In SODA, pages 1041--1052, 2014.
[8]
E. Cohen. Fast algorithms for constructing $t$-spanners and paths with stretch $t$. SIAM J. Comput., 28(1):210--236, 1998.
[9]
E. Cohen. Polylog-time and near-linear work approximation scheme for undirected shortest paths. J. ACM, 47(1):132--166, 2000.
[10]
P. Crescenzi, R. Grossi, M. Habib, L. Lanzi, and A. Marino. On computing the diameter of real-world undirected graphs. Theor. Comput. Sci., 514:84--95, 2013.
[11]
J. Dean and S. Ghemawat. Mapreduce: simplified data processing on large clusters. Communications of the ACM, 51(1):107--113, 2008.
[12]
A. Ene, S. Im, and B. Moseley. Fast clustering using mapreduce. In KDD, pages 681--689, 2011.
[13]
T. F. Gonzalez. Clustering to minimize the maximum intercluster distance. Theor. Comput. Sci., 38:293--306, 1985.
[14]
M. Goodrich, N. Sitchinava, and Q. Zhang. Sorting, searching, and simulation in the MapReduce framework. In ISAAC, pages 374--383, 2011.
[15]
D. S. Hochbaum and D. B. Shmoys. A best possible parallel approximation algorithm to a graph theoretic problem. Operational Research, pages 933--938, 1987.
[16]
U. Kang, C. E. Tsourakakis, A. P. Appel, C. Faloutsos, and J. Leskovec. Hadi: Mining radii of large graphs. TKDD, 5(2), 2011.
[17]
D. R. Karger and M. Ruhl. Finding nearest-neighbors in growth-restricted metrics. In STOC, pages 741--750, 2002.
[18]
H. Karloff, S. Suri, and S. Vassilvitskii. A model of computation for mapreduce. In SODA, pages 938--948, 2010.
[19]
P. N. Klein and S. Sairam. A parallel randomized approximation scheme for shortest paths. In STOC, pages 750--758, 1992.
[20]
Laboratory for Web Algorithmics, University of Milano. http://law.di.unimi.it
[21]
U. Meyer. On trade-offs in external-memory diameter-approximation. In SWAT, pages 426--436, 2008.
[22]
G. L. Miller, R. Peng, and S. C. Xu. Parallel graph decompositions using random shifts. In SPAA, pages 196--203, 2013.
[23]
C. R. Palmer, P. B. Gibbons, and C. Faloutsos. Anf: a fast and scalable tool for data mining in massive graphs. In KDD, pages 81--90, 2002.
[24]
A. Pietracaprina, G. Pucci, M. Riondato, F. Silvestri, and E. Upfal. Space-round tradeoffs for mapreduce computations. In ICS, pages 235--244, 2012.
[25]
Stanford Large Network Dataset Collection http://snap.stanford.edu/data
[26]
Spark: Lightning-fast cluster computing. http://spark.apache.org
[27]
J. D. Ullman and M. Yannakakis. High-probability parallel transitive-closure algorithms. SIAM J. Comput., 20(1):100--125, 1991.
[28]
V. V. Vazirani. Approximation algorithms. Springer, 2001.
[29]
M. Zaharia, M. Chowdhury, T. Das, A. Dave, J. Ma, M. McCauly, M. J. Franklin, S. Shenker, and I. Stoica. Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing. In NSDI, pages 15--28, 2012.

Cited By

View all

Index Terms

  1. Space and Time Efficient Parallel Graph Decomposition, Clustering, and Diameter Approximation

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SPAA '15: Proceedings of the 27th ACM symposium on Parallelism in Algorithms and Architectures
    June 2015
    362 pages
    ISBN:9781450335881
    DOI:10.1145/2755573
    • General Chair:
    • Guy Blelloch,
    • Program Chair:
    • Kunal Agrawal
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 13 June 2015

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. diameter approximation
    2. graph decomposition
    3. k-center problem
    4. mapreduce
    5. parallel graph algorithms

    Qualifiers

    • Research-article

    Funding Sources

    • MIUR of Italy
    • NSF
    • University of Padova

    Conference

    SPAA '15

    Acceptance Rates

    SPAA '15 Paper Acceptance Rate 31 of 131 submissions, 24%;
    Overall Acceptance Rate 447 of 1,461 submissions, 31%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)8
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 25 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media