skip to main content
10.1145/2684822.2685323acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

On Integrating Network and Community Discovery

Published: 02 February 2015 Publication History

Abstract

The problem of community detection has recently been studied widely in the context of the web and social media networks. Most algorithms for community detection assume that the entire network is available for online analysis. In practice, this is not really true, because only restricted portions of the network may be available at any given time for analysis. Many social networks such as Facebook have privacy constraints, which do not allow the discovery of the entire structure of the social network. Even in the case of more open networks such as Twitter, it may often be challenging to crawl the entire network from a practical perspective. For many other scenarios such as adversarial networks, the discovery of the entire network may itself be a costly task, and only a small portion of the network may be discovered at any given time. Therefore, it can be useful to investigate whether network mining algorithms can integrate the network discovery process tightly into the mining process, so that the best results are achieved for particular constraints on discovery costs. In this context, we will discuss algorithms for integrating community detection with network discovery. We will tightly integrate with the cost of actually discovering a network with the community detection process, so that the two processes can support each other and are performed in a mutually cohesive way. We present experimental results illustrating the advantages of the approach.

References

[1]
L. Backstrom, D. P. Huttenlocher, J. M. Kleinberg, and X. Lan, Groupformation in large social networks: membership, growth, and evolution, ACM KDD Conference, pp. 44--54, 2006.
[2]
B. Ball, B. Karrer and M. E. J. Newman, An efficient and principledmethod for detecting communities in networks, Physical Review, E84(3), 036103, 2011.
[3]
. Baykan, M. Henzinger, S. Keller, S. De Castelberg, and M.Kinzler, A comparison of techniques for sampling web pages, CoRR, abs/0902.1604, 2009. http://arxiv.org/abs/0902.1604
[4]
U. Brandes, M. Gaertler and D. Wagner, Experiments on graphclustering algorithms, European Symposium on Algorithms, pp.568--579, 2003.
[5]
C. Aggarwal and H. Wang, Managing and Mining Graph Data,Springer, 2010.
[6]
C. Aggarwal, Social Network Data Analytics, Springer, 2011.
[7]
D. Chakrabarti, R. Kumar, and A. Tomkins, Evolutionary clustering, ACM KDD Conference, pp. 554--560, 2006.
[8]
Y. Chi, X. Song, D. Zhou, K. Hino, and B. L. Tseng, Evolutionaryspectral clustering by incorporating temporal smoothness, ACM KDD Conference, pp. 153--162, 2007.
[9]
A. Clauset, M. E. J. Newman, and C. Moore, Finding communitystructure in very large networks, Physical Review, E 70(6),066111, 2004.
[10]
S. Fortunato, Community detection in graphs, PhysicsReports, 486(3), pp. 75--174, 2010.
[11]
D. Gibson, R. Kumar, and A. Tomkins, Discovering large densesubgraphs in massive graphs, VLDB Conference, pp. 721--732,2005.
[12]
. Gjoka, M. Kurant, C. T. Butts, and A. Markopoulou, Walking inFacebook: A Case Study of Unbiased Sampling of OSNs, IEEEINFOCOM Conference, pp. 1--9, 2010.
[13]
. Gjoka, C. T. Butts, M. Kurant, and A. Markopoulou, MultigraphSampling of Online Social Networks, IEEE Journal onMeasurement of Internet Topologies, 29(9), pp. 1893--1905, October2011.
[14]
. R. Henzinger, A. Heydon, M. Mitzenmacher, and M. Najork, Onnear-uniform URL sampling, WWW Conference, pp. 295--308,2000.
[15]
R. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins, Trawlingthe web for emerging cyber-communities, WWW Conference, pp.1481--1493, 1999.
[16]
. Leskovec and C. Faloutsos, Sampling from largegraphs, ACM KDD Conference, pp. 631--636, 2006.
[17]
. Lin, X. Kong, P. Yu, Q. Wu, Y. Jia, andC. Li, Community detection in incomplete information networks, WWW Conference, pp. 341--350, 2012.
[18]
Y.-R. Lin, J. Sun, P. Castro, R. B. Konuru, H. Sundaram, andA. Kelliher, Extracting community structure through relationalhypergraphs, WWW Conference, pp. 1213--1214, 2009.
[19]
A. Y. Ng, M. Jordan, and Y. Weiss, On spectral clustering: Analysisand an algorithm, NIPS Conference, pp. 849--856, 2001.
[20]
. Qi, C. Aggarwal, and T. Huang, Communitydetection with edge content in social media networks, IEEE ICDEConference, pp. 534--545, 2012.
[21]
. Ribeiro and D. Towsley, Estimating and sampling graphs withmultidimensional random walks, ACM SIGCOMM Conference, pp.390--403, 2010.
[22]
S. Schaeffer, Graph Clustering, Computer Science Review,1(1), pp. 27--64, 2007.
[23]
J. Shi and J. Malik, Normalized Cuts and Image Segmentation, IEEE Transactions on Pattern Analysis and Intelligence, 22(8),pp. 888--905, 2000.
[24]
T. Yang, R. Jin, Y. Chi, and S. Zhu, Combining link and content for community detection: a discriminative approach, ACM KDD Conference, pp. 927--936, 2009.
[25]
. Ye, J. Lang, and F. Wu, Crawling online social graphs, Asia-Pacific Web Conference, pp. 236--242, 2010.
[26]
Z. Zeng, J. Wang, L. Zhou, and G. Karypis, Out-of-core coherentclosed quasi-clique mining from large dense graph databases, ACM TODS Journal, 31(2), 2007.
[27]
Y. Zhou, H. Cheng, and J. X. Yu, Graph clustering based onstructural/attribute similarities, VLDB Conference, pp. 718--729, 2009.
[28]
S. Zhang and H. Zhao, Community identification in networks withunbalanced structure, Physical Review, E 85(6), 066114, 2012.
[29]
S. Fortunato and M. Barthelemy, Resolution limit in community detection, Proceedings of the National Academy of Sciences, 104(1), pp. 36--41, 2007.

Cited By

View all

Index Terms

  1. On Integrating Network and Community Discovery

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WSDM '15: Proceedings of the Eighth ACM International Conference on Web Search and Data Mining
    February 2015
    482 pages
    ISBN:9781450333177
    DOI:10.1145/2684822
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 02 February 2015

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. community detection
    2. network discovery

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    WSDM 2015

    Acceptance Rates

    WSDM '15 Paper Acceptance Rate 39 of 238 submissions, 16%;
    Overall Acceptance Rate 498 of 2,863 submissions, 17%

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 17 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media