skip to main content
research-article

A distributed streaming framework for edge–cloud triangle counting in graph streams

Published: 25 October 2023 Publication History

Abstract

The triangle counting problem in graph streams has been extensively studied in social network analysis, recommendation systems, user portraits and other fields. However, cloud computing based streaming algorithms cause high bandwidth occupation and long transmission latency due to limited bandwidth of the cloud. Recently, edge computing is promising to overcome the issue of transmitting large-scale data for cloud computing. However, directly applying edge computing in streaming triangle counting will reduce the accuracy of the triangle count estimation, due to the limitation of local computing at the edge network. We term the cooperations between edge computing and cloud computing for streaming triangle counting as edge–cloud triangle counting in graph streams. In this paper, we first propose a streaming framework for edge–cloud triangle counting in graph streams. Then, we propose a streaming triangle counting algorithm called Trie-based Edge Compression (TbEC) by using the binary trie at the edge network that enables lossless compression and efficient transmission to the cloud. In addition, to extend our algorithms for triangle counting in multigraphs, we present a dual deduplication strategy collaboratively using the trie-based data structure and a Bloom Filter. Our experiments with real-world datasets show that TbEC is (a) Accurate: yielding up to 3.35 × more accurate smaller estimation error than the state-of-the-art distributed streaming algorithm, (b) Fast: yielding up to 10.59 × faster than the state-of-the-art distributed streaming algorithm, (c) Scalable: scaling linearly with the number of edges in the input graph stream.

Highlights

A framework of edge–cloud triangle counting to count triangles in graph streams.
Algorithms TbEC and TbECOPT to reduce the transmission cost between the edge and cloud.
Algorithms TbEC-BF and TbECOPT-BF to accurately count triangles in multigraphs.

References

[1]
Zhou Z., Liu S., Xu G., Zhang W., On completing sparse knowledge base with transitive relation embedding, in: The Thirty-Third AAAI Conference on Artificial Intelligence, AAAI Press, 2019, pp. 3125–3132,.
[2]
Hasan M.A., Dave V.S., Triangle counting in large networks: a review, WIREs Data Min. Knowl. Discov. 8 (2) (2018),.
[3]
Sotiropoulos K., Tsourakakis C.E., Triangle-aware spectral sparsifiers and community detection, in: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, ACM, 2021, pp. 1501–1509,.
[4]
Tsourakakis C.E., Drineas P., Michelakis E., Koutis I., Faloutsos C., Spectral counting of triangles via element-wise sparsification and triangle-based link recommendation, Soc. Netw. Anal. Min. 1 (2) (2011) 75–81,.
[5]
Jiang W., Jiao Y., Wang Q., Liang C., Guo L., Zhang Y., Sun Z., Xiong Y., Zhu Y., Triangle graph interest network for click-through rate prediction, in: WSDM ’22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21 - 25, 2022, ACM, 2022, pp. 401–409,.
[6]
Zhang Y., Gorlatch S., A bilateral recommendation strategy for mobile event-based social networks, in: MobiQuitous ’20: Computing, Networking and Services, Virtual Event / Darmstadt, Germany, December 7-9, 2020, ACM, 2020, pp. 415–424,.
[7]
Lumezanu C., Baden R., Spring N., Bhattacharjee B., Triangle inequality variations in the internet, in: Proceedings of the 9th ACM SIGCOMM Internet Measurement Conference, ACM, 2009, pp. 177–183,.
[8]
Ramesh D., Arora N.S., Spark’s graphx-based link prediction for social communication using triangle counting, Soc. Netw. Anal. Min. 9 (1) (2019) 28:1–28:12,.
[9]
Sardiu M.E., Gilmore J.M., Groppe B.D., Dutta A., Florens L., Washburn M.P., Topological scoring of protein interaction networks, Nature Commun. 10 (1) (2019) 1–14.
[10]
Bader D.A., Li F., Ganeshan A., Gundogdu A., Lew J., Rodriguez O.A., Du Z., Communication-efficient triangle counting, 2023, arXiv:2210.00389.
[11]
Gou X., Zou L., Sliding window-based approximate triangle counting over streaming graphs with duplicate edges, in: International Conference on Management of Data, ACM, 2021, pp. 645–657,.
[12]
Hoffa C., Mehta G., Freeman T., Deelman E., Keahey K., Berriman G.B., Good J., On the use of cloud computing for scientific workflows, in: Fourth International Conference on E-Science, IEEE, 2008, pp. 640–645,.
[13]
Bar-Yossef Z., Kumar R., Sivakumar D., Reductions in streaming algorithms, with an application to counting triangles in graphs, in: Proceedings of the Thirteenth Annual ACM-SIAM Symposium on Discrete Algorithms, ACM/SIAM, 2002, pp. 623–632. URL http://dl.acm.org/citation.cfm?id=545381.545464.
[14]
Tsourakakis C.E., Kang U., Miller G.L., Faloutsos C., DOULION: counting triangles in massive graphs with a coin, in: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, 2009, pp. 837–846,.
[15]
Ahmed N.K., Duffield N.G., Neville J., Kompella R.R., Graph sample and hold: a framework for big-graph analytics, in: The 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, 2014, pp. 1446–1455,.
[16]
Lim Y., Kang U., MASCOT: memory-efficient and accurate sampling for counting local triangles in graph streams, in: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, 2015, pp. 685–694,.
[17]
Lim Y., Jung M., Kang U., Memory-efficient and accurate sampling for counting local triangles in graph streams: From simple to multigraphs, ACM Trans. Knowl. Discov. Data 12 (1) (2018) 4:1–4:28,.
[18]
Stefani L.D., Epasto A., Riondato M., Upfal E., Trièst: Counting local and global triangles in fully-dynamic streams with fixed memory size, in: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, 2016, pp. 825–834,.
[19]
Shin K., Oh S., Kim J., Hooi B., Faloutsos C., Fast, accurate and provable triangle counting in fully dynamic graph streams, ACM Trans. Knowl. Discov. Data 14 (2) (2020) 12:1–12:39,.
[20]
Shin K., Hammoud M., Lee E., Oh J., Faloutsos C., Tri-fly: Distributed estimation of global and local triangle counts in graph streams, in: 22nd Pacific-Asia Conference, PAKDD, vol. 10939, Springer, 2018, pp. 651–663,.
[21]
Yu M., Song C., Gu J., Liu M., Distributed triangle counting algorithms in simple graph stream, in: 25th IEEE International Conference on Parallel and Distributed Systems, ICPADS, IEEE, 2019, pp. 294–301,.
[22]
Shin K., Lee E., Oh J., Hammoud M., Faloutsos C., Cocos: Fast and accurate distributed triangle counting in graph streams, ACM Trans. Knowl. Discov. Data 15 (3) (2021) 38:1–38:30,.
[23]
Yang X., Song C., Yu M., Gu J., Liu M., Distributed triangle approximately counting algorithms in simple graph stream, ACM Trans. Knowl. Discov. Data 16 (4) (2022) 79:1–79:43,.
[24]
Gaba G.S., Kumar G., Kim T., Monga H., Kumar P., Secure device-to-device communications for 5G enabled internet of things applications, Comput. Commun. 169 (2021) 114–128,.
[25]
Wen Z., Quoc D.L., Bhatotia P., Chen R., Lee M., ApproxIoT: Approximate analytics for edge computing, in: 38th IEEE International Conference on Distributed Computing Systems, ICDCS, IEEE, 2018, pp. 411–421,.
[26]
Fu X., Ghaffar T., Davis J.C., Lee D., EdgeWise: A better stream processing engine for the edge, in: USENIX Annual Technical Conference, USENIX, USENIX, 2019, pp. 929–946. URL https://www.usenix.org/conference/atc19/presentation/fu.
[27]
Tiwari A., Ramprasad B., Mortazavi S.H., Gabel M., de Lara E., Reconfigurable streaming for the mobile edge, in: Proceedings of the 20th International Workshop on Mobile Computing Systems and Applications, HotMobile, ACM, 2019, pp. 153–158,.
[28]
Zhao H., Yao L., Zeng Z., Li D., Xie J., Zhu W., Tang J., An edge streaming data processing framework for autonomous driving, Connect. Sci. 33 (2) (2021) 173–200,.
[29]
Wang S., Zhao Y., Huang L., Xu J., Hsu C., Qos prediction for service recommendations in mobile edge computing, J. Parallel Distrib. Comput. 127 (2019) 134–144,.
[30]
Vitter J.S., Random sampling with a reservoir, ACM Trans. Math. Softw. 11 (1) (1985) 37–57,.
[31]
Soni U., Lu Y., Hansen B., Purchase H.C., Kobourov S.G., Maciejewski R., The perception of graph properties in graph layouts, Comput. Graph. Forum 37 (3) (2018) 169–181,.
[32]
Jowhari H., Ghodsi M., New streaming algorithms for counting triangles in graphs, in: Computing and Combinatorics, 11th Annual International Conference, COCOON, vol. 3595, Springer, 2005, pp. 710–716,.
[33]
Buriol L.S., Frahling G., Leonardi S., Marchetti-Spaccamela A., Sohler C., Counting triangles in data streams, in: Proceedings of the Twenty-Fifth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, ACM, 2006, pp. 253–262,.
[34]
Stefani L.D., Epasto A., Riondato M., Upfal E., Trièst: Counting local and global triangles in fully dynamic streams with fixed memory size, ACM Trans. Knowl. Discov. Data 11 (4) (2017) 43:1–43:50,.
[35]
Riondato M., García-Soriano D., Bonchi F., Graph summarization with quality guarantees, in: International Conference on Data Mining, ICDM, IEEE, 2014, pp. 947–952,.
[36]
Beg M.A., Ahmad M., Zaman A., Khan I., Scalable approximation algorithm for graph summarization, in: 22nd Pacific-Asia Conference, PAKDD, vol. 10939, Springer, 2018, pp. 502–514,.
[37]
Gou X., Zou L., Zhao C., Yang T., Fast and accurate graph stream summarization, in: 35th IEEE International Conference on Data Engineering, ICDE, IEEE, 2019, pp. 1118–1129,.
[38]
Zhang D., Ni C., Zhang J., Zhang T., Yang P., Wang J., Yan H., A novel edge computing architecture based on adaptive stratified sampling, Comput. Commun. 183 (2022) 121–135,.
[39]
Xu J., Palanisamy B., Wang Q., Ludwig H., Gopisetty S., Amnis: Optimized stream processing for edge computing, J. Parallel Distrib. Comput. 160 (2022) 49–64,.
[40]
Pace P., Aloi G., Gravina R., Caliciuri G., Fortino G., Liotta A., An edge-based architecture to support efficient applications for healthcare industry 4.0, TII 15 (1) (2019) 481–489.
[41]
Li Q., Xu M., Chen M., NSFIB construction & aggregation with next hop of strict partial order, in: Proceedings of the IEEE INFOCOM 2013, IEEE, 2013, pp. 550–554,.
[42]
Chatterjee T., Ruj S., Bit S.D., Efficient data storage and name look-up in named data networking using connected dominating set and patricia trie, Autom. Control. Comput. Sci. 55 (4) (2021) 319–333,.
[43]
Larsson N.J., Extended application of suffix trees to data compression, in: Proceedings of the 6th Data Compression Conference (DCC), IEEE, 1996, pp. 190–199,.
[44]
Jung M., Lim Y., Lee S., Kang U., FURL: fixed-memory and uncertainty reducing local triangle counting for multigraph streams, Data Min. Knowl. Discov. 33 (5) (2019) 1225–1253.
[45]
Kang S., Lee K., Shin K., Are edge weights in summary graphs useful? - a comparative study, in: 26th Pacific-Asia Conference, PAKDD, 13280, Springer, 2022, pp. 54–67,.

Index Terms

  1. A distributed streaming framework for edge–cloud triangle counting in graph streams
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image Knowledge-Based Systems
        Knowledge-Based Systems  Volume 278, Issue C
        Oct 2023
        1022 pages

        Publisher

        Elsevier Science Publishers B. V.

        Netherlands

        Publication History

        Published: 25 October 2023

        Author Tags

        1. Triangle counting
        2. Approximate algorithms
        3. Streaming graphs
        4. Distributed streaming algorithms

        Qualifiers

        • Research-article

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • 0
          Total Citations
        • 0
          Total Downloads
        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 27 Dec 2024

        Other Metrics

        Citations

        View Options

        View options

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media