skip to main content
10.1145/3534540.3534690acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Anti-vertex for neighborhood constraints in subgraph queries

Published: 12 June 2022 Publication History

Abstract

This paper focuses on subgraph queries where constraints are present in the neighborhood of the explored subgraphs. We describe anti-vertex, a declarative construct that indicates absence of a vertex, i.e., the resulting subgraph should not have a vertex in its specified neighborhood that matches the anti-vertex. We formalize the semantics of anti-vertex to benefit from automatic reasoning and optimization, and to enable standardized implementation across query languages and runtimes. The semantics are defined for various matching semantics that are commonly employed in subgraph querying (isomorphism, homomorphism, and no-repeated-edge matching) and for the widely adopted property graph model. We illustrate several examples where anti-vertices can be employed to help familiarize with the anti-vertex concept. We further showcase how anti-vertex support can be added in existing graph query languages by developing prototype extensions of Cypher language. Finally, we study how anti-vertices interact with the symmetry breaking technique in subgraph matching frameworks so that their meaning remains consistent with the expected outcome of constrained neighborhoods to connected vertices.

References

[1]
Roberto Alonso and Stephan Günnemann. Mining contrasting quasi-clique patterns. CoRR, abs/1810.01836, 2018.
[2]
Amazon, Inc. Amazon Neptune, 2022. Version 1.1.0.0.
[3]
Khaled Ammar, Frank McSherry, Semih Salihoglu, and Manas Joglekar. Distributed Evaluation of Subgraph Queries Using Worst-Case Optimal Low-Memory Dataflows. Proceedings of the VLDB Endowment, 11(6):691--704, February 2018.
[4]
Renzo Angles, Marcelo Arenas, Pablo Barcelo, Peter Boncz, George Fletcher, Claudio Gutierrez, Tobias Lindaaker, Marcus Paradies, Stefan Plantikow, Juan Sequeda, Oskar van Rest, and Hannes Voigt. G-CORE: A core for future graph query languages. In Proceedings of the ACM International Conference on Management of Data, SIGMOD '18, page 1421--1432, 2018.
[5]
Renzo Angles, Marcelo Arenas, Pablo Barceló, Aidan Hogan, Juan Reutter, and Domagoj Vrgoč. Foundations of Modern Query Languages for Graph Databases. ACM Computing Surveys, 50(5), September 2017.
[6]
Bibek Bhattarai, Hang Liu, and H. Howie Huang. CECI: Compact Embedding Cluster Index for Scalable Subgraph Matching. In Proceedings of the ACM International Conference on Management of Data, SIGMOD '19, page 1447--1462, 2019.
[7]
Fei Bi, Lijun Chang, Xuemin Lin, Lu Qin, and Wenjie Zhang. Efficient Subgraph Matching by Postponing Cartesian Products. In Proceedings of the ACM International Conference on Management of Data, SIGMOD '16, pages 1199--1214, 2016.
[8]
James Cheng, Yiping Ke, Ada Wai-Chee Fu, Jeffrey Xu Yu, and Linhong Zhu. Finding Maximal Cliques in Massive Networks. ACM Transactions on Database Systems, 36(4), December 2011.
[9]
E. F. Codd. A Relational Model of Data for Large Shared Data Banks. Communications of the ACM, 13(6):377--387,June 1970.
[10]
Richard Cyganiak, David Wood, and Markus Lanthaler. RDF 1.1 Concepts and Abstract Syntax. Technical report, W3 Consortium, 2014.
[11]
Alin Deutsch, Nadime Francis, Alastair Green, Keith Hare, Bei Li, Leonid Libkin, Tobias Lindaaker, Victor Marsault, Wim Martens, Jan Michels, Filip Murlak, Stefan Plantikow, Petra Selmer, Hannes Voigt, Oskar van Rest, Domagoj Vrgoc, Mingxi Wu, and Fred Zemke. Graph Pattern Matching in GQL and SQL/PGQ. CoRR, abs/2112.06217, 2021.
[12]
Alin Deutsch, Yu Xu, Mingxi Wu, and Victor E. Lee. Aggregation Support for Modern Graph Analytics in TigerGraph. In Proceedings of the ACM International Conference on Management of Data, SIGMOD '20, page 377--392, 2020.
[13]
William Eberle and Lawrence Holder. Discovering structural anomalies in graph-based data. In Seventh IEEE international conference on data mining workshops (ICDMW 2007), pages 393--398. IEEE, 2007.
[14]
Hartmut Ehrig, Annegret Habel, and Hans-Jörg Kreowski. Introduction to Graph Grammars with Applications to Semantic Networks. Computers & Mathematics with Applications, 23(6--9):557--572, 1992.
[15]
Grace Fan, Wenfei Fan, Yuanhao Li, Ping Lu, Chao Tian, and Jingren Zhou. Extending graph patterns with conditions. In Proceedings of the ACM International Conference on Management of Data, pages 715--729, 2020.
[16]
Wenfei Fan, Yinghui Wu, and Jingbo Xu. Functional Dependencies for Graphs. In Proceedings of the 2016 International Conference on Management of Data, SIGMOD '16, page 1843--1857, 2016.
[17]
Nadime Francis, Alastair Green, Paolo Guagliardo, Leonid Libkin, Tobias Lindaaker, Victor Marsault, Stefan Plantikow, Mats Rydberg, Petra Selmer, and Andrés Taylor. Cypher: An Evolving Query Language for Property Graphs. In Proceedings of the ACM International Conference on Management of Data, SIGMOD '18, page 1433--1445, 2018.
[18]
Joshua A. Grochow and Manolis Kellis. Network Motif Discovery Using Subgraph Enumeration and Symmetry-Breaking. In Research in Computational Molecular Biology, pages 92--106, 2007.
[19]
Annegret Habel, Reiko Heckel, and Gabriele Taentzer. Graph Grammars with Negative Application Conditions. Fundamenta Informaticae, 26(3--4):287--313, 1996.
[20]
Myoungji Han, Hyunjoon Kim, Geonmo Gu, Kunsoo Park, and Wook-Shin Han. Efficient Subgraph Matching: Harmonizing Dynamic Programming, Adaptive Matching Order, and Failing Set Together. In Proceedings of the ACM International Conference on Management of Data, SIGMOD '19, pages 1429--1446, 2019.
[21]
Wook-Shin Han, Jinsoo Lee, and Jeong-Hoon Lee. Turboiso: Towards Ultrafast and Robust Subgraph Isomorphism Search in Large Graph Databases. In Proceedings of the ACM International Conference on Management of Data, SIGMOD '13, pages 337--348, 2013.
[22]
Steve Harris and Andy Seaborne. SPARQL 1.1 Query Language. Technical report, W3 Consortium, 2013.
[23]
Sungpack Hong, Siegfried Depner, Thomas Manhardt, Jan Van Der Lugt, Merijn Verstraaten, and Hassan Chafi. PGX.D: A Fast Distributed Graph Processing Engine. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC '15, 2015.
[24]
Kasra Jamshidi, Rakesh Mahadasa, and Keval Vora. Peregrine: A Pattern-Aware Graph Mining System. In Proceedings of the Fifteenth European Conference on Computer Systems, EuroSys '20, 2020.
[25]
Kasra Jamshidi and Keval Vora. A Deeper Dive into Pattern-Aware Subgraph Exploration with Peregrine. ACM SIGOPS Operating Systems Review, 55(1):1--10, 2021.
[26]
Chathura Kankanamge, Siddhartha Sahu, Amine Mhedbhi, Jeremy Chen, and Semih Salihoglu. Graphflow: An Active Graph Database. In Proceedings of the ACM International Conference on Management of Data, SIGMOD '17, page 1695--1698, 2017.
[27]
Hyeonji Kim, Juneyoung Lee, Sourav S. Bhowmick, Wook-Shin Han, JeongHoon Lee, Seongyun Ko, and Moath H.A. Jarrah. DUALSIM: Parallel Subgraph Enumeration in a Massive Graph on a Single Machine. In Proceedings of the ACM International Conference on Management of Data, SIGMOD '16, pages 1231--1245, 2016.
[28]
Kyoungmin Kim, In Seo, Wook-Shin Han, Jeong-Hoon Lee, Sungpack Hong, Hassan Chafi, Hyungyu Shin, and Geonhwa Jeong. TurboFlux: A Fast Continuous Subgraph Matching System for Streaming Graph Data. In Proceedings of the ACM International Conference on Management of Data, SIGMOD '18, pages 411--426, 2018.
[29]
Longbin Lai, Lu Qin, Xuemin Lin, and Lijun Chang. Scalable Subgraph Enumeration in MapReduce. Proceedings of the VLDB Endowment, 8(10):974--985, June 2015.
[30]
Longbin Lai, Lu Qin, Xuemin Lin, Ying Zhang, Lijun Chang, and Shiyu Yang. Scalable Distributed Subgraph Enumeration. Proceedings of the VLDB Endowment, 10(3):217--228, November 2016.
[31]
Amine Mhedhbi and Semih Salihoglu. Optimizing Subgraph Queries by Combining Binary and Worst-Case Optimal Joins. Proceedings of the VLDB Endowment, 12(11):1692--1704, July 2019.
[32]
Neo4j, Inc. Neo4j Graph Database, 2022. Version 4.4.
[33]
Caleb C. Noble and Diane J. Cook. Graph-Based Anomaly Detection. In Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '03, page 631--636, 2003.
[34]
Zhengping Qian, Chenqiang Min, Longbin Lai, Yong Fang, Gaofeng Li, Youyang Yao, Bingqing Lyu, Xiaoli Zhou, Zhimin Chen, and Jingren Zhou. GAIA: A System for Interactive Analysis on Distributed Graphs Using a High-Level Language. In 18th USENIX Symposium on Networked Systems Design and Implementation, NSDI '21, pages 321--335. USENIX Association, April 2021.
[35]
Miao Qiao, Hao Zhang, and Hong Cheng. Subgraph Matching: On Compression and Computation. Proceedings of the VLDB Endowment, 11(2):176--188, October 2017.
[36]
Xuguang Ren, Junhu Wang, Wook-Shin Han, and Jeffrey Xu Yu. Fast and Robust Distributed Subgraph Enumeration. Proceedings of the VLDB Endowment, 12(11):1344--1356, July 2019.
[37]
Marko A. Rodriguez. The Gremlin Graph Traversal Machine and Language. In Proceedings of the 15th Symposium on Database Programming Languages, DBPL 2015, page 1--10, 2015.
[38]
Nicholas P. Roth, Vasileios Trigonakis, Sungpack Hong, Hassan Chafi, Anthony Potter, Boris Motik, and Ian Horrocks. PGX.D/Async: A Scalable Distributed Graph Pattern Matching Engine. In Proceedings of the Fifth International Workshop on Graph Data-Management Experiences & Systems, GRADES '17, 2017.
[39]
Shixuan Sun and Qiong Luo. In-Memory Subgraph Matching: An In-Depth Study. In Proceedings of the ACM International Conference on Management of Data, SIGMOD '20, page 1083--1098, 2020.
[40]
Harsh Thakkar, Dharmen Punjani, Jens Lehmann, and Sören Auer. Two for One: Querying Property Graph Databases Using SPARQL via Gremlinator. In Proceedings of the 1st ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA), GRADES-NDA '18, 2018.
[41]
Oskar van Rest, Sungpack Hong, Jinha Kim, Xuming Meng, and Hassan Chafi. PGQL: A property graph query language. In Proceedings of the Fourth International Workshop on Graph Data Management Experiences and Systems, GRADES '16, 2016.
[42]
Zhengyi Yang, Longbin Lai, Xuemin Lin, Kongzhang Hao, and Wenjie Zhang. HUGE: An Efficient and Scalable Subgraph Enumeration System. In Proceedings of the ACM International Conference on Management of Data, SIGMOD '21, page 2049--2062, 2021.
[43]
Stéphane Zampelli, Yves Deville, and Pierre Dupont. Approximate Constrained Subgraph Matching. In Peter van Beek, editor, Principles and Practice of Constraint Programming, CP 2005, pages 832--836, Berlin, Heidelberg, 2005. Springer Berlin Heidelberg.

Cited By

View all
  • (2024)Understanding High-Performance Subgraph Pattern Matching: A Systems PerspectiveProceedings of the 7th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)10.1145/3661304.3661897(1-12)Online publication date: 14-Jun-2024
  1. Anti-vertex for neighborhood constraints in subgraph queries

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    GRADES-NDA '22: Proceedings of the 5th ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)
    June 2022
    84 pages
    ISBN:9781450393843
    DOI:10.1145/3534540
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 June 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    SIGMOD/PODS '22
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 29 of 61 submissions, 48%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)13
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 23 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Understanding High-Performance Subgraph Pattern Matching: A Systems PerspectiveProceedings of the 7th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)10.1145/3661304.3661897(1-12)Online publication date: 14-Jun-2024

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media