skip to main content
10.1145/3587259.3627560acmconferencesArticle/Chapter ViewAbstractPublication Pagesk-capConference Proceedingsconference-collections
research-article

NLFOA: Natural Language Focused Ontology Alignment

Published: 05 December 2023 Publication History

Abstract

For Ontology Alignment (OA), the task is to align semantically equivalent concepts and relations from different ontologies. This task plays a crucial role in many downstream tasks and applications in academia and industry. Since manually aligning ontologies is inefficient and costly, numerous approaches exist to do this automatically. However, most approaches are tailored to specific domains, are rule-based systems or based on feature engineering, and require external knowledge. The most recent advances in the field of OA rely on the widely proven effectiveness of pre-trained language models to represent the human-generated language that describes the entities in an ontology. However, these approaches additionally require sophisticated algorithms or Graph Neural Networks to exploit an ontology’s graphical structure to achieve state-of-the-art performance. In this work, we present NLFOA, or Natural Language Focused Ontology Alignment, which purely focuses on the natural language contained in ontologies to process the ontology’s semantics as well as graphical structure. An evaluation of our approach on common OA datasets shows superior results when finetuning with only a small number of training samples. Additionally, it demonstrates strong results in a zero-shot setting which could be employed in an active learning setup to reduce human labor when manually aligning ontologies significantly.

References

[1]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics. Minneapolis, MN, USA, 4171–4186.
[2]
Sefika Efeoglu. 2023. GraphMatcher: A Graph Representation Learning Approach for Ontology Matching: Results for OAEI 2022. In CEUR Workshop Proceedings, Vol. 3324. Hangzhou, China (virtual), 174–180.
[3]
Vadim Ermolayev and Maxim Davidovsky. 2012. Agent-based Ontology Alignment: Basics, Applications, Theoretical foundations, and Demonstration. In Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics. Craiova, Romania, 1–12.
[4]
Daniel Faria, Catia Pesquita, Emanuel Santos, Matteo Palmonari, Isabel F Cruz, and Francisco M Couto. 2013. The AgreementMakerLight Ontology Matching system. In OTM Confederated International Conferences ’On the Move to Meaningful Internet Systems’. 527–541.
[5]
Daniel Faria, Marta Contreiras Silva, Pedro Cotovio, Patrícia Eugénio, and Catia Pesquita. 2022. Matcha and Matcha-DL Results for OAEI 2022. In CEUR Workshop Proceedings.
[6]
J Gargantilla and A Gomez-Perez. 2004. OntoWeb: A Survey on Ontology-based Applications. OntoWeb Consortium IST Project IST-2000-29243, Deliverable 1 (2004).
[7]
Francis Gosselin and Amal Zouaq. 2023. SEBMatcher Results for OAEI 2022. In CEUR Workshop Proceedings, Vol. 3324. Hangzhou, China (virtual), 202–209.
[8]
Junheng Hao, Chuan Lei, Vasilis Efthymiou, Abdul Quamar, Fatma Özcan, Yizhou Sun, and Wei Wang. 2021. Medto: Medical Data to Ontology Matching using Hybrid Graph Neural Networks. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. Washington DC, USA, 2946–2954.
[9]
Terry F Hayamizu, Sherri de Coronado, Gilberto Fragoso, Nicholas Sioutos, James A Kadin, and Martin Ringwald. 2012. The Mouse-Human Anatomy Ontology Mapping Project. Database: The Journal of Biological Databases and Curation 2012 (2012), bar066.
[10]
Yuan He, Jiaoyan Chen, Denvar Antonyrajah, and Ian Horrocks. 2022. BERTMap: A BERT-based Ontology Alignment System. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. 5684–5691.
[11]
Yuan He, Jiaoyan Chen, Hang Dong, Ernesto Jiménez-Ruiz, Ali Hadian, and Ian Horrocks. 2022. Machine Learning-Friendly Biomedical Datasets for Equivalence and Subsumption Ontology Matching. In The Semantic Web – ISWC 2022. Hangzhou, China (virtual), 575–591.
[12]
Sven Hertling and Heiko Paulheim. 2021. ATBox Results for OAEI 2021. In CEUR Workshop Proceedings, Vol. 3063. 137–143.
[13]
Sven Hertling, Jan Portisch, and Heiko Paulheim. 2022. KERMIT - A Transformer-Based Approach for Knowledge Graph Matching. CoRR abs/2204.13931 (2022).
[14]
Vivek Iyer, Arvind Agarwal, and Harshit Kumar. 2021. VeeAlign: Multifaceted Context Representation Using Dual Attention for Ontology Alignment. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP). Online and Punta Cana, Dominican Republic, 10780–10792.
[15]
Ernesto Jiménez-Ruiz and Bernardo Cuenca Grau. 2011. LogMap: Logic-based and Scalable Ontology Matching. In Proceedings of the 10th International Conference on The Semantic Web (ISWC) – Volume Part I. Bonn, Germany, 273–288.
[16]
Jeff Johnson, Matthijs Douze, and Hervé Jégou. 2019. Billion-scale similarity search with GPUs. IEEE Transactions on Big Data (2019).
[17]
Peter Kardos, Zsolt Szántó, and Richárd Farkas. 2023. WomboCombo Results for OAEI 2022. In CEUR Workshop Proceedings, Vol. 3324. Hangzhou, China (virtual), 216–219.
[18]
Leon Knorr and Jan Portisch. 2022. Fine-TOM Matcher Results for OAEI 2021. In CEUR Workshop Proceedings, Vol. 3063. online, 144–151.
[19]
Prodromos Kolyvakis, Alexandros Kalousis, and Dimitris Kiritsis. 2018. DeepAlignment: Unsupervised Ontology Matching with Refined Word Vectors. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), Volume 1 (Long Papers). New Orleans, LA, USA, 787–798.
[20]
Daniel Kossack, Niklas Borg, Leon Knorr, and Jan Portisch. 2022. TOM Matcher Results for OAEI 2021. In CEUR Workshop Proceedings, Vol. 3063. online, 193–198.
[21]
Weiwei Lin and Reiko Haga. 2021. Matching Cyber Security Ontologies through Genetic Algorithm-Based Ontology Alignment Technique. Security and Communication Networks 2021 (2021).
[22]
Ikechukwu Nkisi-Orji, Nirmalie Wiratunga, Stewart Massie, Kit-Ying Hui, and Rachel Heaven. 2019. Ontology Alignment based on Word Embedding and Random Forest Classification. In Joint European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD). Dublin, Ireland, 557–572.
[23]
Mina Abd Nikooie Pour, Alsayed Algergawy, Patrice Buche, Leyla J Castro, Jiaoyan Chen, Hang Dong, Omaima Fallatah, Daniel Faria, Irini Fundulaki, Sven Hertling, Yuan He, Ian Horrocks, Martin Huschka, Liliana Ibanescu, Ernesto Jimenez-Ruiz, Naouel Karam, Amir Laadhar, Patrick Lambrix, Huanyu Li, Ying Li, Franck Michel, Engy Nasr, Heiko Paulheim, Catia Pesquita, Tzanina Saveta, Pavel Shvaiko, Cassia Trojahn, Chantelle Verhey, Mingfang Wu, Beyza Yaman, Ondrej Zamazal, and Lu Zhou. 2023. Results of the Ontology Alignment Evaluation Initiative 2022. In CEUR Workshop Proceedings, Vol. 3324. Hangzhou, China (virtual), 84–128.
[24]
Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Hong Kong, China, 3973–3983.
[25]
Pavel Shvaiko and Jérôme Euzenat. 2011. Ontology Matching: state of the art and future challenges. IEEE Transactions on Knowledge and Data Engineering 25, 1 (2011), 158–176.
[26]
Kaitao Song, Xu Tan, Tao Qin, Jianfeng Lu, and Tie-Yan Liu. 2020. MPNet: Masked and Permuted Pre-Training for Language Understanding. In Advances in Neural Information Processing Systems (NeurIPS), Vol. 33. online, 16857–16867.
[27]
Xiaobin Tang, Jing Zhang, Bo Chen, Yang Yang, Hong Chen, and Cuiping Li. 2020. BERT-INT:A BERT-based Interaction Model For Knowledge Graph Alignment. In Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI). Yokohama, Japan, 3174–3180.
[28]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention Is All You Need. In Advances in Neural Information Processing Systems (NIPS), Vol. 30. Long Beach, CA, USA, 5998–6008.
[29]
Javier Vela and Jorge Gracia. 2023. Cross-Lingual Ontology Matching with CIDER-LM: Results for OAEI 2022. In CEUR Workshop Proceedings, Vol. 3324. Hangzhou, China (virtual), 158–165.
[30]
Lucy Lu Wang, Chandra Bhagavatula, Mark Neumann, Kyle Lo, Chris Wilhelm, and Waleed Ammar. 2018. Ontology Alignment in the Biomedical Domain Using Entity Definitions and Context. In Proceedings of the BioNLP 2018 Workshop. Melbourne, Australia, 47–55.
[31]
Zhu Wang. 2023. AMD Results for OAEI 2022. In CEUR Workshop Proceedings, Vol. 3324. Hangzhou, China (virtual), 145–152.
[32]
Jifang Wu, Jianghua Lv, Haoming Guo, and Shilong Ma. 2020. DAEOM: A Deep Attentional Embedding Approach for Biomedical Ontology Matching. Applied Sciences 10, 21 (2020), 7909.
[33]
Ondej Zamazal and Vojtch Svtek. 2017. The Ten-Year OntoFarm and its Fertilization within the Onto-Sphere. Web Semantics: Science, Services and Agents on the World Wide Web 43, C (2017), 46–53.

Index Terms

  1. NLFOA: Natural Language Focused Ontology Alignment
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Conferences
          K-CAP '23: Proceedings of the 12th Knowledge Capture Conference 2023
          December 2023
          270 pages
          ISBN:9798400701412
          DOI:10.1145/3587259
          • Editors:
          • Brent Venable,
          • Daniel Garijo,
          • Brian Jalaian
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Sponsors

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 05 December 2023

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tag

          1. Ontology Alignment Sentence Transformers Zero-Shot

          Qualifiers

          • Research-article
          • Research
          • Refereed limited

          Conference

          K-CAP '23
          Sponsor:
          K-CAP '23: Knowledge Capture Conference 2023
          December 5 - 7, 2023
          FL, Pensacola, USA

          Acceptance Rates

          Overall Acceptance Rate 55 of 198 submissions, 28%

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • 0
            Total Citations
          • 89
            Total Downloads
          • Downloads (Last 12 months)59
          • Downloads (Last 6 weeks)4
          Reflects downloads up to 06 Jan 2025

          Other Metrics

          Citations

          View Options

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format.

          HTML Format

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media