skip to main content
10.1145/3587259.3627557acmconferencesArticle/Chapter ViewAbstractPublication Pagesk-capConference Proceedingsconference-collections
research-article
Open access

OKG: A Knowledge Graph for Fine-grained Understanding of Social Media Discourse on Inequality

Published: 05 December 2023 Publication History

Abstract

In recent years, social media platforms such as Twitter have allowed people to voice their opinions by engaging in online discussions. The availability of such discussions has garnered interest amongst researchers in analyzing the dynamics on critical topics, such as inequality. Most of the current strategies are, however, limited with respect to conveying the fine-grained opinions of users, focusing on tasks such as sentiment analysis or topic modeling that extract coarse categorizations. In this work, we address this challenge by integrating a Twitter corpus with the output of finer-grained semantic parsing for the analysis of social media discourse. To do so, we first introduce the OBservatory Integrated Ontology (OBIO) that integrates social media metadata with various types of linguistic knowledge. We then present the Observatory Knowledge Graph (OKG), a knowledge graph in terms of the ontology, populated with tweets on inequality. We lastly provide use cases showing how the knowledge graph can be used as the backbone of a social media observatory, to facilitate a deeper understanding of social media discourse.

References

[1]
Mehwish Alam, Aldo Gangemi, Valentina Presutti, and Diego Reforgiato Recupero. 2021. Semantic role labeling for knowledge graph extraction from text. Progress in Artificial Intelligence 10 (2021), 309–320.
[2]
Mehwish Alam, Manuel Kaschura, and Harald Sack. 2020. Apollo: Twitter Stream Analyzer of Trending Hashtags: A case-study of# COVID-19. In ISWC (Demos/Industry). 64–69.
[3]
Christopher Barrie and Justin Chun-ting Ho. 2021. academictwitteR: an R package to access the Twitter Academic Research Product Track v2 API endpoint. Journal of Open Source Software 6, 62 (2021), 3272.
[4]
Katrien Beuls, Paul Van Eecke, and Vanja Sophie Cangalovic. 2021. A computational construction grammar approach to semantic frame extraction. Linguistics Vanguard 7, 1 (2021), 20180015.
[5]
John G Breslin, Stefan Decker, Andreas Harth, and Uldis Bojars. 2006. SIOC: an approach to connect web-based communities. International Journal of Web Based Communities 2, 2 (2006), 133–142.
[6]
Vasco Campos, Ricardo Campos, Pedro Mota, and Alípio Jorge. 2022. Tweet2Story: a web app to extract narratives from twitter. In European Conference on Information Retrieval. Springer, 270–275.
[7]
Gustavo Candela, Pilar Escobar, Rafael C Carrasco, and Manuel Marco-Such. 2022. Evaluating the quality of linked open data in digital libraries. Journal of Information Science 48, 1 (2022), 21–43.
[8]
Emily Chen, Kristina Lerman, Emilio Ferrara, 2020. Tracking social media discourse about the covid-19 pandemic: Development of a public coronavirus twitter data set. JMIR public health and surveillance 6, 2 (2020), e19273.
[9]
Qingqing Chen and Andrew Crooks. 2022. Analyzing the vaccination debate in social media data pre-and post-COVID-19 pandemic. International Journal of Applied Earth Observation and Geoinformation 110 (2022), 102783.
[10]
Yiyi Chen, Harald Sack, and Mehwish Alam. 2022. Analyzing social media for measuring public attitudes toward controversies and their driving factors: a case study of migration. Social network analysis and mining 12, 1 (2022), 135.
[11]
G Cota 2020. The landscape of ontology reuse approaches. Appl. Practices Ontol. Des., Extraction, Reason 49 (2020), 21.
[12]
Leon Derczynski, Diana Maynard, Giuseppe Rizzo, Marieke Van Erp, Genevieve Gorrell, Raphaël Troncy, Johann Petrak, and Kalina Bontcheva. 2015. Analysis of named entity recognition and linking for tweets. Information Processing & Management 51, 2 (2015), 32–49.
[13]
Dimitar Dimitrov, Erdal Baran, Pavlos Fafalios, Ran Yu, Xiaofei Zhu, Matthäus Zloch, and Stefan Dietze. 2020. Tweetscov19-a knowledge base of semantically annotated tweets about the covid-19 pandemic. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 2991–2998.
[14]
Pavlos Fafalios, Vasileios Iosifidis, Eirini Ntoutsi, and Stefan Dietze. 2018. Tweetskb: A public and large-scale rdf corpus of annotated tweets. In The Semantic Web: 15th International Conference, ESWC 2018, Heraklion, Crete, Greece, June 3–7, 2018, Proceedings 15. Springer, 177–190.
[15]
Antske Fokkens, Aitor Soroa, Zuhaitz Beloki, Niels Ockeloen, German Rigau, Willem Robert Van Hage, and Piek Vossen. 2014. NAF and GAF: Linking linguistic annotations. In Proceedings 10th Joint ISO-ACL SIGSEM Workshop on Interoperable Semantic Annotation. 9–16.
[16]
Aldo Gangemi, Mehwish Alam, Luigi Asprino, Valentina Presutti, and Diego Reforgiato Recupero. 2016. Framester: A wide coverage linguistic linked data hub. In Knowledge Engineering and Knowledge Management: 20th International Conference, EKAW 2016, Bologna, Italy, November 19-23, 2016, Proceedings 20. Springer, 239–254.
[17]
Aldo Gangemi, Valentina Presutti, Diego Reforgiato Recupero, Andrea Giovanni Nuzzolese, Francesco Draicchio, and Misael Mongiovì. 2017. Semantic web machine reading with FRED. Semantic Web 8, 6 (2017), 873–893.
[18]
Sebastian Hellmann, Jens Lehmann, Sören Auer, and Martin Brümmer. 2013. Integrating NLP using linked data. In The Semantic Web–ISWC 2013: 12th International Semantic Web Conference, Sydney, NSW, Australia, October 21-25, 2013, Proceedings, Part II 12. Springer, 98–113.
[19]
Rupinder Paul Khandpur, Taoran Ji, Steve Jan, Gang Wang, Chang-Tien Lu, and Naren Ramakrishnan. 2017. Crowdsourcing cybersecurity: Cyber attack detection using social media. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. 1049–1057.
[20]
Paul Kingsbury and Martha Palmer. 2003. Propbank: the next level of treebank. In Proceedings of Treebanks and lexical Theories, Vol. 3. Citeseer.
[21]
Xiaohua Liu, Kuan Li, Ming Zhou, and Zhongyang Xiong. 2011. Collective semantic role labeling for tweets with clustering. In Twenty-Second International Joint Conference on Artificial Intelligence.
[22]
Roel O Lutkenhaus, Jeroen Jansz, and Martine PA Bouman. 2019. Mapping the Dutch vaccination debate on Twitter: Identifying communities, narratives, and interactions. Vaccine: X 1 (2019), 100019.
[23]
Thomas Marcoux and Nitin Agarwal. 2021. Narrative Trends of COVID-19 Misinformation. In Text2Story@ ECIR. 77–80.
[24]
John P McCrae, Julia Bosque-Gil, Jorge Gracia, Paul Buitelaar, and Philipp Cimiano. 2017. The Ontolex-Lemon model: development and applications. In Proceedings of eLex 2017 conference. 19–21.
[25]
Pablo N Mendes, Max Jakob, Andrés García-Silva, and Christian Bizer. 2011. DBpedia spotlight: shedding light on the web of documents. In Proceedings of the 7th international conference on semantic systems. 1–8.
[26]
Alexander Pak, Patrick Paroubek, 2010. Twitter as a corpus for sentiment analysis and opinion mining. In LREc, Vol. 10. 1320–1326.
[27]
Gerasimos Razis and Ioannis Anagnostopoulos. 2014. Semantifying twitter: The influence tracker ontology. In 2014 9th International Workshop on Semantic and Social Media Adaptation and Personalization. IEEE, 98–103.
[28]
Marco Rospocher, Marieke van Erp, Piek Vossen, Antske Fokkens, Itziar Aldabe, German Rigau, Aitor Soroa, Thomas Ploeger, and Tessel Bogaard. 2016. Building event-centric knowledge graphs from news. Journal of Web Semantics 37 (2016), 132–151.
[29]
Hassan Saif, Yulan He, and Harith Alani. 2012. Semantic sentiment analysis of twitter. In The Semantic Web–ISWC 2012: 11th International Semantic Web Conference, Boston, MA, USA, November 11-15, 2012, Proceedings, Part I 11. Springer, 508–524.
[30]
Carlo Romano Marcello Alessandro Santagiustina and Massimo Warglien. 2022. The architecture of partisan debates: The online controversy on the no-deal Brexit. PLoS one 17, 6 (2022), e0270236.
[31]
Laura Sinikallio, Senka Drobac, Minna Tamper, Rafael Leal, Mikko Koho, Jouni Tuominen, Matti La Mela, Eero Hyvönen, 2021. Plenary debates of the Parliament of Finland as Linked Open Data and in Parla-CLARIN markup. In 3rd Conference on Language, Data and Knowledge, LDK 2021. Schloss Dagstuhl-Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing.
[32]
Anders Søgaard, Barbara Plank, and Hector Alonso. 2015. Using frame semantics for knowledge extraction from twitter. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 29.
[33]
Laura Spillner, Carlo R. M. A. Santagiustina, Thomas Mildner, and Robert Porzel. 2022. Towards conflictual narrative mechanics. In Proceedings of the IJCAI/ECAI Workshop on Semantic Techniques for Narrative-based Understanding.
[34]
Lara Tavoschi, Filippo Quattrone, Eleonora D’Andrea, Pietro Ducange, Marco Vabanesi, Francesco Marcelloni, and Pier Luigi Lopalco. 2020. Twitter as a sentinel tool to monitor public opinion on vaccination: an opinion mining analysis from September 2016 to August 2017 in Italy. Human vaccines & immunotherapeutics 16, 5 (2020), 1062–1069.
[35]
Ming-Hsiang Tsou, Chin-Te Jung, Chris Allen, Jiue-An Yang, Jean-Mark Gawron, Brian H Spitzberg, and Su Han. 2015. Social media analytics and research test-bed (SMART dashboard). In Proceedings of the 2015 international conference on social media & society. 1–7.
[36]
Andranik Tumasjan, Timm Sprenger, Philipp Sandner, and Isabell Welpe. 2010. Predicting elections with twitter: What 140 characters reveal about political sentiment. In Proceedings of the international AAAI conference on web and social media, Vol. 4. 178–185.
[37]
Akhila Sri Manasa Venigalla, Sridhar Chimalakonda, and Dheeraj Vagavolu. 2020. Mood of India during Covid-19-An interactive web portal based on emotion analysis of Twitter data. In Conference companion publication of the 2020 on computer supported cooperative work and social computing. 65–68.
[38]
Tom Willaert, Paul Van Eecke, Katrien Beuls, and Luc Steels. 2020. Building social media observatories for monitoring online opinion dynamics. Social Media+ Society 6, 2 (2020), 2056305119898778.
[39]
Changmeng Zheng, Junhao Feng, Ze Fu, Yi Cai, Qing Li, and Tao Wang. 2021. Multimodal relation extraction with efficient graph alignment. In Proceedings of the 29th ACM International Conference on Multimedia. 5298–5306.
[40]
Florian Zollmann, Jeffery Klaehn, Tina Sikka, Kristin Comeforo, Daniel Broudy, Mandy Tröger, Elizabeth Poole, Alison Edgley, and Andrew Mullen. 2018. The propaganda model and intersectionality: integrating separate paradigms. Media Theory 2, 2 (2018), 213–239.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
K-CAP '23: Proceedings of the 12th Knowledge Capture Conference 2023
December 2023
270 pages
ISBN:9798400701412
DOI:10.1145/3587259
  • Editors:
  • Brent Venable,
  • Daniel Garijo,
  • Brian Jalaian
This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 December 2023

Check for updates

Author Tags

  1. Ontology Engineering and Population
  2. Social Media Discourse

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • European Union (Horizon 2020)

Conference

K-CAP '23
Sponsor:
K-CAP '23: Knowledge Capture Conference 2023
December 5 - 7, 2023
FL, Pensacola, USA

Acceptance Rates

Overall Acceptance Rate 55 of 198 submissions, 28%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 414
    Total Downloads
  • Downloads (Last 12 months)389
  • Downloads (Last 6 weeks)41
Reflects downloads up to 06 Jan 2025

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media