skip to main content
10.1145/3472813.3472814acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmhiConference Proceedingsconference-collections
research-article

Evaluation of Applied Machine Learning for Health Misinformation Detection via Survey of Medical Professionals on Controversial Topics in Pediatrics

Published: 26 October 2021 Publication History

Abstract

In this research, we present an evaluation of a system for detection of health misinformation using applied machine learning. The system incorporates computing automation, information retrieval, and natural language processing in conjunction with evidence-based medicine to generate a veracity score based on consensus from trusted medical knowledge bases. For our study, we pre-computed the veracity scores of controversial topics in pediatrics with our proposed system, and then also solicited evaluations of these topics from medical professionals in the neurodevelopmental field via a quantitative survey. Hence, this work provides a double-blind comparison on the veracity of medical claims between our proposed system's results and medical professionals' responses. The results showed that our system's automated assessment matched professional opinions of medical personnel with 80% precision. The survey also demonstrated the inherent challenge with health misinformation detection, as there was no consensus among the medical professionals for 50% of the controversial statements. Nevertheless, this evaluation shows promising results for using objective trust metrics such as the veracity score, in contrast with subjective trust metrics that rely on potentially biased crowdsourcing, ratings, and pre-trained labelling of data.

References

[1]
Anna Kata. 2012. Anti-Vaccine Activists, Web 2.0, and the Postmodern Paradigm–An Overview of Tactics and Tropes Used Online by the Anti-Vaccination Movement. Vaccine 30, 25 (2012), 3778–3789.
[2]
Trisha Greenhalgh. 2010. How to Read a Paper: The Basics of Evidence-Based Medicine. John Wiley & Sons.
[3]
Betty J Ackley. 2008. Evidence-Based Nursing Care Guidelines: Medical-Surgical Interventions. Elsevier Health Sciences.
[4]
Ronald Cornet and Nicolette de Keizer. 2008. Forty Years of SNOMED: A Literature Review. BMC Med. Inform. Decis. Mak. 8, Suppl 1 (2008), S2.
[5]
Catherine Smith and P Stavri. 2005. Consumer Health Vocabulary. Consum. Heal. Informatics (2005), 122–128.
[6]
Francesco Corcoglioniti, Marco Rospocher, and Alessio Palmero Aprosio. 2015. Extracting Knowledge from Text with PIKES. In International Semantic Web Conference.
[7]
Jon Brassey. 2005. TRIP Database: Identifying High Quality Medical Literature from a Range of Sources. New Rev. Inf. Netw. 11, 2 (2005), 229–234.
[8]
Rie Johnson and Tong Zhang. 2015. Effective Use of Word Order for Text Categorization with Convolutional Neural Networks. In North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL-HLT).
[9]
Bo Pang, Lillian Lee, and others. 2008. Opinion Mining and Sentiment Analysis. Found. Trends® Inf. Retr. 2, 1–2 (2008), 1–135.
[10]
Marie-Catherine De Marneffe and Christopher D Manning. 2008. Stanford Typed Dependencies Manual.
[11]
Iraklis Varlamis, Magdalini Eirinaki, and Malamati Louta. 2010. A Study on Social Network Metrics and their Application in Trust Networks. In Proceedings of the IEEE International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 168–175.
[12]
Sonja Grabner-Kräuter and Sofie Bitter. 2015. Trust in Online Social Networks: A Multifaceted Perspective. Forum Soc. Econ. 44, 1 (2015), 48–68.
[13]
Amine Abdaoui, Jérôme Azé, Sandra Bringay, and Pascal Poncelet. 2015. Collaborative Content-Based Method for Estimating User Reputation in Online Forums. In Web Information Systems Engineering. Springer, 292–299.
[14]
Meeyoung Park. 2013. HealthTrust: Assessing the Trustworthiness of Healthcare Information on the Internet. University of Kansas.
[15]
Paolo Ferragina and Ugo Scaiella. 2010. TAGME: On-The-Fly Annotation of Short Text Fragments (By Wikipedia Entities). In ACM International Conference on Information and Knowledge Management (CIKM), 1625–1628.
[16]
Gautam Kishore Shahi and Durgesh Nandini. 2020. FakeCovid–A Multilingual Cross-domain Fact Check News Dataset for COVID-19. arXiv Prepr. arXiv2006.11343 (2020).
[17]
Debanjana Kar, Mohit Bhardwaj, Suranjana Samanta, and Amar Prakash Azad. 2020. No Rumours Please! A Multi-Indic-Lingual Approach for COVID Fake-Tweet Detection. arXiv Prepr. arXiv2010.06906 (2020).
[18]
Lucy Lu Wang, Kyle Lo, Yoganand Chandrasekhar, Russell Reas, Jiangjiang Yang, Darrin Eide, K Funk, Rodney Michael Kinney, Ziyang Liu, W Merrill, P Mooney, D Murdick, Devvret Rishi, Jerry Sheehan, Zhihong Shen, B Stilson, A Wade, K Wang, Christopher Wilhelm, Boya Xie, D Raymond, Daniel S Weld, Oren Etzioni, and Sebastian Kohlmeier. 2020. CORD-19: The Covid-19 Open Research Dataset. arXiv 2004.05125 (2020).
[19]
Arkin Dharawat, Ismini Lourentzou, Alex Morales, and ChengXiang Zhai. 2020. Drink Bleach or Do What Now? Covid-HeRA: A Dataset for Risk-Informed Health Decision Making in the Presence of COVID19 Misinformation. arXiv Prepr. arXiv2010.08743 (2020).
[20]
Yin Aphinyanaphongs, Constantin Aliferis, and others. 2007. Text Categorization Models for Identifying Unproven Cancer Treatments on the Web. In World Congress on Medical Informatics (MedInfo), 968.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICMHI '21: Proceedings of the 5th International Conference on Medical and Health Informatics
May 2021
347 pages
ISBN:9781450389846
DOI:10.1145/3472813
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 October 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Applied Machine Learning
  2. Health Misinformation
  3. Social Media

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICMHI 2021

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 128
    Total Downloads
  • Downloads (Last 12 months)21
  • Downloads (Last 6 weeks)3
Reflects downloads up to 14 Sep 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media