Claim Detection in Biomedical Twitter Posts

Wührl, Amelie; Klinger, Roman

Computer Science > Computation and Language

arXiv:2104.11639 (cs)

COVID-19 e-print

Important: e-prints posted on arXiv are not peer-reviewed by arXiv; they should not be relied upon without context to guide clinical practice or health-related behavior and should not be reported in news media as established information without consulting multiple experts in the field.

[Submitted on 23 Apr 2021 (v1), last revised 1 May 2021 (this version, v2)]

Title:Claim Detection in Biomedical Twitter Posts

Authors:Amelie Wührl, Roman Klinger

View PDF

Abstract:Social media contains unfiltered and unique information, which is potentially of great value, but, in the case of misinformation, can also do great harm. With regards to biomedical topics, false information can be particularly dangerous. Methods of automatic fact-checking and fake news detection address this problem, but have not been applied to the biomedical domain in social media yet. We aim to fill this research gap and annotate a corpus of 1200 tweets for implicit and explicit biomedical claims (the latter also with span annotations for the claim phrase). With this corpus, which we sample to be related to COVID-19, measles, cystic fibrosis, and depression, we develop baseline models which detect tweets that contain a claim automatically. Our analyses reveal that biomedical tweets are densely populated with claims (45 % in a corpus sampled to contain 1200 tweets focused on the domains mentioned above). Baseline classification experiments with embedding-based classifiers and BERT-based transfer learning demonstrate that the detection is challenging, however, shows acceptable performance for the identification of explicit expressions of claims. Implicit claim tweets are more challenging to detect.

Comments:	Accepted at the BioNLP Workshop at NAACL 2021
Subjects:	Computation and Language (cs.CL); Social and Information Networks (cs.SI)
Cite as:	arXiv:2104.11639 [cs.CL]
	(or arXiv:2104.11639v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2104.11639

Submission history

From: Roman Klinger [view email]
[v1] Fri, 23 Apr 2021 14:45:31 UTC (255 KB)
[v2] Sat, 1 May 2021 18:22:39 UTC (256 KB)

Computer Science > Computation and Language

Title:Claim Detection in Biomedical Twitter Posts

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Claim Detection in Biomedical Twitter Posts

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators