SWE2: SubWord Enriched and Significant Word Emphasized Framework for Hate Speech Detection

Mou, Guanyi; Ye, Pengyi; Lee, Kyumin

doi:10.1145/3340531.3411990

Computer Science > Computation and Language

arXiv:2409.16673 (cs)

[Submitted on 25 Sep 2024]

Title:SWE2: SubWord Enriched and Significant Word Emphasized Framework for Hate Speech Detection

Authors:Guanyi Mou, Pengyi Ye, Kyumin Lee

View PDF HTML (experimental)

Abstract:Hate speech detection on online social networks has become one of the emerging hot topics in recent years. With the broad spread and fast propagation speed across online social networks, hate speech makes significant impacts on society by increasing prejudice and hurting people. Therefore, there are aroused attention and concern from both industry and academia. In this paper, we address the hate speech problem and propose a novel hate speech detection framework called SWE2, which only relies on the content of messages and automatically identifies hate speech. In particular, our framework exploits both word-level semantic information and sub-word knowledge. It is intuitively persuasive and also practically performs well under a situation with/without character-level adversarial attack. Experimental results show that our proposed model achieves 0.975 accuracy and 0.953 macro F1, outperforming 7 state-of-the-art baselines under no adversarial attack. Our model robustly and significantly performed well under extreme adversarial attack (manipulation of 50% messages), achieving 0.967 accuracy and 0.934 macro F1.

Comments:	Published in CIKM 2020
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2409.16673 [cs.CL]
	(or arXiv:2409.16673v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2409.16673
Journal reference:	CIKM 2020
Related DOI:	https://doi.org/10.1145/3340531.3411990

Submission history

From: Guanyi Mou [view email]
[v1] Wed, 25 Sep 2024 07:05:44 UTC (722 KB)

Computer Science > Computation and Language

Title:SWE2: SubWord Enriched and Significant Word Emphasized Framework for Hate Speech Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:SWE2: SubWord Enriched and Significant Word Emphasized Framework for Hate Speech Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators