PALI at SemEval-2021 Task 2: Fine-Tune XLM-RoBERTa for Word in Context Disambiguation

Xie, Shuyi; Ma, Jian; Yang, Haiqin; Jiang, Lianxin; Mo, Yang; Shen, Jianping

Computer Science > Artificial Intelligence

arXiv:2104.10375 (cs)

[Submitted on 21 Apr 2021 (v1), last revised 7 Jun 2021 (this version, v2)]

Title:PALI at SemEval-2021 Task 2: Fine-Tune XLM-RoBERTa for Word in Context Disambiguation

Authors:Shuyi Xie, Jian Ma, Haiqin Yang, Lianxin Jiang, Yang Mo, Jianping Shen

View PDF

Abstract:This paper presents the PALI team's winning system for SemEval-2021 Task 2: Multilingual and Cross-lingual Word-in-Context Disambiguation. We fine-tune XLM-RoBERTa model to solve the task of word in context disambiguation, i.e., to determine whether the target word in the two contexts contains the same meaning or not. In the implementation, we first specifically design an input tag to emphasize the target word in the contexts. Second, we construct a new vector on the fine-tuned embeddings from XLM-RoBERTa and feed it to a fully-connected network to output the probability of whether the target word in the context has the same meaning or not. The new vector is attained by concatenating the embedding of the [CLS] token and the embeddings of the target word in the contexts. In training, we explore several tricks, such as the Ranger optimizer, data augmentation, and adversarial training, to improve the model prediction. Consequently, we attain first place in all four cross-lingual tasks.

Comments:	7 pages, 2 figures, 2 tables, winning solution on En-Ar, En-Fr, En-Ru, and En-Zh cross-lingual tasks of SemEval'21 Task 2
Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2104.10375 [cs.AI]
	(or arXiv:2104.10375v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2104.10375

Submission history

From: Haiqin Yang [view email]
[v1] Wed, 21 Apr 2021 06:24:49 UTC (5,528 KB)
[v2] Mon, 7 Jun 2021 08:35:35 UTC (5,528 KB)

Computer Science > Artificial Intelligence

Title:PALI at SemEval-2021 Task 2: Fine-Tune XLM-RoBERTa for Word in Context Disambiguation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:PALI at SemEval-2021 Task 2: Fine-Tune XLM-RoBERTa for Word in Context Disambiguation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators