ElitePLM: An Empirical Study on General Language Ability Evaluation of Pretrained Language Models

Li, Junyi; Tang, Tianyi; Gong, Zheng; Yang, Lixin; Yu, Zhuohao; Chen, Zhipeng; Wang, Jingyuan; Zhao, Wayne Xin; Wen, Ji-Rong

Computer Science > Computation and Language

arXiv:2205.01523 (cs)

[Submitted on 3 May 2022]

Title:ElitePLM: An Empirical Study on General Language Ability Evaluation of Pretrained Language Models

Authors:Junyi Li, Tianyi Tang, Zheng Gong, Lixin Yang, Zhuohao Yu, Zhipeng Chen, Jingyuan Wang, Wayne Xin Zhao, Ji-Rong Wen

View PDF

Abstract:Nowadays, pretrained language models (PLMs) have dominated the majority of NLP tasks. While, little research has been conducted on systematically evaluating the language abilities of PLMs. In this paper, we present a large-scale empirical study on general language ability evaluation of PLMs (ElitePLM). In our study, we design four evaluation dimensions, i.e. memory, comprehension, reasoning, and composition, to measure ten widely-used PLMs within five categories. Our empirical results demonstrate that: (1) PLMs with varying training objectives and strategies are good at different ability tests; (2) fine-tuning PLMs in downstream tasks is usually sensitive to the data size and distribution; (3) PLMs have excellent transferability between similar tasks. Moreover, the prediction results of PLMs in our experiments are released as an open resource for more deep and detailed analysis on the language abilities of PLMs. This paper can guide the future work to select, apply, and design PLMs for specific tasks. We have made all the details of experiments publicly available at this https URL.

Comments:	Accepted by NAACL 2022 main conference (Long Paper)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2205.01523 [cs.CL]
	(or arXiv:2205.01523v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.01523

Submission history

From: Junyi Li [view email]
[v1] Tue, 3 May 2022 14:18:10 UTC (123 KB)

Computer Science > Computation and Language

Title:ElitePLM: An Empirical Study on General Language Ability Evaluation of Pretrained Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:ElitePLM: An Empirical Study on General Language Ability Evaluation of Pretrained Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators