What does BERT Learn from Multiple-Choice Reading Comprehension Datasets?

Si, Chenglei; Wang, Shuohang; Kan, Min-Yen; Jiang, Jing

arXiv:1910.12391 (cs)

[Submitted on 28 Oct 2019]

Abstract: Multiple-Choice Reading Comprehension (MCRC) requires the model to read the passage and question, and select the correct answer among the given options. Recent state-of-the-art models have achieved impressive performance on multiple MCRC datasets. However, such performance may not reflect the model's true ability of language understanding and reasoning. In this work, we adopt two approaches to investigate what BERT learns from MCRC datasets: 1) an un-readable data attack, in which we add keywords to confuse BERT, leading to a significant performance drop; and 2) an un-answerable data training, in which we train BERT on partial or shuffled input. Under un-answerable data training, BERT achieves unexpectedly high performance. Based on our experiments on the 5 key MCRC datasets (RACE, MCTest, MCScript, MCScript2.0, DREAM), we observe that 1) fine-tuned BERT mainly learns how keywords lead to correct prediction, instead of learning semantic understanding and reasoning; 2) BERT does not need correct syntactic information to solve the task; and 3) there exist artifacts in these datasets such that they can be solved even without the full context.
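The "un-answerable data training" setting described in the abstract can be illustrated with a minimal sketch of how partial or shuffled inputs might be built from an MCRC example. This is not the authors' code: the abstract does not specify the tokenization, the shuffling granularity, or which fields are dropped, so the `MCRCExample` type, the helper names, and the choice of whitespace-level word shuffling are all assumptions made for illustration.

```python
import random
from dataclasses import dataclass, replace
from typing import List


@dataclass(frozen=True)
class MCRCExample:
    """Hypothetical container for one multiple-choice example."""
    passage: str
    question: str
    options: List[str]
    label: int  # index of the correct option


def shuffle_words(text: str, rng: random.Random) -> str:
    """Return the whitespace-separated tokens of `text` in random order."""
    tokens = text.split()
    rng.shuffle(tokens)
    return " ".join(tokens)


def make_shuffled(ex: MCRCExample, rng: random.Random) -> MCRCExample:
    """Shuffled input (assumed word-level): word order, and hence syntax,
    is destroyed in every field, while the label is left unchanged."""
    return replace(
        ex,
        passage=shuffle_words(ex.passage, rng),
        question=shuffle_words(ex.question, rng),
        options=[shuffle_words(opt, rng) for opt in ex.options],
    )


def make_partial(ex: MCRCExample,
                 drop_passage: bool = True,
                 drop_question: bool = False) -> MCRCExample:
    """Partial input: blank out the passage and/or the question so the
    example can no longer be answered from the full context."""
    return replace(
        ex,
        passage="" if drop_passage else ex.passage,
        question="" if drop_question else ex.question,
    )
```

A model fine-tuned on the output of `make_shuffled` or `make_partial` and still scoring well above chance would point to the kind of dataset artifacts the abstract reports.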

Comments:	10 pages
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1910.12391 [cs.CL]
	(or arXiv:1910.12391v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1910.12391

Submission history

From: Chenglei Si
[v1] Mon, 28 Oct 2019 00:50:55 UTC (2,096 KB)
