UnibucLLM: Harnessing LLMs for Automated Prediction of Item Difficulty and Response Time for Multiple-Choice Questions

Rogoz, Ana-Cristina; Ionescu, Radu Tudor

Computer Science > Computation and Language

arXiv:2404.13343 (cs)

[Submitted on 20 Apr 2024]

Title:UnibucLLM: Harnessing LLMs for Automated Prediction of Item Difficulty and Response Time for Multiple-Choice Questions

Authors:Ana-Cristina Rogoz, Radu Tudor Ionescu

View PDF HTML (experimental)

Abstract:This work explores a novel data augmentation method based on Large Language Models (LLMs) for predicting item difficulty and response time of retired USMLE Multiple-Choice Questions (MCQs) in the BEA 2024 Shared Task. Our approach is based on augmenting the dataset with answers from zero-shot LLMs (Falcon, Meditron, Mistral) and employing transformer-based models based on six alternative feature combinations. The results suggest that predicting the difficulty of questions is more challenging. Notably, our top performing methods consistently include the question text, and benefit from the variability of LLM answers, highlighting the potential of LLMs for improving automated assessment in medical licensing exams. We make our code available this https URL.

Comments:	Accepted at BEA 2024 (NAACL Workshop)
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2404.13343 [cs.CL]
	(or arXiv:2404.13343v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2404.13343

Submission history

From: Radu Tudor Ionescu [view email]
[v1] Sat, 20 Apr 2024 10:41:02 UTC (691 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2024-04

Change to browse by:

cs
cs.AI
cs.LG

References & Citations

export BibTeX citation

Computer Science > Computation and Language

Title:UnibucLLM: Harnessing LLMs for Automated Prediction of Item Difficulty and Response Time for Multiple-Choice Questions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:UnibucLLM: Harnessing LLMs for Automated Prediction of Item Difficulty and Response Time for Multiple-Choice Questions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators