Article (Scientific journals)
LLM-based automatic short answer grading in undergraduate medical education.
GREVISSE, Christian
2024In BMC Medical Education, 24 (1), p. 1060
Peer Reviewed verified by ORBi
 

Files


Full Text
s12909-024-06026-5.pdf
Publisher postprint (2.46 MB)
Request a copy

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Automatic short answer grading; GPT-4; Gemini; Large language models; Medical education
Abstract :
[en] BACKGROUND: Multiple choice questions are heavily used in medical education assessments, but rely on recognition instead of knowledge recall. However, grading open questions is a time-intensive task for teachers. Automatic short answer grading (ASAG) has tried to fill this gap, and with the recent advent of Large Language Models (LLM), this branch has seen a new momentum. METHODS: We graded 2288 student answers from 12 undergraduate medical education courses in 3 languages using GPT-4 and Gemini 1.0 Pro. RESULTS: GPT-4 proposed significantly lower grades than the human evaluator, but reached low rates of false positives. The grades of Gemini 1.0 Pro were not significantly different from the teachers'. Both LLMs reached a moderate agreement with human grades, and a high precision for GPT-4 among answers considered fully correct. A consistent grading behavior could be determined for high-quality keys. A weak correlation was found wrt. the length or language of student answers. There is a risk of bias if the LLM knows the human grade a priori. CONCLUSIONS: LLM-based ASAG applied to medical education still requires human oversight, but time can be spared on the edge cases, allowing teachers to focus on the middle ones. For Bachelor-level medical education questions, the training knowledge of LLMs seems to be sufficient, fine-tuning is thus not necessary.
Disciplines :
Human health sciences: Multidisciplinary, general & others
Computer science
Education & instruction
Author, co-author :
GREVISSE, Christian  ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Life Sciences and Medicine (DLSM) > Medical Education
External co-authors :
no
Language :
English
Title :
LLM-based automatic short answer grading in undergraduate medical education.
Publication date :
27 September 2024
Journal title :
BMC Medical Education
eISSN :
1472-6920
Publisher :
Springer Science and Business Media LLC, England
Volume :
24
Issue :
1
Pages :
1060
Peer reviewed :
Peer Reviewed verified by ORBi
Available on ORBilu :
since 30 September 2024

Statistics


Number of views
19 (4 by Unilu)
Number of downloads
1 (1 by Unilu)

Scopus citations®
 
1
Scopus citations®
without self-citations
1
OpenCitations
 
0
OpenAlex citations
 
1

Bibliography


Similar publications



Contact ORBilu