Article (Scientific journals)
Docimological Quality Analysis of LLM-Generated Multiple Choice Questions in Computer Science and Medicine
GREVISSE, Christian; PAVLOU, Maria Angeliki; SCHNEIDER, Jochen
2024In SN Computer Science, 5
Peer Reviewed verified by ORBi
 

Files


Full Text
_s42979-024-02963-6.pdf
Publisher postprint (2.42 MB)
Request a copy

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Automatic question generation; Large language models; Multiple choice questions; Generative pre-trained transformer; Moodle
Abstract :
[en] Assessment is an essential part of education, both for teachers who assess their students as well as learners who may evaluate themselves. Multiple-choice questions (MCQ) are one of the most popular types of knowledge assessment, e.g., in medical education, as they can be automatically graded and can cover a wide range of learning items. However, the creation of high-quality MCQ items is a time-consuming task. The recent advent of Large Language Models (LLM), such as Generative Pre-trained Transformer (GPT), caused a new momentum for automatic question generation solutions. Still, evaluating generated questions according to the best practices for MCQ item writing is needed to ensure docimological quality. In this article, we propose an analysis of the quality of LLM-generated MCQs. We employ zero-shot approaches in two domains, namely computer science and medicine. In the former, we make use of 3 GPT-based services to generate MCQs. In the latter, we developed a plugin for the Moodle learning management system that generates MCQs based on learning material. We compare the generated MCQs against common multiple-choice item writing guidelines. Among the major challenges, we determined that while LLMs are certainly useful in generating MCQs more efficiently, they sometimes create broad items with ambiguous keys or implausible distractors. Human oversight is also necessary to ensure instructional alignment between generated items and course contents. Finally, we propose solutions for AQG developers.
Disciplines :
Computer science
Human health sciences: Multidisciplinary, general & others
Author, co-author :
GREVISSE, Christian  ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Life Sciences and Medicine (DLSM) > Medical Education
PAVLOU, Maria Angeliki  ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Life Sciences and Medicine (DLSM) > Medical Education
SCHNEIDER, Jochen ;  University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Medical Translational Research
External co-authors :
no
Language :
English
Title :
Docimological Quality Analysis of LLM-Generated Multiple Choice Questions in Computer Science and Medicine
Publication date :
10 June 2024
Journal title :
SN Computer Science
ISSN :
2662-995X
eISSN :
2661-8907
Publisher :
Springer Nature, Singapore, Singapore
Special issue title :
Advances in Applied Informatics
Volume :
5
Peer reviewed :
Peer Reviewed verified by ORBi
Available on ORBilu :
since 10 June 2024

Statistics


Number of views
63 (22 by Unilu)
Number of downloads
4 (4 by Unilu)

Scopus citations®
 
2
Scopus citations®
without self-citations
2
OpenAlex citations
 
1

Bibliography


Similar publications



Contact ORBilu