TY - GEN
T1 - Effect of discrete and continuous parameter variation on difficulty in automatic item generation
AU - Chen, Binglin
AU - Zilles, Craig
AU - West, Matthew
AU - Bretl, Timothy
N1 - Publisher Copyright:
© Springer Nature Switzerland AG 2019.
PY - 2019
Y1 - 2019
N2 - Automatic item generation enables a diverse array of questions to be generated through the use of question templates and randomly-selected parameters. Such automatic item generators are most useful if the generated item instances are of either equivalent or predictable difficulty. In this study, we analyzed student performance on over 300 item generators from four university-level STEM classes collected over a period of two years. In most cases, we find that the choice of parameters fails to significantly affect the problem difficulty. In our analysis, we found it useful to distinguish parameters that were drawn from a small number (<10) of values from those that are drawn from a large—often continuous—range of values. We observed that values from smaller ranges were more likely to significantly impact difficulty, as sometimes they represented different configurations of the problem (e.g., upward force vs. downward force). Through manual review of the problems with significant difficulty variance, we found it was, in general, easy to understand the source of the variance once the data were presented. These results suggest that the use of automatic item generation by college faculty is warranted, because most problems don’t exhibit significant difficulty variation, and the few that do can be detected through automatic means and addressed by the faculty member.
AB - Automatic item generation enables a diverse array of questions to be generated through the use of question templates and randomly-selected parameters. Such automatic item generators are most useful if the generated item instances are of either equivalent or predictable difficulty. In this study, we analyzed student performance on over 300 item generators from four university-level STEM classes collected over a period of two years. In most cases, we find that the choice of parameters fails to significantly affect the problem difficulty. In our analysis, we found it useful to distinguish parameters that were drawn from a small number (<10) of values from those that are drawn from a large—often continuous—range of values. We observed that values from smaller ranges were more likely to significantly impact difficulty, as sometimes they represented different configurations of the problem (e.g., upward force vs. downward force). Through manual review of the problems with significant difficulty variance, we found it was, in general, easy to understand the source of the variance once the data were presented. These results suggest that the use of automatic item generation by college faculty is warranted, because most problems don’t exhibit significant difficulty variation, and the few that do can be detected through automatic means and addressed by the faculty member.
KW - Automatic item generation
KW - Item models
UR - http://www.scopus.com/inward/record.url?scp=85068322313&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85068322313&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-23204-7_7
DO - 10.1007/978-3-030-23204-7_7
M3 - Conference contribution
AN - SCOPUS:85068322313
SN - 9783030232030
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 71
EP - 83
BT - Artificial Intelligence in Education - 20th International Conference, AIED 2019, Proceedings
A2 - Isotani, Seiji
A2 - Millán, Eva
A2 - Ogan, Amy
A2 - McLaren, Bruce
A2 - Hastings, Peter
A2 - Luckin, Rose
PB - Springer
T2 - 20th International Conference on Artificial Intelligence in Education, AIED 2019
Y2 - 25 June 2019 through 29 June 2019
ER -