Am I Wrong, or Is the Autograder Wrong? Effects of AI Grading Mistakes on Learning

Tiffany Wenting Li, Silas Hsu, Max Fowler, Zhilin Zhang, Craig Zilles, Karrie Karahalios

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Errors in AI grading and feedback often have an intractable set of causes and are, by their nature, difficult to completely avoid. Since inaccurate feedback potentially harms learning, there is a need for designs and workflows that mitigate these harms. To better understand the mechanisms by which erroneous AI feedback impacts students' learning, we conducted surveys and interviews that recorded students' interactions with a short-answer AI autograder for "Explain in Plain English"code reading problems. Using causal modeling, we inferred the learning impacts of wrong answers marked as right (false positives, FPs) and right answers marked as wrong (false negatives, FNs). We further explored explanations for the learning impacts, including errors influencing participants' engagement with feedback and assessments of their answers' correctness, and participants' prior performance in the class. FPs harmed learning in large part due to participants' failures to detect the errors. This was due to participants not paying attention to the feedback after being marked as right, and an apparent bias against admitting one's answer was wrong once marked right. On the other hand, FNs harmed learning only for survey participants, suggesting that interviewees' greater behavioral and cognitive engagement protected them from learning harms. Based on these findings, we propose ways to help learners detect FPs and encourage deeper reflection on FNs to mitigate the learning harms of AI errors.

Original languageEnglish (US)
Title of host publicationICER 2023 - Proceedings of the 2023 ACM Conference on International Computing Education Research V.1
PublisherAssociation for Computing Machinery
Pages159-176
Number of pages18
ISBN (Electronic)9781450399760
DOIs
StatePublished - Aug 7 2023
Event19th Annual ACM International Computing Education Research Conference, ICER 2023 - Chicago, United States
Duration: Aug 7 2023Aug 11 2023

Publication series

NameICER 2023 - Proceedings of the 2023 ACM Conference on International Computing Education Research V.1

Conference

Conference19th Annual ACM International Computing Education Research Conference, ICER 2023
Country/TerritoryUnited States
CityChicago
Period8/7/238/11/23

Keywords

  • AI error
  • Bayesian modeling
  • EiPE
  • autograder
  • automated short answer grading
  • computer science education
  • explain in plain English
  • formative feedback
  • human-AI interaction

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Science Applications
  • Software
  • Education

Fingerprint

Dive into the research topics of 'Am I Wrong, or Is the Autograder Wrong? Effects of AI Grading Mistakes on Learning'. Together they form a unique fingerprint.

Cite this