TY - GEN
T1 - Hierarchical Dependencies in Classroom Settings Influence Algorithmic Bias Metrics
AU - Belitz, Clara
AU - Lee, Hae Jin
AU - Nasiar, Nidhi
AU - Fancsali, Stephen E.
AU - Ritter, Steve
AU - Almoubayyed, Husni
AU - Baker, Ryan S.
AU - Ocumpaugh, Jaclyn
AU - Bosch, Nigel
N1 - Publisher Copyright:
© 2024 ACM.
PY - 2024/3/18
Y1 - 2024/3/18
N2 - Measuring algorithmic bias in machine learning has historically focused on statistical inequalities pertaining to specific groups. However, the most common metrics (i.e., those focused on individual- or group-conditioned error rates) are not currently well-suited to educational settings because they assume that each individual observation is independent from the others. This is not statistically appropriate when studying certain common educational outcomes, because such metrics cannot account for the relationship between students in classrooms or multiple observations per student across an academic year. In this paper, we present novel adaptations of algorithmic bias measurements for regression for both independent and nested data structures. Using hierarchical linear models, we rigorously measure algorithmic bias in a machine learning model of the relationship between student engagement in an intelligent tutoring system and year-end standardized test scores. We conclude that classroom-level influences had a small but significant effect on models. Examining significance with hierarchical linear models helps determine which inequalities in educational settings might be explained by small sample sizes rather than systematic differences.
AB - Measuring algorithmic bias in machine learning has historically focused on statistical inequalities pertaining to specific groups. However, the most common metrics (i.e., those focused on individual- or group-conditioned error rates) are not currently well-suited to educational settings because they assume that each individual observation is independent from the others. This is not statistically appropriate when studying certain common educational outcomes, because such metrics cannot account for the relationship between students in classrooms or multiple observations per student across an academic year. In this paper, we present novel adaptations of algorithmic bias measurements for regression for both independent and nested data structures. Using hierarchical linear models, we rigorously measure algorithmic bias in a machine learning model of the relationship between student engagement in an intelligent tutoring system and year-end standardized test scores. We conclude that classroom-level influences had a small but significant effect on models. Examining significance with hierarchical linear models helps determine which inequalities in educational settings might be explained by small sample sizes rather than systematic differences.
KW - Algorithmic bias
KW - Intelligent tutoring systems
KW - Interactive learning environments
KW - Predictive analytics
UR - http://www.scopus.com/inward/record.url?scp=85187554409&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85187554409&partnerID=8YFLogxK
U2 - 10.1145/3636555.3636869
DO - 10.1145/3636555.3636869
M3 - Conference contribution
AN - SCOPUS:85187554409
T3 - ACM International Conference Proceeding Series
SP - 210
EP - 218
BT - LAK 2024 Conference Proceedings - 14th International Conference on Learning Analytics and Knowledge
PB - Association for Computing Machinery
T2 - 14th International Conference on Learning Analytics and Knowledge, LAK 2024
Y2 - 18 March 2024 through 22 March 2024
ER -