TY - GEN
T1 - Informing Expert Feature Engineering through Automated Approaches
T2 - 13th International Conference on Learning Analytics and Knowledge: Towards Trustworthy Learning Analytics, LAK 2023
AU - Hur, Paul
AU - Machaka, Nessrine
AU - Krist, Christina
AU - Bosch, Nigel
N1 - Publisher Copyright:
© 2023 ACM.
PY - 2023/3/13
Y1 - 2023/3/13
N2 - While classroom video data are detailed sources for mining student learning insights, their complex and unstructured nature makes them less than straightforward for researchers to analyze. In this paper, we compared the differences between the processes of expert-informed manual feature engineering and automated feature engineering using positional data for predicting student group interaction in four middle school and high school mathematics classroom videos. Our results highlighted notable differences, including improved model accuracy for the combined (manual features + automated features) models compared to the only-manual-features models (mean AUC = .778 vs. .706) at the cost of feature interpretability, increased number of features for automated feature engineering (1523 vs. 178), and engineering approach (domain-agnostic in automated vs. domain-knowledge-informed in manual). We carried out feature importance analyses and discuss the implications of the results for potentially augmenting human perspectives about qualitatively coding classroom video data by confirming and expanding views on which body areas and characteristics may be relevant to the target interaction behavior. Lastly, we discuss our study's limitations and future work.
AB - While classroom video data are detailed sources for mining student learning insights, their complex and unstructured nature makes them less than straightforward for researchers to analyze. In this paper, we compared the differences between the processes of expert-informed manual feature engineering and automated feature engineering using positional data for predicting student group interaction in four middle school and high school mathematics classroom videos. Our results highlighted notable differences, including improved model accuracy for the combined (manual features + automated features) models compared to the only-manual-features models (mean AUC = .778 vs. .706) at the cost of feature interpretability, increased number of features for automated feature engineering (1523 vs. 178), and engineering approach (domain-agnostic in automated vs. domain-knowledge-informed in manual). We carried out feature importance analyses and discuss the implications of the results for potentially augmenting human perspectives about qualitatively coding classroom video data by confirming and expanding views on which body areas and characteristics may be relevant to the target interaction behavior. Lastly, we discuss our study's limitations and future work.
KW - classroom video data
KW - expert-informed feature engineering
KW - student positional data
UR - http://www.scopus.com/inward/record.url?scp=85149316543&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85149316543&partnerID=8YFLogxK
U2 - 10.1145/3576050.3576090
DO - 10.1145/3576050.3576090
M3 - Conference contribution
AN - SCOPUS:85149316543
T3 - ACM International Conference Proceeding Series
SP - 630
EP - 636
BT - LAK 2023 Conference Proceedings - Towards Trustworthy Learning Analytics - 13th International Conference on Learning Analytics and Knowledge
PB - Association for Computing Machinery
Y2 - 13 March 2023 through 17 March 2023
ER -