TY - GEN
T1 - Unsupervised Approach for Modeling Content Structures of MOOCs
AU - ALSaad, Fareedah
AU - Alawini, Abdussalam
N1 - Publisher Copyright:
© 2020 Proceedings of the 13th International Conference on Educational Data Mining, EDM 2020. All rights reserved.
PY - 2020
Y1 - 2020
N2 - With the increased number of MOOC offerings, it is unclear how these courses are related. Previous work has focused on capturing the prerequisite relationships between courses, lectures, and concepts. However, it is also essential to model the content structure of MOOC courses. Constructing a precedence graph that models the similarities and variations of learning paths followed by similar MOOCs would help both students and instructors. Students can personalize their learning by choosing the desired learning path and lectures across several courses guided by the precedence graph. Similarly, by examining the precedence graph, instructors can 1) identify knowledge gaps in their MOOC offerings, and 2) find alternative course plans. In this paper, we propose an unsupervised approach to build the precedence graph of similar MOOCs, where nodes are clusters of lectures with similar content, and edges depict alternative precedence relationships. Our approach to cluster similar lectures based on PCK-Means clustering algorithm that incorporates pairwise constraints: Must-Link and Cannot-Link with the standard K-Means algorithm. To build the precedence graph, we link the clusters according to the precedence relations mined from current MOOCs. Experiments over real-world MOOC data show that PCK-Means with our proposed pairwise constraints outperform the K-Means algorithm in both Adjusted Mutual Information (AMI) and Fowlkes-Mallows scores (FMI).
AB - With the increased number of MOOC offerings, it is unclear how these courses are related. Previous work has focused on capturing the prerequisite relationships between courses, lectures, and concepts. However, it is also essential to model the content structure of MOOC courses. Constructing a precedence graph that models the similarities and variations of learning paths followed by similar MOOCs would help both students and instructors. Students can personalize their learning by choosing the desired learning path and lectures across several courses guided by the precedence graph. Similarly, by examining the precedence graph, instructors can 1) identify knowledge gaps in their MOOC offerings, and 2) find alternative course plans. In this paper, we propose an unsupervised approach to build the precedence graph of similar MOOCs, where nodes are clusters of lectures with similar content, and edges depict alternative precedence relationships. Our approach to cluster similar lectures based on PCK-Means clustering algorithm that incorporates pairwise constraints: Must-Link and Cannot-Link with the standard K-Means algorithm. To build the precedence graph, we link the clusters according to the precedence relations mined from current MOOCs. Experiments over real-world MOOC data show that PCK-Means with our proposed pairwise constraints outperform the K-Means algorithm in both Adjusted Mutual Information (AMI) and Fowlkes-Mallows scores (FMI).
KW - Alternative Learning Paths
KW - Clustering
KW - Common Learning Path
KW - Pairwise Constraints
KW - Precedence Graph
KW - Precedence Relations
UR - http://www.scopus.com/inward/record.url?scp=85173242971&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85173242971&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85173242971
T3 - Proceedings of the 13th International Conference on Educational Data Mining, EDM 2020
SP - 18
EP - 28
BT - Proceedings of the 13th International Conference on Educational Data Mining, EDM 2020
A2 - Rafferty, Anna N.
A2 - Whitehill, Jacob
A2 - Romero, Cristobal
A2 - Cavalli-Sforza, Violetta
PB - International Educational Data Mining Society
T2 - 13th International Conference on Educational Data Mining, EDM 2020
Y2 - 10 July 2020 through 13 July 2020
ER -