TY - GEN
T1 - Collective development of large scale data science products via modularized assignments
T2 - 51st ACM SIGCSE Technical Symposium on Computer Science Education, SIGCSE 2020
AU - Bhavya,
AU - Boughoula, Assma
AU - Green, Aaron
AU - Zhai, Cheng Xiang
N1 - Publisher Copyright:
© 2020 Copyright held by the owner/author(s). Publication rights licensed to ACM.
PY - 2020/2/26
Y1 - 2020/2/26
N2 - Many universities are offering data science (DS) courses to fulfill the growing demands for skilled DS practitioners. Assignments and projects are essential parts of the DS curriculum as they enable students to gain hands-on experience in real-world DS tasks. However, most current assignments and projects are lacking in at least one of two ways: 1) they do not comprehensively teach all the steps involved in the complete workflow of DS projects; 2) students work on separate problems individually or in small teams, limiting the scale and impact of their solutions. To overcome these limitations, we envision novel synergistic modular assignments where a large number of students work collectively on all the tasks equired to develop a large-scale DS product. The resulting product can be continuously improved with students' contributions every semester. We report our experience with developing and deploying such an assignment in an Information Retrieval course. Through the assignment, students collectively developed a search engine for finding expert faculty specializing in a given field. This shows the utility of such assignments both for teaching useful DS skills and driving innovation and research. We share useful lessons for other instructors to adopt similar assignments for their DS courses.
AB - Many universities are offering data science (DS) courses to fulfill the growing demands for skilled DS practitioners. Assignments and projects are essential parts of the DS curriculum as they enable students to gain hands-on experience in real-world DS tasks. However, most current assignments and projects are lacking in at least one of two ways: 1) they do not comprehensively teach all the steps involved in the complete workflow of DS projects; 2) students work on separate problems individually or in small teams, limiting the scale and impact of their solutions. To overcome these limitations, we envision novel synergistic modular assignments where a large number of students work collectively on all the tasks equired to develop a large-scale DS product. The resulting product can be continuously improved with students' contributions every semester. We report our experience with developing and deploying such an assignment in an Information Retrieval course. Through the assignment, students collectively developed a search engine for finding expert faculty specializing in a given field. This shows the utility of such assignments both for teaching useful DS skills and driving innovation and research. We share useful lessons for other instructors to adopt similar assignments for their DS courses.
KW - Experience report
KW - Practical data science education
KW - Synergistic modular assignments
UR - http://www.scopus.com/inward/record.url?scp=85081537039&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85081537039&partnerID=8YFLogxK
U2 - 10.1145/3328778.3366961
DO - 10.1145/3328778.3366961
M3 - Conference contribution
AN - SCOPUS:85081537039
T3 - SIGCSE 2020 - Proceedings of the 51st ACM Technical Symposium on Computer Science Education
SP - 1200
EP - 1206
BT - SIGCSE 2020 - Proceedings of the 51st ACM Technical Symposium on Computer Science Education
Y2 - 11 March 2020 through 14 March 2020
ER -