PDF

Description

This report presents an educational computing environment for data science education at scale, highlighted in use at the University of California, Berkeley. With the rise of online learners in massively open computing courses (MOOCs), we detail a relevant technical case study of the decisions made in converting an introductory undergraduate data science course into a series of data science edX MOOCs. The focus of this study is on the student and instructor workflow, distributed system infrastructure, cost analysis, cloud resource allocation, and autograding integration in the scaling process. We implement an analytics pipeline for collecting data from Jupyter notebooks and propose a Deep Knowledge Tracing modification to model student progress on coding assignments.

Details

Files

Statistics

from
to
Export
Download Full History