DSC 102. Systems for Scalable Analytics (4 units)
Link to catalog page: https://catalog.ucsd.edu/courses/DSC.html#dsc102
Description
This course introduces the principles of computing systems and infrastructure for scaling analytics to large datasets. Topics include memory hierarchy, distributed systems, model selection, heterogeneous datasets, and deployment at scale. The course will also discuss the design of systems such as MapReduce/Hadoop and Spark, in conjunction with their implementation. Students will also learn how dataflow operations can be used to perform data preparation, cleaning, and feature engineering.
Prerequisites: DSC 100. Restricted to students with upper-division standing. Restricted to students within the DS25 major. All other students will be allowed as space permits.
Prerequisite courses
Successor courses
DSC 102 is a prerequisite of the following 2 courses: