DSE 230. Scalable Data Analysis (4 units)
Link to catalog page: https://catalog.ucsd.edu/courses/MAS.html#dse230
Description
The course exercises the data scientist’s scalability toolbox, covering concepts such as map-reduce, streaming analysis, and external-memory algorithms, as well as their implementation options in popular frameworks (e.g., Hadoop and its ecosystem: HBase, Hive, Pig, and Spark). The class will include assignments that involve analyzing large existing databases.
Prerequisite courses
DSE 230 has no prerequisite courses.
Successor courses
No courses have DSE 230 as a prerequisite.