2021-2022 Catalog 
    
    Apr 30, 2024  
2021-2022 Catalog [ARCHIVED CATALOG]

Add to Portfolio (opens a new window)

MSDS 694 - Distributed Computing


Unit(s): 1

Students learn the MapReduce technique of distributed computing. The fundamental principles are first learned with the Python multiprocessing library, in which students build their own con-current MapReduce framework. Considerable time is spent exploring practical application of mapping and reducing for various types of real world data. Distributed statistical and machine learning approaches are explored. Finally, Hadoop streaming MapReduce jobs (in Python) are launched on AWS-EMR.


Restriction: Level Restricted to Graduate; Field of study restricted to Data Science Major
College of Arts and Sciences



Add to Portfolio (opens a new window)