2020-2021 Catalog 
    
    Mar 28, 2024  
2020-2021 Catalog [ARCHIVED CATALOG]

Add to Portfolio (opens a new window)

MSDS 694 - Distributed Computing


Unit(s): 1

Students learn the MapReduce technique of distributed computing. The fundamental principles are first learned with the Python multiprocessing library, in which students build their own con-current MapReduce framework. Considerable time is spent exploring practical application of mapping and reducing for various types of real world data. Distributed statistical and machine learning approaches are explored. Finally, Hadoop streaming MapReduce jobs (in Python) are launched on AWS-EMR.


Restriction: Level Restricted to Graduate; Field of study restricted to Data Science Major
College of Arts and Sciences



Add to Portfolio (opens a new window)