|
|
Dec 21, 2024
|
|
MSDS 694 - Distributed Computing Unit(s): 1
Students learn the MapReduce technique of distributed computing. The fundamental principles are first learned with the Python multiprocessing library, in which students build their own con-current MapReduce framework. Considerable time is spent exploring practical application of mapping and reducing for various types of real world data. Distributed statistical and machine learning approaches are explored. Finally, Hadoop streaming MapReduce jobs (in Python) are launched on AWS-EMR.
Restriction: Level Restricted to Graduate; Field of study restricted to Data Science Major College of Arts and Sciences
Add to Portfolio (opens a new window)
|
|
|