Goal for Lab 7
• Objectives:
• Understanding the concept of MapReduce Model
• Building a file system in a distributed way that store large data trunks
• Merging all things together into a docker image
• Writing some sample demo on running MapReduce models
To Start
• Read the Hadoop wiki page and learn it
• https://wiki.apache.org/hadoop
• Install a Hadoop Docker image, you know where to find it
• Finish the report on your experience of realizing two classic demos
• Word Count: https://hadoop.apache.org/docs/stable/hadoop-mapreduceclient/hadoop-mapreduce-client-
core/MapReduceTutorial.html#Example:_WordCount_v1.0
• Quantity Analysis: https://docs.mongodb.com/manual/tutorial/map-reduceexamples/