Problem
a) Describe the workings of the JobTracker and TaskTracker in MapReduce.
b) What are the different ways in which MapReduce provides fault tolerance?
c) What is the Shuffle procedure in MapReduce?
d) Describe the MapReduce join procedures for Sort-Merge join, Partition join, N-way Map-side join, and Simple N-way join.
e) How does the ResourceManager work in YARN?