Problem
1. Would we expect map skew to be a bigger problem when there are ten reducers or a hundred reducers?
2. Would we expect the problem of map skew to increase or decrease when we combine counts from each file before emitting them?
3. For each of the following The Quant Shop prediction challenges dream up the most massive possible data source that might reasonably exist, who might have it, and what biases might lurk in its view of the world.
(a) Miss Universe.
(b) Movie gross.
(c) Baby weight.
(d) Art auction price.
(e) White Christmas.
(f) Football champions.
(g) Ghoul pool.
(h) Gold/oil prices.