Outliers have a more dramatic effect on smaller data sets. The data file "Beatles" contains the sizes (duration in seconds and file size in MB) of the 27 number 1 hits on the Beatles album 1.
a) Generate the box plot and histogram of the sizes of these songs.
b) Identify any outliers. What is the size of the outlying song, in minutes and megabytes?
c) What is the effect of excluding this song on the mean and median of the sizes of the songs?
d) Which summary, the mean or median, is the better summary of the center of the distribution of sizes?
e) Which summary, the mean or median, is the more useful summary if you want to know whether you can fit this album on your iPod?
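
For parts b) and c), the comparison can be sketched in Python. This is only an illustration under stated assumptions: the durations below are made-up values, not the contents of the "Beatles" file, the helper name `iqr_outliers` is hypothetical, and the 1.5 × IQR fence is the common box plot convention, which may differ slightly from the quartile method your software uses.

```python
import statistics

def iqr_outliers(values):
    """Return the values lying outside the 1.5 * IQR fences (common box plot rule)."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return [v for v in values if v < low or v > high]

# Made-up song durations in seconds (NOT the Beatles data).
durations = [120, 130, 140, 150, 160, 170, 180, 430]

outliers = iqr_outliers(durations)
trimmed = [v for v in durations if v not in outliers]

# Mean and median with and without the flagged values.
mean_all, median_all = statistics.mean(durations), statistics.median(durations)
mean_trim, median_trim = statistics.mean(trimmed), statistics.median(trimmed)

print("outliers (s):", outliers)
print("outliers (min):", [v / 60 for v in outliers])
print("mean:", mean_all, "->", mean_trim)
print("median:", median_all, "->", median_trim)
```

With these illustrative values, dropping the single long song moves the mean from 185 to 150 seconds, while the median only moves from 155 to 150. The same pattern is what parts c) and d) ask you to look for in the real data.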