The data set shown below gives the calories and salt content in 17 brands of meat hot dogs.
• Enter the data set into excel. Using excel perform a least squares regression on the data set. Use Sodium as the y variable and Calories as the x variable. Make a scatter plot with the regression line plotted as well. Using excel compute the correlation coefficient.
• Insert the plot your created into a word document. Discuss the scatter plot and regression line. Are their outliers? Does your correlation coefficient support your analysis? Does it tell you anything else?
• If there is an outlier remove the outlier from the data set and repeat the previous two parts.
• Can you conclude that hot dogs with more calories will have more sodium (i.e. discuss correlation and causation)?
Brand |
Calories |
Sodium (mg) |
1 |
173 |
458 |
2 |
191 |
506 |
3 |
182 |
473 |
4 |
190 |
545 |
5 |
172 |
496 |
6 |
147 |
360 |
7 |
146 |
387 |
8 |
139 |
386 |
9 |
175 |
507 |
10 |
136 |
393 |
11 |
179 |
405 |
12 |
153 |
372 |
13 |
107 |
144 |
14 |
195 |
511 |
15 |
135 |
405 |
16 |
140 |
428 |
17 |
138 |
339 |