MA 320-1 Project three Description
This assignment is to help you understand the idea of regression analysis and other inferential statistics.
Twenty movies from 1999 and 2000 are listed in the attached Excel sheet with data entered.
You should have four values (variables) for each movie. Two of the values are found through the Rotten Tomatoes website (www.rottentomatoes.com). The 'Tomatometer' is the percentage of movie critics who liked the movie and the 'Audience' value is what website users thought of the movie. Finally, there are values for Worldwide Gross (amount the movie made) and the Production Budget (how much the movie cost) for each movie.
Now do the following:
1) Plot four different regression graphs (X v Y): Tomatometer v Audience, Budget v Gross, Budget v Tomatometer, Audience v Gross and find the best-fit line equation and correlation coefficient (r-value) for each one. X represents the independent variable and Y represents the dependent variable.
2) Use the r-value and Table I to determine if your best-fit lines are significant at 0.05 significant level. (use 18 degrees of freedom here)
Reject H0: p = 0 if the absolute value of r is greater than the value given in the table. The values are for a two-tailed test; d.f. = n - 2.
d.f.
|
α = .05 |
α = 0.01 |
1 |
0.999 |
0.999 |
2 |
0.95 |
0.999 |
|
0.878 |
0.959 |
4 |
0.811 |
0.917 |
5 |
0.754 |
0.875 |
6 |
0.707 |
0.834 |
7 |
0.666 |
0.798 |
8 |
0.632 |
0.765 |
9 |
0.602 |
0.735 |
10 |
0.576 |
0.708 |
11 |
0.553 |
0.684 |
12 |
0.532 |
0.661 |
13 |
0.514 |
0.641 |
14 |
0.497 |
0.623 |
15 |
0.482 |
0.606 |
16 |
0.468 |
0.59 |
17 |
0.456 |
0.575 |
18 |
0.444 |
0.561 |
19 |
0.433 |
0.549 |
20 |
0.423 |
0.537 |
25 |
0.381 |
0.487 |
30 |
249 |
0.449 |
35 |
0.325 |
0.418 |
40 |
0.304 |
0.393 |
45 |
0.288 |
0.372 |
50 |
0.273 |
0.354 |
60 |
0.25 |
0.325 |
70 |
0.232 |
0.302 |
80 |
0.217 |
0.283 |
90 |
0.205 |
0.267 |
100 |
0.195 |
0.254 |
Reject the null hypothesis if the smaller number of positive or negative signs is less than or equal to the value in the table.
|
One-tailed, α = 0.005 |
α = 0.01 |
α = 0.025 |
α = 0.05 |
n |
Two-tailed, α = 0.01 |
α = 0.02 |
α = 0.025 |
α = 0.10 |
8 |
0 |
0 |
0 |
1 |
9 |
0 |
0 |
1 |
1 |
10 |
0 |
0 |
1 |
1 |
11 |
0 |
1 |
1 |
2 |
12 |
1 |
1 |
2 |
2 |
13 |
1 |
1 |
2 |
3 |
14 |
1 |
2 |
3 |
3 |
15 |
2 |
2 |
3 |
3 |
16 |
2 |
2 |
3 |
4 |
17 |
2 |
3 |
4 |
4 |
18 |
3 |
3 |
4 |
5 |
19 |
3 |
4 |
4 |
5 |
20 |
3 |
4 |
5 |
5 |
21 |
4 |
4 |
5 |
5 |
22 |
4 |
5 |
5 |
6 |
23 |
4 |
5 |
6 |
7 |
24 |
5 |
5 |
6 |
7 |
25 |
5 |
6 |
6 |
7 |
3) Calculate and interpret two 95% confidence intervals to estimate the true mean Budget value. The first will assume normality and use a z-interval; the second will assume a t-interval.
4) Finally do a write-up on your results including your regression graphs, your confidence intervals and hypothesis test, and answers to the following questions using your graphs:
A) Using regression, does it appear that critics and audience members agree on what a movie should be rated? Why or why not?
B) Does it appear that large budgets lead to high-grossing movies? That is, are larger budgets associated with higher grossing movies? Why or why not?
C) Does it appear that movie critics like high-budget movies? In other words, is there a relationship between higher ratings and higher budgets? Why or why not?
D) Are higher ratings by audience members associated with higher gross amounts earned by the movie? Why or why not?
The paper should be in Word format with a title page and references. There should be an introductory and a conclusory paragraph. Answers should be in complete sentences free of spelling and grammatical mistakes.
Attachment:- xid.xlsx