question 1 in an article a random sample of 42


Question 1. In an article, a random sample of 42 felons convicted of "impulsive murder" was compared to a random sample of 40 felons convicted of premeditated murder.  Both samples of felons were released from prison before completing the original sentence.  For the sample of 42 convicted of "impulsive murder," 13 recorded parole violations.  For the sample of 40 convicted of premeditated murder, 22 recorded parole violations

Can the statistician infer at the 1% significance level that the proportion of parole violations is greater for the felons convicted of premeditated murder?

H0:

H1:

Test statistic:

Rejection region:

Calculated test statistic:

Conclusion:

Estimate with 99% confidence the difference in population proportions.

Question 2. In an important study centered on teenage smoking, a sample of 70 high school students in Edmonton claimed they smoked an average of 24.64 cigarettes per day, with a standard deviation of 9.23 cigarettes.  A different sample of 70 high school students in Ontario claimed that, on average, they smoked 22.84 cigarettes daily, with a standard deviation of 8.75.  At the 5% level of significance, test the hypothesis that the average number of cigarettes smoked per day is the same in both places.

H0:

H1:

Test statistic:

Rejection region:

Calculated test statistic:

Question 3. Briefly describe how you would diagnose each of the following conditions/problems related to regression analysis.

a) the potential for multicollinearity

b) the probability distribution of the error variable is normal

c) the mean of the distribution of the error variable is zero

d) the standard deviation of the error variable is constant (i.e., homoscedastic)

Question 4. With regard to an indicator such as infant deaths per 1,000 births, it is a pretty safe assumption that, as a whole, the developed nations in Europe, USA, and Japan statistically differ from the less developed countries in South America, the Middle East, and Asia.  The question is, do infant deaths per 1,000 in the less-developed countries in Latin America, the Middle East, and Africa differ?  The data below are reported by country in each region.

South America (n=12)

Middle East (n=10)

Africa (n=14)

25.7

109.9

16.0

40.0

181.6

119.0

32.0

111.0

21.9

108.1

71.0

6.1

91.0

75.0

63.0

23.3

69.0

76.0

24.0

68.0

128.0

56.0

43.0

9.7

26.0

107.7

45.0

7.5

40.0

42.0

44.0

15.6

19.4

28.0

 

63.0

17.1

 

 

 

 

 

Conduct an analysis of variance for this data at the 5% level of significance.  For this problem, SST = 2547.30 and SSE = 56966.04. H0:

H1:

Test statistic:

Rejection region:

Calculated test statistic:

Conclusion:

Question 5.  An English teacher investigated some of the factors that affect an individual student's final grade in his course. She proposed the multiple regression model:

598_What is the coefficient of determination1.png

        where

 y  =  final mark (out of 100)

x1  =  number of lectures skipped

x2  =  number of late assignments

x3  =  mid-term test mark (out of 100)

            The teacher recorded the data for 50 randomly selected students. The computer output is shown below.

                THE REGRESSION EQUATION IS

218_What is the coefficient of determination.png

 

Predictor

 

Coef

StDev

T

Constant

 

41.6

 

17.8

 

2.337

 

x1

 

-3.18

 

 

1.66

 

 

-1.916

 

x2

 

-1.17

 

 

1.13

 

 

-1.035

 

x3

 

0.63

 

 

0.13

 

 

4.846

 

       R2 = 30.0%

  Analysis of Variance

Source of Variation

 

df

 

SS

 

MS

 

F

 

Regression

 

3

 

3716

 

1238.667

 

6.558

 

Error

 

46

 

8688

 

188.870

 

 

Total

49

 

12404

 

 

 

a) Do these data provide enough evidence to conclude at the 5% significance level that the model is useful in predicting the final mark?  Explain.

b) Do these data provide enough evidence to conclude at the 5% significance level that the final mark and the number of skipped lectures are linearly related?  Explain.

c) Do these data provide enough evidence at the 1% significance level to conclude that the final mark and the mid-term mark are linearly related?  Explain.

d) What is the coefficient of determination?  What does this statistic tell you?

e) Interpret the coefficients b1 and b3.  Be specific.

Solution Preview :

Prepared by a verified Expert
Basic Statistics: question 1 in an article a random sample of 42
Reference No:- TGS0442610

Now Priced at $40 (50% Discount)

Recommended (98%)

Rated (4.3/5)