All calculations and relevant Minitab output must be included to receive full credit.
Be sure to word-process your solutions and copy and paste the appropriate outputs from Minitab.
Show all steps used in arriving at the final answers.
Incomplete solutions will receive partial credit. This exam covers content from Modules One through Three.
(1) Consider the GASTURBINE data set and corresponding output from Minitab.
Note that all tests should be performed at the α = 0.05 level. Use the complete data set in your analysis.
The first 10 observations are given for illustrative purposes.
Complete parts a) through f) below.
ENGINE
|
SHAFTS
|
RPM
|
CPRATIO
|
INLET-TEMP
|
EXH-TEMP
|
AIRFLOW
|
POWER
|
HEATRATE
|
Traditional
|
1
|
27245
|
9.2
|
1134
|
602
|
7
|
1630
|
14622
|
Traditional
|
1
|
14000
|
12.2
|
950
|
446
|
15
|
2726
|
13196
|
Traditional
|
1
|
17384
|
14.8
|
1149
|
537
|
20
|
5247
|
11948
|
Traditional
|
1
|
11085
|
11.8
|
1024
|
478
|
27
|
6726
|
11289
|
Traditional
|
1
|
14045
|
13.2
|
1149
|
553
|
29
|
7726
|
11964
|
Traditional
|
1
|
6211
|
15.7
|
1172
|
517
|
176
|
52600
|
10526
|
Traditional
|
1
|
6210
|
17.4
|
1177
|
510
|
193
|
57500
|
10387
|
Traditional
|
1
|
3600
|
13.5
|
1146
|
503
|
315
|
89600
|
10592
|
Traditional
|
1
|
3000
|
15.1
|
1146
|
524
|
375
|
113700
|
10460
|
Traditional
|
1
|
3000
|
15
|
1171
|
525
|
514
|
164300
|
10086
|
Regression Analysis: HEATRATE versus RPM, CPRATIO, ...
The regression equation is
HEATRATE = 14314 + 0.0806 RPM - 6.8 CPRATIO - 9.51 INLET-TEMP + 14.2 EXH-TEMP
- 2.55 AIRFLOW + 0.00426 POWER
Predictor Coef SE Coef T P
Constant 14314 1112 12.87 0.000
RPM 0.08058 0.01611 5.00 0.000
CPRATIO -6.78 30.38 -0.22 0.824
INLET-TEMP -9.507 1.529 -6.22 0.000
EXH-TEMP 14.155 3.469 4.08 0.000
AIRFLOW -2.553 1.746 -1.46 0.149
POWER 0.004257 0.004217 1.01 0.317
S = 458.757
R-Sq = 92.5%
R-Sq(adj) = 91.7%
Analysis of Variance
Source DF SS MS F P
Regression 6 155269735 25878289 122.96 0.000
Residual Error 60 12627473 210458
Total 66 167897208
1. Write a first-order model in general form for the model that includes RPM, CPRATIO, INLET-TEMP, EXH-TEMP, AIRFLOW, and
POWER to predict HEATRATE.
2. Write out the least squares prediction equation for the model that was fit in Minitab.
3. Calculate and give an interpretation of the coefficients based on a one-unit change in each xi.
4. Calculate and give an interpretation of the effect on HEATRATE based on a 1-unit change in AIRFLOW together with a 200-unit change in POWER.
5. Interpret the overall model F-test.
State the appropriate hypothesis test and associated numerator and denominator degrees of freedom used for this test as well as the critical value that the test statistic is compared to.
State the conclusion you would make regarding the null hypothesis.
Specifically, would you reject or fail to reject the null hypothesis, and what does this conclusion means about the model parameters?
Does this tell us anything about the significance of the individual predictors? Why or why not?
6. Report and interpret the model R2.
7.Which predictors are significant in the model? Report the appropriate hypothesis test and formal conclusion you would make regarding RPM and CPRATIO. In your conclusion, state their p-values and test statistics.
Would you suggest removing all non-significant predictors at once and refitting the model? Why or why not?
(2) Using the FLAG data set (first 10 observations given), fit a model that predicts COST based on DOTEST.
Use the complete data set in your analysis.
Show the relevant output from Minitab in your answer. The first 10 observations are given for informational purposes.
CONTRACT
|
COST
|
DOTEST
|
STATUS
|
1
|
1379.43
|
1386.29
|
1
|
2
|
134.03
|
85.71
|
1
|
3
|
202.33
|
248.89
|
0
|
4
|
397.12
|
467.49
|
0
|
5
|
158.54
|
117.72
|
1
|
6
|
1128.11
|
1008.91
|
1
|
7
|
400.33
|
472.98
|
1
|
8
|
581.64
|
785.39
|
0
|
9
|
353.96
|
370.02
|
0
|
10
|
138.71
|
174.25
|
0
|
Calculate a confidence and prediction interval for DOTEST = 100.
Interpret the confidence and prediction intervals given in the output.
Do you see any problems with the interpretation of the prediction interval in terms of what we are trying to predict?
Why are confidence intervals always more narrow than prediction intervals?
(3) Consider the EXPRESS data set (first 10 observations given).
Use the complete data set in your analysis. Show the relevant output from Minitab in your answers.
The first 10 observations are given for illustrative purposes.
Weight
|
Distance
|
Cost
|
5.9
|
47
|
2.6
|
3.2
|
145
|
3.9
|
4.4
|
202
|
8
|
6.6
|
160
|
9.2
|
0.75
|
280
|
4.4
|
0.7
|
80
|
1.5
|
6.5
|
240
|
14.5
|
4.5
|
53
|
1.9
|
0.6
|
100
|
1
|
7.5
|
190
|
14
|
1. Draw a scatterplot of Cost vs. each of the predictors. Do you see any evidence of a quadratic relationship?
2. Write a general second-order model (not including interaction terms) for Cost(y).
3. Give the null and alternative hypothesis for determining whether both of the second-order terms are statistically significant (nested model hypothesis).
4. Identify which of the two general nested models is the complete model and which is the reduced model.
5. Using Minitab, produce an output and write the least squares regression equation for the second-order model AND the reduced model that was fit in Minitab.
6. Compute the test statistic and perform the appropriate F-test. Be sure to state the degrees of freedom and the correct F critical value that you are comparing your test statistic to. Formally state your conclusion. Hint: To compute the test statistic, you need to separately fit both the complete and reduced models. To fit the complete model, you need to add the appropriate variables to your data set.
7. Consider the EXPRESS data set (first 10 observations given). Use the complete data set in your analysis. Show the relevant output from Minitab in your answers. The first 10 observations are given for illustrative purposes.
Weight
|
Distance
|
Cost
|
5.9
|
47
|
2.6
|
3.2
|
145
|
3.9
|
4.4
|
202
|
8
|
6.6
|
160
|
9.2
|
0.75
|
280
|
4.4
|
0.7
|
80
|
1.5
|
6.5
|
240
|
14.5
|
4.5
|
53
|
1.9
|
0.6
|
100
|
1
|
7.5
|
190
|
14
|
1. Write out a complete general first-order model including an interaction term for Cost as the outcome.
2. Using Minitab, produce an output and write the least squares regression equation with the interaction term that was fit in Minitab.
3. If there is a significant interaction effect, but the individual predictors (main effects) that make up that interaction are not significant in the model, would you suggest removing the main effects? Why or why not?