1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.
a. How many subjects have missing values for medlonic and act94?
b. Is there a severe split in frequencies between groups?
According to the descriptive analysis, no severe split is detected. This is also reflected in the skewness number which is lower than .5.
c. What are the cutoff values for outliers in each group?
d. Which outlying cases should be deleted for each group?
Average ACT score 1994 Stem-and-Leaf Plot for
medloinc= below the median for low inc % 1993
Frequency Stem & Leaf
7.00 14 . 1223789
9.00 15 . 234478888
5.00 16 . 12788
4.00 17 . 1378
2.00 18 . 09
1.00 19 . 6
3.00 20 . 069
1.00 Extremes (>=22.5)
Stem width: 1.0
Each leaf: 1 case(s)
e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?
Tests of Normality
|
|
above or below median loinc
|
Kolmogorov-Smirnova
|
Shapiro-Wilk
|
|
Statistic
|
df
|
Sig.
|
Statistic
|
df
|
Sig.
|
average ACT score 1994
|
below the median for low inc % 1993
|
.162
|
32
|
.032
|
.903
|
32
|
.007
|
above the median for low inc % 1993
|
.166
|
32
|
.025
|
.921
|
32
|
.023
|
According to the information and the test of normality, it appears that this is a normal distribution. Therefore, for the transformation, we would select 'Square Root."
|
f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.
In running the test, there were no significant differences between the categories. Therefore; we can assume that this indicates homogeneity of variance.
2. Examination of the variable of scienc93 indicates a substantial to serve positively skewed distribution. Transform this variable using the most two appropriate methods. After examining the distribution for these transformed variables, which produced the best alteration?