Part 1
ANALYSIS ASSIGNMENT 1
QUESTIONS
1. Determine the range, standard deviation, and variance for the indicated data set.
a. SET A
b. SET B
c. SET C
d. SET D
e. SET E
f. SET F
Set A Ages of people that attended an Alcoholics Anonymous (AA) therapy session: 25,
44, 20, 37, 56, 51, 34, 41, 36, 61, 33, 31, 47, 63, 67
SET B Grades on the second test of ECT 120 class: 77, 80, 87, 67, 88, 77, 12, 93, 78, 44,
71, 86, 40, 100, 76, 83, 66, 62, 89, 79, 81, 52, 30, 71, 76, 100, 74, 65, 82, 100,
56, 35, 51, 59, 29,13, 71
SET C Number of defective resistors per package of 100: 5, 10, 3, 2, 1, 9, 4, 10, 5, 3, 5,
5, 7, 11, 20
SET D States within the USA with National Parks:
STATE
|
Number of National Parks
|
Alaska
|
8
|
Arizona
|
3
|
Arkansas
|
1
|
California
|
8
|
Colorado
|
2
|
Florida
|
3
|
Hawaii
|
2
|
Idaho
|
1
|
Kentucky
|
1
|
Maine
|
1
|
Michigan
|
1
|
Minnesota
|
1
|
Montana
|
2
|
Nevada
|
2
|
New Mexico
|
1
|
North Carolina
|
1
|
North Dakota
|
1
|
Oregon
|
1
|
South Dakota
|
2
|
Tennessee
|
1
|
Texas
|
2
|
Utah
|
5
|
Virginia
|
1
|
Washington
|
3
|
Wyoming
|
2
|
SET E Normal Monthly Precipitation in MiamiFlorida. Data taken over 30 years from
1961- 1990.
Month
|
Precipitation
Inches (in.)
|
January
|
2.01
|
February
|
2.08
|
March
|
2.39
|
April
|
2.85
|
May
|
6.21
|
June
|
9.33
|
July
|
5.70
|
August
|
7.58
|
September
|
7.63
|
October
|
5.64
|
November
|
2.66
|
December
|
1.83
|
SET F Annually Year Precipitation in selected US Cities. Data taken over 30 years from 1961- 1990.
City
|
Precipitation
Inches (in.)
|
New York City
|
47.25
|
Dodge City, KS
|
21.49
|
Detroit, MI
|
32.62
|
Honolulu, HI
|
22.02
|
Atlanta, GA
|
50.77
|
Hilo, HI
|
129.19
|
Tallahassee, FL
|
65.71
|
Orlando, FL
|
48.11
|
Bakersfield, CA
|
5.72
|
Alamosa, CO
|
7.57
|
Chicago, IL
|
35.85
|
Los Angeles, CA
|
14.77
|
Bridgeport, CT
|
41.66
|
New Orleans, LA
|
61.88
|
Las Vegas, NV
|
4.13
|
Binghamton, NY
|
36.99
|
Dallas, TX
|
33.70
|
Houston
|
46.07
|
Seattle
|
38
|
2. Jacklyn scored 100, 95, 80, and 92 on her last four calculus tests. What must she score on the last test in order to earn the grade of A? The grading scale of an A is 90 -100.
3. Ms. Alicia Renee Mc Cormick's Report Card
Course Credit Course Letter Grade
4 B
3 C
3 A
1 A
3 B
3 A
What is Alicia's GPA?
4. Below is Tom's grade distribution and recorded scores from his physics class. What is Tom's grade (weighted average)?
Sample Grade Distribution Chart
HW 10% [Drop the lowest]
Midterm 25%
Final 30%
Quiz 30% [Drop the lowest]
Lab Reports 15%
HW: 25 60 90 85 100 100 100 100 95
Quiz: 65 75 100 69 100 100 100 100 100
Lab Reports: 80 90 85 80 85 90 85 100 100
Midterm: 85
Final: 88
5. What is Tom's grade using the following grading scale? Refer to question 4.
Grade
|
PointRange
|
A+
|
100
|
A
|
90-100
|
B
|
80-89
|
C
|
70-79
|
D
|
60-69
|
F
|
0-59
|
Part 2
ANALYSIS ASSIGNMENT 2
1. The Chi-Square Test
Jury Selection (adapted from the Freedman, Pisani, Purves classic text)
One study of grand juries in Alameda County, California, compared the demographic characteristics of jurors with the general population, to see if jury panels were representative. The results for age are shown below. The investigators wanted to know if the 66 jurors were selected at random from the population of Alameda County. (Only persons over 21 and over are considered; the county age distribution is known from Public Health Department data.) The study was published in the UCLA Law Review.
Age
|
Count-wide %
|
# of jurors observed
|
# of jurors expected
|
(O-E)
|
(O-E)2/E
|
21-40
|
42%
|
5
|
|
|
|
41-50
|
23%
|
9
|
|
|
|
51-60
|
16%
|
19
|
|
|
|
over 60
|
19%
|
33
|
|
|
|
Total
|
100%
|
66
|
|
|
|
a) Should you do a test of homogeneity or a test of independence? Why?
b) Do we have evidence that grand juries are selected at random for the population of Alameda County?
2. Use MATLAB to generate a linear (1st Order) approximation for the population in the USA from 1986 to 2005.
3. Use MATLAB to generate a quadratic (2nd Order) approximation for the population in the USA from 1986 to 2005.
Population in the United States of America from 1986 to 2005.
Year
|
Population
|
1986
|
240,132,887
|
1987
|
242,288,918
|
1988
|
244,498,982
|
1989
|
246,819,230
|
1990
|
249,464,396
|
1991
|
252,153,092
|
1992
|
255,029,699
|
1993
|
257,782,608
|
1994
|
260,327,021
|
1995
|
262,803,276
|
1996
|
265,228,572
|
1997
|
267,783,607
|
1998
|
270,248,003
|
1999
|
272,690,813
|
2000
|
281,421,906
|
2001
|
285,317,559
|
2002
|
287,973,924
|
2003
|
290,788,976
|
2004
|
293,656,842
|
2005
|
296,410,404
|
Part -3: PROJECT
Pick one of the datasets on the CD-ROM that came with the new copy of the textbook. Data files are available in ASCII, Excel, Minitab, JMP, SAS, SPSS, and S-Plus formats. Alternatively, you can use any of the datasets of the PowerPoint presentations or Reading Materials.You must state in your project where your dataset came from. However, in either case make sure that the data is curvilinear and suitable for multiple regression.
1) Create a multiple regression model.
2) Formulate a null and alternative hypotheses.
3) Identify and explain the test statistic that will be used.
4) Run the Multiple Regression Software Tool to fully analyze the data and report its results.
5) Reject or do not reject your null hypotheses and explain why.
6) Accept or do not accept your alternative hypothesis.
7) Explain why your model is the best fit for the data.