This assessment relates to problem solving task 2 - solving a set of given problems through the application of statistical and other decision making techniques.
Please answer all three questions.
Background
A health agency is taking a survey regarding the evaluation of all the hospitals in Melbourne to find out the statistical calculation and analyze the attributes of the data. The agency needs a report that can analyze the number of admissions, type of control and type of service and other factors. They focus on the service provided by each hospital whether it is for non-government, non-federal, for profit, federal government and so on.
For analytical purposes a random sample of 60 data is generated out of 300 population data and the statistical summary of the sample data is tabulated in Table 1.
Table 1 Statistical Summary
|
|
Variable
|
Mean
|
Median
|
Standard
deviation
|
Minimum
|
Maximum
|
Range
|
P value
|
Count
|
Admissions
|
6959.00
|
4636.50
|
6995.56
|
441.00
|
37375.00
|
2668.50
|
0.168
|
60
|
Your overall task is to investigate the effect of various variables on the number of admissions in Melbourne hospitals using various descriptive analyses which is extremely helpful to formulate a conclusion.
To assist with your investigation, you are required to answer questions 1 and 2 below.
Question 1
a) For a series of random samples of 60, are the mean values of these random samples normally distributed? Explain
b) Calculate the standard error of the mean and explain the meaning of this value.
c) Determine the 95% confidence interval and explain its meaning in the context of the overall problem.
d) What is the probability that a sample of 60 hospitals selected at random in the Melbourne area will have a mean greater than 7000.00 admissions?
e) If the admissions times were more variable, what effect would this have on the confidence interval?
Question 2
Assume that the average admission for all hospitals in Melbourne is 7500. Conduct a statistical hypothesis test to determine if the admission of hospitals in Melbourne is significantly different from the average admission 6959. Mention any assumptions and include relevant hypotheses and report the results and conclusion in the conventional manner.
a) Write down both the null and alternative hypotheses
b) Carry out the t test and report the p-value, and the test statistic
c) Write an appropriate conclusion in the context of the problem. Based on your answers to questions 1 and 2 please write a report of the effect ofvarious variables on the number of admissions in Melbourne hospitals
Question 3
Most of the time houses prices depend on the local market conditions. In addition one of the factors is the number of bedrooms (as bedrooms increase prices increases). Recently Come Real Estate Agency has conducted a survey and selected a random sample of 211 for July 2015 sale in Melbourne and the data analyzed is summarized as follows.
SUMMARY OUTPUT
|
Regression Statistics
|
Multiple R
|
0.817326539
|
R Square
|
0.668022671
|
Adjusted R Square
|
0.66615763
|
Standard Error
|
115.8071494
|
Observations
|
180
|
ANOVA
|
|
df
|
SS
|
MS
|
F
|
Significance F
|
Regression
|
1
|
4803674
|
4803674
|
358.1811918
|
1.74254E-44
|
Residual
|
178
|
2387211
|
13411.3
|
|
|
Total
|
179
|
7190885
|
|
|
|
|
Coefficients
|
Standard
Error
|
t Stat
|
P-value
|
Lower 95%
|
Upper 95%
|
Lower 95.0%
|
Upper
95.0%
|
Intercept
|
-137.8814237
|
25.56878832
|
-5.39257
|
2.18686E-07
|
-188.3383819
|
-87.4244654
|
-188.3383819
|
-
87.424465
|
Bedrooms
|
178.6267021
|
9.438326385
|
18.92568
|
1.74254E-44
|
160.0012892
|
197.252115
|
160.0012892
|
197.25212
|
House Price
|
|
|
Mean
|
317.6166667
|
Standard Error
|
14.93923611
|
Median
|
271.8
|
Mode
|
230.5
|
Standard Deviation
|
200.4308849
|
Sample Variance
|
40172.53961
|
Kurtosis
|
9.869866365
|
Skewness
|
2.287541039
|
Range
|
1544.3
|
a) Write down the regression equation.
b) State the R-squared value and the standard error and explain what they mean with respect to the data.
c) Write down the value of the gradient of the regression line and explain what it means for this data.
d) Are the values for the constant and the gradient (slope) significant (i.e. significantly different from zero) in this case? Justify your answer.
e) Conduct a hypothesis test on the slope coefficient to test whether there is a linear relationship between number of bedrooms and prices of the houses. Include the null and alternative hypotheses; key test results and anappropriate conclusion.
f) Does the linear regression provide a good model? Give statistical reasons based on the scatterplot, p-values, the standard error and coefficient of determination.
g) If you were developing a model to predict the prices of the houses on the number of bedrooms, what other factors would you like to be able to include?