A real estate association in a suburban community would like to study the relationship between the size of a single-family house (as measured by number of rooms) and the selling price of the house (in $ thousands). Two different neighborhoods are included in the study, one on the east side of the community (=0) and the other on the west side (=1). A random sample of 20 houses was selected, with the results stored in Neighbor. For (a) through (k), do not include an interaction term.
a. State the multiple regression equation that predicts the selling price, based on the number of rooms and the neighborhood.
b. Interpret the regression coefficients in (a).
c. Predict the selling price for a house with nine rooms that is located in the east-side neighborhood. Construct a 95% confidence interval estimate and 95% prediction interval. (use PHstat)
d. Perform a residual analysis on the results and determine whether the regression assumptions are valid.
e. Is there a significant relationship between selling price and the two independent variables (rooms and neighborhood) at the 0.05 level of significance?
f. At the 0.05 level of significance, determine whether each independent variable makes a contribution to the regression model. Indicate the most appropriate regression model for this set of data.
g. Construct and interpret a 95% confidence interval estimate of the population slope for the relationship between selling price and number of rooms.
h. Construct and interpret a 95% confidence interval estimate of the population slope for the relationship between selling price and neighborhood.
i. Compute and interpret the adjusted r2.
j. (omit...)
k. What assumption do you need to make about the slope of selling price with number of rooms.
l. Add an interaction term to the model and, at the 0.05 level of significance, determine whether it makes a significant contribution to the model.
m. On the basis of the results of (f) and (l), which model is most appropriate? Explain.