Although you will be provided with guidance with regard to addressing the assignment tasks, you will need to complete the tasks in your own time. The document “Instructions for Obtaining Excel Outputs for Part III of the BEO1106 Assignment” (see the Assessment Information page on the unit website) provides “step by step” instructions for obtaining the Excel outputs that are necessary in any of the assignment tasks.
To avoid any complications associated with misplaced assignments, make a photocopy of your assignment before you hand in the original to your tutor for correction.
Presentation
• Your answers must be presented in task number order and be clearly labelled with the appropriate task number. Answers to each task must start on a new page.
• Your assignment must be presented in Microsoft (MS) Word. Copy and paste any relevant Excel outputs to this document immediately before any relevant written answers to each task.
• If you are unfamiliar with the use of the MS Word Equations Editor, you may write algebraic/mathematical/statistical symbols and notation in neat handwritten form.
• Your answers must be clear. You must highlight relevant items on any required Excel outputs and make reference to them in your written answers.
• When asked to perform a manual calculation (i.e. the use of MS Excel is not specified) you must show all working. This must include intermediate steps where relevant. Failure to do so will result in a loss of marks.
• Completed assignments are to be presented for correction on A4 paper, stapled in the top left hand corner. Please print on one side of the paper only.
• Do not submit the assignment with fancy bindings, folders or plastic envelopes.
• An Assessment Declaration (see the Assessment Information page on the unit website) is required and must be stapled to the front of your assignment.
• Do not include the assignment questions nor the population property data with your submitted assignment.
• You are permitted to consult reference textbooks and notes and to communicate with other students. However, the work you hand in for correction must your own. Be aware that the University penalties for plagiarism are severe.
• Your corrected Part II of the assignment must be submitted with Part III in order for Part III to be corrected.
Introduction:
The Assignment Data (PopulationPropertyData2014.xls) file, which you can access from the Assessment Information page on the unit website contains, in the range A1:I401, real estate sales data for a population of 400 properties around Melbourne in a particular week. In Part I of the assignment you have selected a random sample of 50 properties each containing observations, where appropriate, of the eight variables V1 to V8. In Part II of the assignment you have performed some statistical analyses on a number of these variables using your sample data file, SamplePropertyData.xls. The variables in the data set are as follows:
V1 = Region around Melbourne where property is located (1 = North, 2 = West, 3 = East, 4 = Central)
V2 = Property type (0 = Unit, 1 = House)
V3 = Sale result (1 = Sold at auction, 2 = Passed-in, 3 = Private sale, 4 = Sold before auction). Note that a blank cell for this variable simply indicates that the property did not sell.
V4 = Building type (1 = Brick, 2 = Brick veneer, 3 = Weatherboard, 4 = Vacant land)
V5 = Number of rooms
V6 = Land size (Square metres)
V7 = Sold Price ($000s)
V8 = Advertised Price ($000s)
Column A (PN), contains the property identification numbers for the 400 properties.
You should continue the required work on Part III of the assignment using the Excel worksheet, SamplePropertyData.xls, that you were working on at the completion of Part II of the assignment.
Assignment Tasks (Part III):
Answers to the assignment tasks must be based on the sample data file that you created in Part I of the assignment. As for Part II, most tasks in Part III of the assignment require you to obtain an Excel output prior to performing some analysis. Copy and paste these outputs to your assignment MS Word document immediately preceding any subsequent analysis. Explanations must be precise and to the point. Charts and tables must have appropriate titles and numerical values must be rounded to an appropriate number of decimal places and accompanied by the correct units of measure.
There are four tasks in Part III of the assignment. You must meet all task requirements to receive full marks. No marks will be awarded to a task that requires an Excel output, if a print copy of the output has not been included with your answer. No marks will be awarded if only the Excel outputs are submitted without comments, explanations or the analysis required in the task.
The total mark available for the entire assignment is 60. The total mark you receive for your assignment will be converted to a mark out of 20 before being aggregated with your test and examination marks to produce your final result for the unit
Task:
A reminder, in this task, to make sure that you show any necessary working.
Failure to do so will result in the loss of marks.
(a) By reference to numerical summary measures in the Descriptive Statistics table obtained in Task, provide three pieces of distinct evidence that might suggest that your sample “Sold Price” data has been obtained from a normally distributed population. What is your conclusion? Note: Make sure your three pieces of distinct evidence only contain one relating to the shape of the sample data.
(b) Regardless of your conclusion in (a), assume your sample “Sold Price” data have been obtained from a normally distributed population, and calculate, using Standard Normal tables, approximately how many “Sold Price” data values in your sample you would expect to lie within 1.5 standard deviations of the mean (i.e. between z = –1.5 and z = +1.5).
(c) Use your sorted “Sold Price” sample data from Task 4, and the mean and standard deviation from the Descriptive Statistics table of Task, to manually count the number of “Sold Price” data values in your sample that lie within 1.5 standard deviations of the mean. State whether this count matches, approximately, your answer to (b) and hence whether this result confirms (or not) your conclusion in (a).
Task:
(a) Use Excel to produce a Descriptive Statistics table for the “Sold Price” variable in your sample suitable for constructing an interval estimate of the population mean “Sold Price”. Hence determine:
(i) A point estimate of the mean “Sold Price” of the population of properties.
(ii) A 90% confidence interval estimate of the mean “Sold Price” of the population of properties.
(b) Make a brief verbal statement explaining the meaning of the confidence interval estimate obtained in (a) in the context of the variable in this task.
(c) If the population mean “Sold Price” is actually 650 ($000s), would you consider the interval estimate obtained in (a), to be satisfactory? Explain why or why not.
Task:
(a) By reference to the Descriptive Statistics table obtained in Task, determine a 99% confidence interval estimate of the mean “Sold Price” of the population of properties using the following formula:
(sample statistic) ? (critical z or t) ? (standard error of the sample statistic)
(b) Comment on the precision of this interval, in particular, compare the precision associated with this interval with that obtained in Task 7. Explain why the direction of the change in precision that you have observed in these two intervals ought to be obvious prior to constructing the two intervals.
Task:
(a) Use Excel to produce a Descriptive Statistics table for the brick veneer properties in your sample suitable for constructing an interval estimate of the population proportion of brick veneer properties. Hence determine:
(i) A point estimate of the proportion of brick veneer properties in the population.
(ii) A 99% confidence interval estimate of the proportion of brick veneer properties in the population.
(b) Make a brief verbal statement explaining the meaning of the confidence interval estimate obtained in (a) in the context of the variable in this task.
(c) If the population proportion of brick veneer properties is actually 42%, would you consider the interval estimate obtained in (a), to be satisfactory? Explain why or why not.