Instructions for stage 3, All
Overview: Each student will be given a sample from a large population, and each student will analyse their own sample. The data and a short description of the data are given in a separate Excel file:
https://app.box.com/s/zlzjujmo3cq1jst35jummv8118pc0h0e
Section 0: Cover sheet
You do need a cover sheet with the following:
Student number (this is very important: it lets the marker check that you have used numbers based on your student number)
Section 1: Introduction
Give an introduction to the assignment
Section 2: The problems of getting survey data in the real world
The data set in the assignment is not real-world data. Every student needs their own sample, both to prevent copying and to help you understand the main concept in statistics, the "sampling distribution of an estimate".
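The idea of a sampling distribution can be illustrated with a short simulation. The following is a minimal sketch only: the population is made-up income data (not the assignment data), and the function name simulate_sample_means is hypothetical. It draws many samples from one population and shows that the sample means vary from sample to sample around the population mean.

```python
import random
import statistics


def simulate_sample_means(population, sample_size, n_samples, seed=0):
    """Draw n_samples random samples (without replacement) and return their means."""
    rng = random.Random(seed)
    return [
        statistics.mean(rng.sample(population, sample_size))
        for _ in range(n_samples)
    ]


# Hypothetical population of 10,000 incomes (made-up numbers for illustration).
population_rng = random.Random(1)
population = [population_rng.gauss(30000, 8000) for _ in range(10000)]

# Each "student" gets a sample of 50; here we imitate 200 students.
sample_means = simulate_sample_means(population, sample_size=50, n_samples=200)

print("population mean:      ", round(statistics.mean(population)))
print("mean of sample means: ", round(statistics.mean(sample_means)))
print("spread of sample means:", round(statistics.stdev(sample_means)))
```

The spread of the sample means is much smaller than the spread of the individual incomes, which is why an estimate based on a sample can still be informative about the population.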
Discuss the problems of getting survey data in the real world, and discuss the ethical issues you need to consider when gathering survey data. Also give an example of a real-world product and suitable survey questions for it.
Section 3: Description of the data set
Describe the data set and each of its variables.
For each variable, state whether it is categorical or numerical.
Section 4: Summary of the data set
Use the filter given in the data to find your sample, and use Excel or suitable webpages such as https://www.calculatorsoup.com/calculators/statistics/descriptivestatistics.php
https://www.shodor.org/interactivate/activities/Histogram/
https://www.wessa.net/rwasp_backtobackhist.wasp
to do the following.
a) For each variable below, give graphical and numerical summaries that describe the variable (in other words, give the appropriate univariate statistics and graphical displays).
i) variable: Income
ii) variable: How much they would pay?
Also provide appropriate comments.
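For anyone who prefers to check their Excel or webpage results with a script, the univariate summaries in part a) could be sketched as below. This is an illustration under assumptions: the income values are invented, and the helper names describe and text_histogram are hypothetical. Note that different tools use slightly different quartile rules, so the quartiles may not match Excel exactly.

```python
import statistics


def describe(values):
    """Return common univariate statistics for a numerical variable."""
    # statistics.quantiles with n=4 returns the three quartile cut points
    # (default method is "exclusive"; other tools may use a different rule).
    q1, _median, q3 = statistics.quantiles(values, n=4)
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
        "min": min(values),
        "q1": q1,
        "q3": q3,
        "max": max(values),
    }


def text_histogram(values, bin_width):
    """Return a simple text histogram, one line per non-empty bin."""
    counts = {}
    for v in values:
        start = (v // bin_width) * bin_width  # left edge of the bin
        counts[start] = counts.get(start, 0) + 1
    lines = []
    for start in sorted(counts):
        bar = "*" * counts[start]
        lines.append(f"{start:>8} to {start + bin_width:<8} {bar}")
    return "\n".join(lines)


# Invented income values, for illustration only (not the assignment data).
income = [21000, 25000, 27000, 30000, 32000, 35000, 41000, 58000]

summary = describe(income)
for name, value in summary.items():
    print(f"{name}: {value}")
print(text_histogram(income, bin_width=10000))
```

A mean noticeably larger than the median, as in this invented sample, is the kind of feature worth mentioning in your comments because it suggests a right-skewed variable.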
b) For each pair of variables below, give appropriate graphical and numerical summaries that describe the relationship between the two variables (in other words, give the appropriate bivariate statistics and graphical displays).
i) The variables: Gender and "do they like the product"
ii) The variables: "How much they would pay?" and gender
Also provide appropriate comments.
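The two kinds of bivariate summary in part b) can be sketched as follows: a two-way table of counts for two categorical variables, and a comparison of group means for a numerical variable split by a categorical one. The records, the column names ("gender", "likes", "pay") and the helper names cross_tab and group_means are all assumptions made for this illustration; they are not taken from the assignment file.

```python
from collections import Counter, defaultdict
import statistics


def cross_tab(rows, row_key, col_key):
    """Count how many records fall in each (row category, column category) cell."""
    return Counter((r[row_key], r[col_key]) for r in rows)


def group_means(rows, group_key, value_key):
    """Return the mean of a numerical variable within each category."""
    groups = defaultdict(list)
    for r in rows:
        groups[r[group_key]].append(r[value_key])
    return {group: statistics.mean(vals) for group, vals in groups.items()}


# Invented records, for illustration only (not the assignment data).
sample = [
    {"gender": "F", "likes": "yes", "pay": 12.0},
    {"gender": "F", "likes": "no", "pay": 8.0},
    {"gender": "F", "likes": "yes", "pay": 10.0},
    {"gender": "M", "likes": "yes", "pay": 9.0},
    {"gender": "M", "likes": "no", "pay": 5.0},
    {"gender": "M", "likes": "no", "pay": 7.0},
]

table = cross_tab(sample, "gender", "likes")
for (gender, likes), count in sorted(table.items()):
    print(gender, likes, count)

pay_by_gender = group_means(sample, "gender", "pay")
print(pay_by_gender)
```

For two categorical variables, a clustered bar chart is a common graphical companion to the table; for a numerical variable split by gender, side-by-side boxplots or back-to-back histograms are common choices.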
Section 5: Confidence intervals
5a) Find a 95% confidence interval for the proportion of people who prefer version 2.
5b) Considering only the people who like the product, find a 95% confidence interval for the mean of the variable "How much they would pay".
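The two intervals in Section 5 can be sketched as below. This is a minimal illustration, not a prescribed method: it assumes the usual large-sample (Wald) interval for a proportion, p ± 1.96 × sqrt(p(1 − p)/n), and an interval for the mean of the form mean ± critical value × s/sqrt(n). The function names and all numbers are invented. For a small sample, the critical value for the mean should come from the t distribution with n − 1 degrees of freedom (for example from tables or Excel) rather than 1.96.

```python
import math
import statistics


def proportion_ci(successes, n, z=1.96):
    """Large-sample (Wald) confidence interval for a proportion.

    z=1.96 corresponds to 95% confidence.
    """
    p = successes / n
    se = math.sqrt(p * (1 - p) / n)  # standard error of the sample proportion
    return p - z * se, p + z * se


def mean_ci(values, critical=1.96):
    """Confidence interval for a mean: mean +/- critical * s / sqrt(n).

    The default 1.96 is the large-sample normal value for 95% confidence.
    For small samples, pass the t critical value with n - 1 degrees of freedom.
    """
    n = len(values)
    m = statistics.mean(values)
    se = statistics.stdev(values) / math.sqrt(n)  # standard error of the mean
    return m - critical * se, m + critical * se


# Invented example: 40 of 100 people in a sample prefer version 2.
prop_low, prop_high = proportion_ci(successes=40, n=100)
print(f"proportion: {prop_low:.3f} to {prop_high:.3f}")

# Invented example: amounts five people who like the product would pay.
# With n = 5, the 95% t critical value for 4 degrees of freedom is 2.776.
pay = [8, 10, 12, 9, 11]
mean_low, mean_high = mean_ci(pay, critical=2.776)
print(f"mean: {mean_low:.2f} to {mean_high:.2f}")
```

Before computing the interval for 5b, remember to filter your sample down to the people who like the product, so that n and the standard deviation refer only to that group.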
Assignment link - https://www.dropbox.com/s/pewqshz7oylv26w/Assignment.rar?dl=0