Write description about the dataset


Assignment Overview:

This assignment will test your skill to collect, summarize and present data using Microsoft Excel and/or other approved tools. It will also test your understanding to interpret the output produced by the software to solve business problems.

You will need to use the dataset provided as well as collecting your own dataset and produce a numerical and graphical summary. You will need to submit an Excel file following the requirement as explained below.

Assignment Description:

There are two datasets involved in this assignment: Dataset 1 and Dataset 2, detailed below.

Dataset 1: You will receive an email that contains a dataset that is specifically allocated to you. This dataset is edited from the original dataset provided by "Inside AirBNB" compiled on the 14 September 2019.

Dataset 2: You will need to collect a dataset via survey to answer the question given in Section 6 below. You will need to collect data from at least 30 international students.

Your tasks are to provide a description for each dataset in Section 1, and to answer the following research questions given in Section 2 to Section 6 using dataset 1 or dataset 2 as indicated in each section.

Section 1: Description about Data:

a. Dataset 1: Give a short but clear description about this dataset. Is this primary or secondary data? What are the cases? How many variables are there in the dataset?

b. Dataset 2: Explain how you collect the data and discuss its limitation (e.g. whether your sample is biased). Is this primary or secondary data? What are the variables and their types?

Section 2: What are the proportions of different room types of AirBNB in Sydney?

Using Dataset 1, describe the proportion of the different room types available for rent in Sydney AirBNB. You need to provide the frequency and the proportion (either as a decimal or a percentage) as well as graphical display that easily shows the proportion of the room types.

Section 3: What is the AirBNB price distribution of private room after an iteration of outlier removal?

Using Dataset 1, perform one iteration of outlier detection on the price of private rooms using the method described in the lecture notes. After removing those outliers, describe the price distribution of private rooms using both numerical and graphical summary which shows the remaining outliers, if any.

Section 4: Is there a difference in the number of available days in the next 365 days among different room types?

Using Dataset 1, describe the distribution of availability for 365 days in the future, for each room type. You need to provide both numerical summary as well as graphical display which shows the outliers, if any.

Section 5: Is there any relationship between Longitude and Price?

Using Dataset 1, describe the relationship between the longitude of an AirBNB property location and its price. You need to provide both numerical summary as well as graphical display.

Section 6: Is there any relationship between gender and room type accommodation?

Using Dataset 2, describe the relationship between the gender of an international student and their current room type accommodation, e.g. whether the student currently shares a room, lives in a private room (but shares an apartment or a house) or lives in an apartment or a house by him/herself. You need to provide both numerical summary and graphical display.

Be it be any problem, regarding the topic Statistics and Data Analysis, the professionals of the Statistics and Data Analysis Assignment Help service is always here to assist all the students, who are not able to write and submit a good quality and high-scoring assignment paper.

Tags: Statistics and Data Analysis Assignment Help, Statistics and Data Analysis Homework Help, Statistics and Data Analysis Solved Assignments, Statistics and Data Analysis Coursework, Longitude and Price Assignment Help, Longitude and Price Homework Help

Attachment:- Statistics and Data Analysis.rar

Request for Solution File

Ask an Expert for Answer!!
Applied Statistics: Write description about the dataset
Reference No:- TGS03026594

Expected delivery within 24 Hours