Problem
Make a subset of the dataset using R from Part1 with only the top 5...
Make a subset of the dataset from Part1 with only the top 5 departments based on the number of employees working in that department. The top 5 departments should be computed using R code. Then, use %in% operator to create the required subset. Use a sample size of 50 for each of the following. Set the start seed for random numbers as 5495
For reference: the data set is
boston <- read.csv( "https://people.bu.edu/kalathur/datasets/bostonCityEarnings.csv", colClasses = c("character", "character", "character", "integer", "character"))
1) Order the data using the Department variable.
2) Draw a stratified sample using proportional sizes based on the Department variable.
3) Show the frequencies for the selected departments.
4) Show the percentages of these with respect to sample size.