Assignment Project: Cars Case Study
This project requires you to understand which mode of transport employees prefer for commuting to their office. The dataset "Cars-dataset" includes employee information about their mode of transport as well as their personal and professional details such as age, salary, and work experience. We need to predict whether or not an employee will use a car as a mode of transport, and to identify which variables are significant predictors of this decision.
The following is expected of the candidate in this assessment:
EDA
Perform an EDA on the data
Illustrate the insights based on EDA
What is the most challenging aspect of this problem? What method will you use to deal with it? Comment.
Data Preparation
Prepare the data for analysis
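As a minimal sketch of what data preparation could look like, the R snippet below derives a binary "uses car" target and makes a stratified train/test split. It runs on a small synthetic data frame, not the real Cars-dataset: the column names (Age, Salary, Work.Exp, Transport), the transport categories, and the 70/30 split ratio are all assumptions made for illustration and should be replaced with whatever the actual file contains.

```r
# Synthetic stand-in for the real data. Column names and category
# labels are hypothetical; adjust them to match the actual dataset.
set.seed(42)
n <- 200
cars <- data.frame(
  Age       = round(runif(n, 22, 45)),
  Salary    = round(runif(n, 6, 60), 1),
  Work.Exp  = round(runif(n, 0, 20)),
  Transport = sample(c("Car", "2Wheeler", "Public Transport"), n,
                     replace = TRUE, prob = c(0.15, 0.25, 0.60)),
  stringsAsFactors = FALSE
)

# Binary target: "Yes" if the employee commutes by car, "No" otherwise.
cars$UsesCar <- factor(ifelse(cars$Transport == "Car", "Yes", "No"),
                       levels = c("No", "Yes"))

# Check the class balance before modeling.
print(prop.table(table(cars$UsesCar)))

# Stratified 70/30 split: sample row indices within each class so that
# both partitions keep roughly the same share of car users.
train_idx <- unlist(
  lapply(split(seq_len(n), cars$UsesCar), function(idx) {
    idx[sample.int(length(idx), floor(0.7 * length(idx)))]
  }),
  use.names = FALSE
)
train <- cars[train_idx, ]
test  <- cars[-train_idx, ]
```

Stratifying the split is one design choice among several; it matters mainly when one class is much rarer than the other, which the class-balance table makes easy to check.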
Modeling
Create multiple models and explore how each model performs using appropriate model performance metrics:
KNN
Naive Bayes (Is it applicable here? Comment; if it is not applicable, how could you build an NB model in this case?)
Logistic Regression
Apply both bagging and boosting modeling procedures to create 2 models and compare their accuracy with that of the best model from the step above.
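To illustrate how one of the models listed above could be fitted and scored, here is a hedged sketch of a logistic regression baseline in base R. The data is simulated, and the variable names (Age, Salary, UsesCar), the coefficients used to generate the outcome, and the 0.5 classification threshold are assumptions, not properties of the real dataset. The other models would need add-on packages (for example class::knn, e1071::naiveBayes, ipred::bagging, or gbm::gbm), and the same confusion-matrix metrics could be computed for each of them to make the comparison.

```r
set.seed(1)
n <- 400

# Simulated data, NOT the real Cars-dataset. Names are hypothetical.
d <- data.frame(Age = runif(n, 22, 45), Salary = runif(n, 6, 60))

# Assumed data-generating process: car use becomes more likely
# with higher age and salary.
p <- plogis(-12 + 0.2 * d$Age + 0.1 * d$Salary)
d$UsesCar <- rbinom(n, 1, p)

# Simple random 70/30 train/test split.
train_idx <- sample.int(n, size = round(0.7 * n))
train <- d[train_idx, ]
test  <- d[-train_idx, ]

# Logistic regression; the coefficient table shows which predictors
# are statistically significant.
fit <- glm(UsesCar ~ Age + Salary, data = train, family = binomial)
print(summary(fit)$coefficients)

# Predicted probabilities on the held-out data, thresholded at 0.5.
prob <- predict(fit, newdata = test, type = "response")
pred <- as.integer(prob > 0.5)

# Confusion matrix with fixed levels so it is always 2 x 2.
cm <- table(Actual    = factor(test$UsesCar, levels = 0:1),
            Predicted = factor(pred, levels = 0:1))
print(cm)

accuracy    <- sum(diag(cm)) / sum(cm)
sensitivity <- cm["1", "1"] / sum(cm["1", ])  # share of car users found
specificity <- cm["0", "0"] / sum(cm["0", ])  # share of non-users found
print(c(accuracy = accuracy, sensitivity = sensitivity,
        specificity = specificity))
```

Reporting sensitivity and specificity alongside accuracy is a deliberate choice here: if car users are a small minority, a model that never predicts "car" can still score a high accuracy.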
Actionable Insights & Recommendations
Summarize your findings from the exercise in a concise yet actionable note
Please note the following:
There are two parts to the submission:
The output/report in any file format - the key part of the output is the set of observations and insights from the exploration and analysis
Commented R code in .R or .Rmd
Please don't share only your R code and/or outputs; we also expect some verbiage/story, that is, a meaningful output that you can share in a business environment.
Attachment: Case Study-Cars.rar