Assignment: Intro to Data Science
In this project, you will need to apply what we learned in this class to analyze a real problem and write a report.
You need to find your own dataset and determine the business question. However, if you cannot find a proper dataset, please use the Titanic dataset to finish the project (You get 5% bonus if you use your own dataset instead of Titanic).
Each student will need to:
• Obtain a dataset (Titanicorother).
• Define the business question
• Determine the type of model based on the question (descriptive, predictive)
• Describe the question and your data - in text
• Do the description analysis using statistical numbers and graphs (using table and graphs)
• Do the data analysis using RapidMiner to answer the business question (describe the model, run the model, and show the results). Use one of the methods used in this course.
• Do the model evaluation (evaluate the model to see whether it is good)
• Discuss potential problems and improvements
Format your assignment according to the give formatting requirements:
• The answer must be using Times New Roman font (size 12), double spaced, and typed, with one-inch margins on all sides.
• The response also includes a cover page containing the student's name, the title of the assignment, the course title, and the date. The cover page is not included in the required page length.
• Also include a reference page. The references and Citations should follow APA format. The reference page is not included in the required page length.