Assignment -
For this assignment use the "Red Bluff Golf Course Data".csv file from the Topic 6 Assignment as a starting point. The model is specifically focused on the "TotalSales" and "QuantitySold" fields (variables) in the file.
Using IBM SPSS Modeler, format the model shown in the "Topic 7 Assignment Model Requirements" resource.
As depicted in the resource, the model requires the nodes described below.
1. Excel Source Node: Use the same "Red Bluff Golf Course Data" .csv file from the Topic 6 assignment. Resave this file as an Excel (.xlsx) file.
2. Type Node: Ensure that all the data is "read" into this node. Provide a screenshot of the field headers and the first few rows of the data.
3. Data Audit Node: Perform a data audit on the "TotalSales" and "QuantitySold" fields. Are there any missing data for these fields? Do any extreme outliers exist for these fields? Provide a screenshots of the "Audit" and "Quality" results.
4. Table Node: Use this node to ensure that the data successfully loaded into the Type Node. Ensure that the data here matches what is in the original Excel source file. Provide a screenshot of the "Table" output that includes the field headers and first few rows of the dataset.
5. Statistics Node: Run all available descriptive statistics on the "TotalSales" and "QuantitySold" fields. Which field exhibits the more variability and why? Perform a correlation analysis on the "TotalSales" and "QuantitySold" fields. What is the general correlation between these fields: high, medium, or low? Justify your selection. Provide a screenshot of the "Statistics Report."
6. Histogram Nodes: Create a histogram for "TotalSales" and create a histogram for "QuantitySold." Do these fields appear to have normal distributions? Why or why not? Provide a screenshot of the histograms.
7. Plot Node: Create a scatterplot using "QuantitySold" as the x variable and "TotalSales" as the y variable. What is the general relationship between these variables? Provide a screenshot of the scatter plot.
8. Regression Node: Run a regression analysis using "QuantitySold" as the x variable and "TotalSales" as the y variable. In the "Model" tab, ensure that the "Include constant in equation" box is checked. Leave all other settings unchanged.
9. Gold Nugget Node: This node should appear after executing the regression analysis. Export the "Output" report (from the Advanced tab) to a Word file.
10. Hint: The derived regression equation and r-squared results obtained here should be the same as you obtained in the Topic 6 assignment for the simple regression analysis for "QuantitySold" and "TotalSales."
Attachment:- Assignment & Data File.rar