Problem
Go to the Kaggle website and search for the diabetes dataset; it will appear first in the results. Download it. The variable names would be:
Answer the following descriptive statistics questions about the diabetes dataset using R code in RStudio:
1. Create a function that splits a data frame into numerical and categorical variables, and apply it to the diabetes dataset.
2. Perform a descriptive statistical analysis of the numerical variables that includes measures of central tendency and measures of variability. Include relevant graphs.
3. Perform a descriptive statistical analysis of the categorical variables that includes the most frequent values per variable. Include relevant graphs.
4. Compare numerical variables across the categories of selected categorical variables.
5. Check the relationship between some numerical variables using a measure of association.
6. Analyze the dependency between categorical variables using a test of independence and a goodness-of-fit test.
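
As a starting point, the six tasks above could be sketched in base R roughly as follows. This is only a minimal sketch under stated assumptions: the helper name split_by_type, the toy data frame, and its column names (glucose, bmi, gender, outcome) are all hypothetical stand-ins, since the variable names of the downloaded file are not listed here. The chi-squared tests are one common choice for independence and goodness of fit, not the only one.

```r
# Hypothetical helper (task 1): split a data frame into numerical and
# categorical (everything non-numeric) columns.
split_by_type <- function(df) {
  is_num <- vapply(df, is.numeric, logical(1))
  list(
    numerical   = df[, is_num, drop = FALSE],
    categorical = df[, !is_num, drop = FALSE]
  )
}

# Toy data standing in for the Kaggle file; column names are assumptions.
# With the real file you would start from something like
#   diabetes <- read.csv("diabetes.csv")   # assumed file name
toy <- data.frame(
  glucose = c(85, 140, 99, 160, 120, 110, 92, 105),
  bmi     = c(22.1, 31.4, 25.0, 35.2, 28.3, 26.7, 23.5, 24.9),
  gender  = factor(c("F", "M", "F", "M", "F", "F", "M", "F")),
  outcome = factor(c("no", "yes", "no", "yes", "yes", "no", "no", "no"))
)

parts <- split_by_type(toy)

# Task 2: central tendency and variability for each numerical variable.
num_summary <- sapply(parts$numerical, function(x) {
  c(mean = mean(x), median = median(x), sd = sd(x), IQR = IQR(x))
})

# Task 3: most frequent category (mode) of each categorical variable.
modes <- sapply(parts$categorical, function(x) names(which.max(table(x))))

# Task 4: compare a numerical variable across categories.
glucose_by_outcome <- aggregate(glucose ~ outcome, data = toy, FUN = mean)

# Task 5: Pearson correlation as a measure of association.
r_glucose_bmi <- cor(toy$glucose, toy$bmi, method = "pearson")

# Task 6: chi-squared test of independence and goodness-of-fit test
# (against equal proportions). Warnings are suppressed only because the
# toy data set is too small for the chi-squared approximation.
indep <- suppressWarnings(chisq.test(table(toy$gender, toy$outcome)))
gof   <- suppressWarnings(chisq.test(table(toy$outcome)))

print(num_summary)
print(modes)
print(glucose_by_outcome)
```

For the graphs, base R functions such as hist() for numerical variables, barplot(table(...)) for categorical ones, and boxplot(glucose ~ outcome, data = toy) for group comparisons would be natural candidates, again assuming these column names.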