Consider other variables that are not listed in Table 3.1 that might by useful in the classification problem. Write functions to derive them from the messages in email Struct and add them to email DF. Refit the classification tree with the enhanced data frame. Were these new variables chosen to partition the messages? Is the error in classification improved?