Homework
Consider the traffic accident data set shown in the table below.
Table: Traffic accident data set.
Weather Condition | Driver's Condition | Traffic Violation      | Seat Belt | Crash Severity
Good              | Alcohol-impaired   | Exceed speed limit     | No        | Major
Bad               | Sober              | None                   | Yes       | Minor
Good              | Sober              | Disobey stop sign      | Yes       | Minor
Good              | Sober              | Exceed speed limit     | Yes       | Major
Bad               | Sober              | Disobey traffic signal | No        | Major
Good              | Alcohol-impaired   | Disobey stop sign      | Yes       | Minor
Bad               | Alcohol-impaired   | None                   | Yes       | Major
Good              | Sober              | Disobey traffic signal | Yes       | Major
Good              | Alcohol-impaired   | None                   | No        | Major
Bad               | Sober              | Disobey traffic signal | No        | Major
Good              | Alcohol-impaired   | Exceed speed limit     | Yes       | Major
Bad               | Sober              | Disobey stop sign      | Yes       | Minor
Task
• (a) Show a binarized version of the data set.
• (b) What is the maximum width of each transaction in the binarized data?
• (c) Assuming that the support threshold is 30%, how many candidate and frequent itemsets will be generated?
• (d) Create a data set that contains only the following asymmetric binary attributes: (Weather = Bad, Driver's Condition = Alcohol-impaired, Traffic Violation = Yes, Seat Belt = No, Crash Severity = Major). For Traffic Violation, only None has a value of 0; all other attribute values are assigned 1. Assuming that the support threshold is 30%, how many candidate and frequent itemsets will be generated?
• (e) Compare the number of candidate and frequent itemsets generated in parts (c) and (d).
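As a starting point for the binarization and counting tasks above, the following Python sketch shows one possible way to encode the table and count itemsets level by level. It is an illustration under stated assumptions, not the required solution method. The shortened column labels (Weather, Driver, Violation, SeatBelt, Severity) and the function names binarize, asymmetric, and apriori_counts are hypothetical. The candidate count depends on how candidates are generated; this sketch assumes an Apriori-style join of frequent (k-1)-itemsets followed by subset pruning, so a different generation strategy may give a different number of candidates.

```python
from itertools import combinations

# Shortened column labels are assumptions made for this sketch.
COLUMNS = ["Weather", "Driver", "Violation", "SeatBelt", "Severity"]
ROWS = [
    ("Good", "Alcohol-impaired", "Exceed speed limit", "No", "Major"),
    ("Bad", "Sober", "None", "Yes", "Minor"),
    ("Good", "Sober", "Disobey stop sign", "Yes", "Minor"),
    ("Good", "Sober", "Exceed speed limit", "Yes", "Major"),
    ("Bad", "Sober", "Disobey traffic signal", "No", "Major"),
    ("Good", "Alcohol-impaired", "Disobey stop sign", "Yes", "Minor"),
    ("Bad", "Alcohol-impaired", "None", "Yes", "Major"),
    ("Good", "Sober", "Disobey traffic signal", "Yes", "Major"),
    ("Good", "Alcohol-impaired", "None", "No", "Major"),
    ("Bad", "Sober", "Disobey traffic signal", "No", "Major"),
    ("Good", "Alcohol-impaired", "Exceed speed limit", "Yes", "Major"),
    ("Bad", "Sober", "Disobey stop sign", "Yes", "Minor"),
]


def binarize(rows):
    """Symmetric encoding: every attribute=value pair becomes one binary item."""
    return [frozenset(f"{c}={v}" for c, v in zip(COLUMNS, row)) for row in rows]


def asymmetric(rows):
    """Asymmetric encoding: keep only the five listed attribute values."""
    keep = {"Weather=Bad", "Driver=Alcohol-impaired", "SeatBelt=No", "Severity=Major"}
    transactions = []
    for row in rows:
        items = {f"{c}={v}" for c, v in zip(COLUMNS, row)} & keep
        if row[2] != "None":  # any violation other than None maps to 1
            items.add("Violation=Yes")
        transactions.append(frozenset(items))
    return transactions


def apriori_counts(transactions, minsup):
    """Return (candidates per level, frequent itemsets per level)."""
    n = len(transactions)
    candidates = [frozenset([i]) for i in sorted(set().union(*transactions))]
    n_cand, n_freq = [], []
    k = 1
    while candidates:
        # An itemset is frequent if its support count reaches minsup * n.
        frequent = [c for c in candidates
                    if sum(c <= t for t in transactions) >= minsup * n]
        n_cand.append(len(candidates))
        n_freq.append(len(frequent))
        k += 1
        freq_set = set(frequent)
        # Join step: merge pairs of frequent (k-1)-itemsets into k-itemsets.
        joined = {a | b for a, b in combinations(frequent, 2) if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = [c for c in joined
                      if all(frozenset(s) in freq_set for s in combinations(c, k - 1))]
    return n_cand, n_freq


for name, data in [("symmetric", binarize(ROWS)), ("asymmetric", asymmetric(ROWS))]:
    cand, freq = apriori_counts(data, 0.30)
    print(name, "max width:", max(len(t) for t in data))
    print(name, "candidates per level:", cand, "total:", sum(cand))
    print(name, "frequent per level:", freq, "total:", sum(freq))
```

With 12 transactions, a 30% threshold means an itemset must appear in at least 4 rows to be frequent. Note that the symmetric encoding lets the join step produce pairs such as Weather=Good with Weather=Bad, which can never occur together; whether those count as candidates is worth stating explicitly in the written answer.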
Format your homework according to the given formatting requirements:
• The answer must be typed in Times New Roman font (size 12), double-spaced, with one-inch margins on all sides.
• Include a cover page containing the student's name, the title of the homework, the course title, and the date. The cover page is not included in the required page length.
• Also include a reference page. The references and citations should follow APA format. The reference page is not included in the required page length.