Problem
As part of data preparation, treat the missing data and explain the rationale for each treatment.
i. Explain and implement three other data preparation tasks required for further analysis of the data. Any appropriate Python libraries, functions, or methods (e.g. pandas.to_datetime) can be used.
ii. Analyze the data and describe three insights into the corporate claims processing of the insurance company, with at least one supporting visualization created to illustrate each insight.
iii. Perform linear regression modelling to predict the delay in days (between the Planned and Actual dates) in processing the claims. Explain the approach taken, including any further data pre-processing needed for modelling.
iv. Discuss the results obtained from the modelling and state the linear regression equation.
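
A minimal sketch of how the date parsing, missing-data treatment, delay calculation, and regression equation might fit together. The dataset itself is not shown here, so the column names (`planned_date`, `actual_date`, `claim_amount`) and all values are hypothetical, and the imputation choices are only one reasonable option. The fit uses ordinary least squares via `numpy.linalg.lstsq` with a single predictor; a library such as scikit-learn could be used instead.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data: column names and values are illustrative assumptions,
# not taken from the actual claims dataset.
claims = pd.DataFrame({
    "planned_date": ["2023-01-05", "2023-01-10", "2023-02-01", "2023-02-15", "2023-03-01"],
    "actual_date": ["2023-01-07", "2023-01-10", None, "2023-02-20", "2023-03-04"],
    "claim_amount": [1200.0, 800.0, 1500.0, None, 2000.0],
})

# Parse date strings; missing or unparseable values become NaT.
for col in ["planned_date", "actual_date"]:
    claims[col] = pd.to_datetime(claims[col], errors="coerce")

# Missing-data treatment (one possible choice):
# impute the numeric predictor with its median, and drop rows
# where the target cannot be computed because a date is missing.
claims["claim_amount"] = claims["claim_amount"].fillna(claims["claim_amount"].median())
claims = claims.dropna(subset=["planned_date", "actual_date"]).copy()

# Target variable: delay in whole days between planned and actual dates.
claims["delay_days"] = (claims["actual_date"] - claims["planned_date"]).dt.days

# Ordinary least squares: delay_days = intercept + slope * claim_amount.
X = np.column_stack([np.ones(len(claims)), claims["claim_amount"].to_numpy()])
y = claims["delay_days"].to_numpy(dtype=float)
coef, *_ = np.linalg.lstsq(X, y, rcond=None)
intercept, slope = coef

# State the fitted regression equation.
print(f"delay_days = {intercept:.3f} + {slope:.5f} * claim_amount")
```

With more predictors, each one adds a column to `X` and a term to the printed equation; categorical columns would first need encoding (for example with `pandas.get_dummies`).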