Build models and investigate the importance of each variable, Basic Computer Science

Build models and investigate the importance of each variable

Energy Prediction of Domestic Appliances Dataset

The given dataset, "Energy20.txt", can be used to create models of energy use of appliances in a energy-efficient house. The dataset provides the Energy use of appliances (denoted as Y) using 671 samples. It is a modified version of data used in the study [1]. The dataset includes 5 variables, denoted as X1, X2, X3, X4, X5, and Y, described as follows:

X1: Temperature in kitchen area, in Celsius

X2: Humidity in kitchen area, given as a percentage

X3: Temperature outside (from weather station), in Celsius

X4: Humidity outside (from weather station), given as a percentage

X5: Visibility (from weather station), in km

Y: Energy use of appliances, in Wh

Tasks:

Task 1: Understand the data

(i) Download the txt file (Energy20.txt) from Future Learn and save it to your R working directory.

(ii) Assign the data to a matrix, e.g. using

the.data <- as.matrix(read.table("Energy20.txt "))

(iii) The variable of interest is Energy use of appliances (Y). To investigate Y, generate a subset of 350 data, e.g. using:

my.data <- the.data[sample(1:671,350),c(1:6)]

(iv) Using scatter plots and histograms, report on the general relationship between each of the variables X1, X2, X3, X4, X5 and the variable of interest Y. Include 5 scatter plots, 6 histograms, and 1 or 2 sentences for each of the variables, including the variable of interest Y.

Task 2: Transform the data

(i) Choose any four from the five variables (X1, X2,..,X5). Make appropriate transformations to the chosen four variables and the variable of interest Y so that the values can be aggregated in order to predict the variable of interest. Assign your transformed data along with your transformed variable of interest to an array (it should be 350 rows and 5 columns). Save it to a txt file titled "name- transformed.txt" using

write.table(your.data,"name-transformed.txt")

where "name" is replaced with your name - you can use your surname or first name.

(ii) Briefly explain the transformations applied for the selected four variables and the variable of interest. (1- 2 sentences each)\

Task 3: Build models and investigate the importance of each variable

(i) Download the AggWaFit718.R file to your working directory and load into the R workspace using, source ("AggWaFit718.R")

(ii) Use the fitting functions to learn the parameters for

A weighted arithmetic mean (WAM)
Weighted power means (WPM) with p = 0.5, and p = 5,
An ordered weighted averaging function (OWA), and
A Choquet integral.

(iii) Include two tables in your report - one with the error measures and correlation coefficients, and one summarizing the weights/parameters and any other useful information learned for your data.

(iv) Compare and interpret the data in your tables. Comment on

a. How good the model is,

b. The importance of each of the variables (the four variables that you have selected),

c. Any interaction between any of those variables (are they complementary or redundant?) and

d. Better models favour higher or lower inputs. (1-3 paragraphs for part 3(iv))

Task 4: Use your model for prediction

(i) Choose your best fitting model.

Using your best fitting model, predict the Energy use of appliances for the following input X1=17; X2=39; X3=4; X4=77; X5=32.

(ii) Give your result and comment on whether you think it is reasonable. (1-2 sentences).

(iii) Comment on the best conditions (in terms of your chosen four variables) under which a high Energy use of appliances will occur. (1-2 sentences).

Task 5: Comparing with a linear regression model

Linear regression is used to predict the value of an outcome variable Y based on one or more input predictor variables X. The equation is Y = β0 + β1X1 + β2X2 + ? βnXn + ∈. The built- in function lm() is used to fit linear models in R.

(i) Build your linear model using the same dataset in Question 3 and describe the summary statistics for your model using the function summary().

(ii) Compare the performance of the linear model you got with your best fitting model in Question 4. Visualise the predicted Y values of both models on the 350 data and compare them with the true Y values.

(iii) Give your comment on the differences between the linear model and your best fitting model. (2-4 sentences).

All supporting information should be presented in the pdf report. It will be assessed for style and grammar, professional presentation of figures, tables and references. List and quote in the text the references used, including books, articles and web resources.

Use the Harvard style

Attachment:- Data Analysis.rar

View Complete Question

Request for Solution File

Ask an Expert for Answer!!

Basic Computer Science: Build models and investigate the importance of each variable

Reference No:- TGS03058613

Expected delivery within 24 Hours

Have a Question? (oR Write a Review)

Write atleast 100 words!!

Request for Solution File

Ask an Expert for Answer!!

Basic Computer Science: Build models and investigate the importance of each variable

Reference No:- TGS03058613

Have a Question? (oR Write a Review)

Recent Questions Asked Basic Computer Science

Q : Define the five ethical issues that health care managers

Q : Design an air conditioning system for the room

Q : What ethical dilemma encountered in your nursing experience

Q : What spiritual considerations surrounding disaster can arise

Q : Build models and investigate the importance of each variable

Q : How professionals beliefs and values impact health care

Q : Discuss two common risk factors for polypharmacy

Q : Write a final report in the blog

Q : Analyze the christian perspective of nature of spirituality

Assign the most appropriate cpt procedure code

Finger-to-nose test allows assessment of what

Post a description of the healthcare organization website

Problem about healthcare organization reviewed

Discuss about purchased an electronic health record system

Nearing the end of indigenous health in canada

Potassium has which of the following effects

Request for Solution File

Ask an Expert for Answer!!

Basic Computer Science: Build models and investigate the importance of each variable

Reference No:- TGS03058613

Recent Questions Asked Basic Computer Science

Q : Define the five ethical issues that health care managers

Q : Design an air conditioning system for the room

Q : What ethical dilemma encountered in your nursing experience

Q : What spiritual considerations surrounding disaster can arise

Q : Build models and investigate the importance of each variable

Q : How professionals beliefs and values impact health care

Q : Discuss two common risk factors for polypharmacy

Q : Write a final report in the blog

Q : Analyze the christian perspective of nature of spirituality

Asked Questions