Main objective of this project is to do an analysis of a


PROJECT:

The main objective of this project is to do an analysis of a real life example. We must use all the skills we gotto do a good job. It's important to analyze thoroughly all the data, in order to achieve the best conclusion for making your client happy.

We've got 65 plots, and in each plot we're going to use fertilizers (variable X) to improve our production of Maize (variable Y).So we have two variables, and we are going to work with them. So the aim of this project is to decide if the use of this fertilizer is profitable and give a recommendation to the client as to how much of it should be used.

The Kg of fertilizer it cost us 26.5$, and the selling price of one tone of maize is 140$. We have to study all the results that the plots give us, to find if the fertilizer is worth it and to know how much fertilizer we should buy so that we don't waste it.

Variable X: Fertilizer (kg)

The fertilizer is the variable that will give us the answer if it's worth it or not, the investment that we will do. It will say to us the efficiency and profitability for the maize production.

Therefore we will introduce some new values in our dataset:

- The mean of fertilizer used in the 65 samples of dataset which has a value of 56.74 kg.

- The median that separates the biggest and the lowest half of our 65 samples: 53kg

- The variance, that will help us to get the Standard deviation, is 385.45

- The standard deviation, will tell us how tightly our data is clustered around the mean. 19.63

- The Interquartile range represents the 50% of the data. Removing the first and last quarters of the dataset remaining only its core will help us to get rid of the irregularities. In this case is 30=IQR

1833_Analyze the Y-value dataset.png

Looking our histogram, we can clearly see that our dataset remains mostly between 40-60kg of fertilizer used in every plot. Also we can see that the 2nd biggest portion of our dataset is between 60-80kg. Therefore the precision of our analysis will certainly remain in this range of data.

This is the analysis of the dataset of the variable X. Including Robust and non-Robust variables. Now we proceed to analyze the Y-value dataset.

Variable Y: Maize amount (tons)

The amount of Maize is the y-variable. We are going to use several measures to achieve the best answer such as:

- The mean is 227.22 tons of maize produced in our 65 plots.

- The median is 228, which divide the 65 samples in two halves.

Both values are very close to each other, which help us to determine that there is not a high amount of irregularities.

The IQR is 5. This interquartile range is very low.

1693_Analyze the Y-value dataset1.png

We can observe that the production is positioned between 225-230 tons, but we can see a range where is most of the production that is between 225-235 tons. Now after analyzing each variable separately, we are going to compare them.

Study of Variables X and Y.

1483_Analyze the Y-value dataset2.png

Request for Solution File

Ask an Expert for Answer!!
Basic Statistics: Main objective of this project is to do an analysis of a
Reference No:- TGS01145015

Expected delivery within 24 Hours