Regression and correlation. Pick any two columns that have a correlation coefficient greater than 0.6 or less than -0.6. Make sure to pick the one with the highest absolute value.
a. Draw the scatter diagram of Y against X, and describe any noted significance.
b. Compute correlation coefficient (ρ or r), and what do you find? Make sure to describe thoroughly what you mean.
c. Obtain a and b of the regression equation defined as Y = a + b X, and the Coefficient of Determination (r2) from the Excel regression output, what can you tell? What is the relationship between r2 and ρ?
d. Compute the above statistics in 4) step by step using SXiYi, SXi, SYi, SXi2, SYi2 from Excel, and compare them with the results in C).
e. Draw the fitted regression line on the scatter diagram, obtain the residuals and plot them on the scatter diagram too. describe your findings.
f. prepare a paragraph or so on any observations you may have on the data, regression estimates or the regression residuals;
g. find out the additional y values for at least five other x values that do not appear in our data. Include that information in your report above and comment on whether you believe the find out y value seems realistic and consistent with the other information you have find outd in each of the parts above..