This assignment data file contains 1500 houses sold in Stockton, California, during1996--1998.
The variable descriptions are as follows:
• Sprice = Selling price of home, dollars
• Livarea = living area, hundreds of square feet
• Age = age of home at time of sale, years
• Baths = number of bathrooms
• Beds = number of bedrooms
• Pool = 1 if home has pool, 0 otherwise
• Lgelot =1 if lot size > 0.5 acres, 0 otherwise.
Model 1:
Sprice = δ_{1} +δ_{2}livarea+ δ_{3}Age+ δ_{4}Beds+δ_{5}Baths + e
Model 2:
Sprice =β_{1} + β_{2}livarea + β_{3}age + β_{4}Beds+ β_{5}Baths + β_{6}livarea^{2} + β_{7}age^{2}+ ε
Model 3:
ln Sprice = α_{1} + α_{2}livarea + α_{3}livarea^{2}+ α_{4}age+ α_{5}age^{2} + α_{6}Beds+ ε ln=natural log
problem1) Plot each of Sprice and Age in a X--Y scatter plot and comment on their pattern.
(To obtain XY scatter plot in Gretl choose “View” and “Graph specified vars” and “X--Y scatter” and select the variables to the relevant boxes)
Model 1
problem2) Estimate Model 1 and report results. Do the signs of the estimates agree with your expectations? Describe.
problem3) Using Model1, test null hypothesis that each individual coefficient is equal to zero against alternative that it is not, at the 5% significance level and comment on your findings
problem4) Consider two houses that have the living areas of the same size, same number of bathrooms, and same number of bedrooms, but one is two years old and the other is ten years old. How much difference in the prices must an investor expect between two houses according to Model 1? Construct a 95% confidence interval for this difference in the prices and interpret your result
problem5) Test overall significance of the model at the 1% significance level. Interpret the test result.
problem6) A family of four children owns a house with a living area of 2,000 square feet (i.e. Livarea = 20). They are now considering an extension of living area by 200 square feet. How much will this extension be expected to increase price of the house? Test a hypothesis, at the 5% significance level, that increase in the price would be equal to $20,000 against it is more than $20,000.
Model 2
problem7) Estimate Model 2 and use an F-- test to test that Livearea^{2} and Age^{2} are significant variables in the model? Use the 5% significance level and comment on your results.
Model 3
problem8) Estimate Model 3 and comment on your results.
problem9) Use Model 3 to predict the price of a 10--year--old house with a living area of 2,000 square feet, and three bedrooms. Comment on your answer
Models 2 and 3
problem10) Compare the results of Model 2 and Model 3 and select a preferred model. Give reasons for your choice.