Assignment title: Information


ECON 7310: ELEMENTS OF ECONOMETRICS Dr. Dong-Hyuk Kim Tutorial 4: Multiple Linear Regression, Part A At the end of this tutorial you should be able to • understand and explain the assumptions behind the multiple regression model; • explain the properties of the least squares estimators; • compute and interpret the least squares regression coefficients; • estimate the variances and covariances of the least squares estimators; • conduct hypothesis tests and construct confidence intervals concerning the regression coefficients; • use the estimated regression model to generate predictions; and • compute and explain measures of goodness-of-fit. Problems: 1. (Based on PE4 Ex.5.24) The file rice.dta contains 352 observations on 44 rice farmers in the Tarlac region of the Phillipines for the 8 years 1990 to 1997. Variables in the data set are tonnes of freshly threshed rice (prod), hectares planted (area), person-days of hired and family labour (labor) and kilograms of fertiliser (fert).Treat the data set as one sample with N = 352 and answer the following questions. Estimate the production function log(prod) = β1 + β2 log(area) + β3 log(labor) + β4 log(fert) + e (a) Report the results and interpret the estimates. What proportion of the variation in production is explained by the estimated regression model. (Answer) First, we need to define new variables Then, we run the regression as follows; 1All estimates have elasticity interpretations. For example, a 1% increase in labour will lead to a 0.4328% increase in rice output. A 1% increase in fertiliser will lead to a 0.2095% increase in rice output. Note that this is log − log linear model. Hence, prod [ = exp log( \ prod) = exp (b1 + b2 log(area) + b3 log(labor) + b4 log(fert)) where (b1; b2; b3; b4) are the OLS estimates. Then, the generalised R2 = 0:9122 = 0:8318.  (b) Comment on the statistical significance of the individual coefficient estimates. (Answer) We say that the coefficient estimate bk is significant if we reject the hypothesis H0 : βk = 0. For this hypothesis testing, the decision rule is to reject H0 if the test statistics bk − 0 se(bk) is larger than the critical value, which depends on the level α of test. The t-value in the third column of computer outcome represents this ratio. All of them are larger than the commonly used critical values (e.g., 1.96 when α = 0:05). Hence, all the estimates are statistically significant. Alternatively, we could use the p values for testing a hypothesis. Decision rule is simple; we reject H0 : β0 = 0 if p-value is less than α. The fourth column of the output shows the p values, which 2are all essentially zero. Hence, for any conventionally used level (e.g., α = 0:05 or 0.01), the estimates are all statistically significant.  (c) Using a 5% level of significance, test the hypothesis that the elasticity of production with respect to land is equal to 0.5. (Answer) We test H0 : β2 = 0:5 against H1 : β2 6= 0:5 at the level of α = 0:05. We reject H0 because b2 − 0:5 se(b2) = 0:3617359 − 0:5 0:0639678 = 2:1615 > 1:96  (d) Find a 95% interval estimate for the elasticity of production with respect to fertiliser. Has this elasticity been precisely measured? (Answer) Even before we construct the confidence interval, we know the estimate is precise because it is statistically significant. The confidence interval is given as b4 ± 1:96 × se(b4) = 0:2095 ± 1:96 × 0:0382654 = [0:1345; 0:2845]:  (e) Test the significance of the regression model. Use α = 0:05 (Answer) We test H0 : β2 = β3 = β4 = 0 against H1 : H0 is wrong, i.e., β2 6= 0 or β3 6= 0 or β4 6= 0. The Stata outcome shows the F statistic for this testing, F = 646:51. Moreover, the associated p-value is zero. So, we reject H0.  (f) The production technology exhibits constant returns to scale if β2 + β3 + β4 = 1. Test the null hypothesis of constant returns to scale using a 5% level of significance. (Answer) After running the regression in part (a), click on Statistics ! Postestimation ! Tests ! Test linear hypotheses. Then, you will Click on Create and choose Linear expressions are equal in Test type: 3In the Coefficient: section, you can add coefficients. Add the coefficients associated with log area, log labor, and log fert. Then, since the defaulted relationship between coefficients is =, you will have Edit the Linear expression: as follows; 4Then, click on Ok and Submit. Then, the result will show up as follows; F-statistic is 0.03, which is very small. Moreover, the p-value is 0:8638 > α = 0:05. So, we do not reject H0. The hypothesis of constant returns to scale cannot be rejected at the 5% significance level.  (g) Predict rice production from 1 hectare of land, 50 person-days of labour and 100 kg of fertiliser. (Answer) log( \ prod) = −1:5468 + 0:3617 log(1) + 0:4328 log(50) + 0:2095 log(100) = 1:111107 Therefore, prod [ = exp(1:111107) = 3:0377.  (h) Your economic principles suggest that a farmer should continue to apply fertiliser as long as the marginal product of fertiliser @prod=@fert is greater than the price of fertiliser divided by the price of output. Suppose that this price ratio is 0.004. For fert = 100 and the predicted value for prod found in part g), show that the farmer should continue to apply fertiliser as long as β4 > 0:1242: (Answer) The model can be written as prod = exp (β1 + β2 log(area) + β3 log(labor) + β4 log(fert) + e) Then, @prod @fert = @ @fert exp (β1 + β2 log(area) + β3 log(labor) + β4 log(fert) + e) = exp (β1 + β2 log(area) + β3 log(labor) + β4 log(fert) + e) × @fert @ β4 log(fert) = exp (log(prod)) × β4 fert 1  = β4 × prod fert: Now, the marginal principle says we should apply more fertiliser if @prod @fert > Price of fert Price of prod = 0:004: Since @prod @fert = β4 × prod fert; we should apply more fertiliser if β4 × prod fert > 0:004 5When prod = 3:0377 and fert = 100, this condition becomes β4 > 0:1317  (i) Using H1 : β4 > 0:1317 as the alternative hypothesis, test whether the farmer should apply more fertiliser. Use a 5% level of significance. Why did we choose as the alternative hypothesis? (Answer) We test H0 : β4 = 0:1317 against H1 : β4 > 0:1317. For this one-sided test, we would reject H0 when the estimate b4 is too large. More specifically, we reject H0 when b4 − 0:1317 se(b4) = 0:2095 − 0:1317 0:0383 = 2:03 is larger than the critical value of the test with level of α = 0:05. From the probability table of the standard normal distribution, we can find that Pr(Z < 1:645) = 0:95, So, the critical value is 1.645, which is smaller than the test statistic 2.03. Therefore, we reject H0. We conclude that at the 5% significance level, the farmer should apply more fertiliser. We chose a one-sided alternative hypothesis following the marginal principle in part (h).  (j) Find a 95% interval estimate for rice production from 1 hectare of land, 50 person-days of labour and 1 kg of fertiliser. (Answer) Recall that log(1) = 0. We can compute log(50) by typing display log(50) in the Command window. The Stata would give you log(50) = 3:912023. Now, re-run the regression in part (a) and run adjust command as shown below; We see that the predicted value of log(prod) = 0:146525 with prediction standard error of se(f) = 0:37938, which gives the 95% prediction interval of [−0:599641; 0:892691]. Since this is the interval estimate for log(prod), the interval estimate for prod is [exp(−0:599641); exp(0:892691)] = [0:5490; 2:4417]. (The Stata command adjust must follow an estimation command.) 6