2
Mar 17

## Durbin-Wu-Hausman test

### Setup

Consider estimation of the slope in simple regression

(1) $y_i=a+bx_i+e_i$

assuming that $x_i$ is stochastic. The framework is the same as in the second (large sample) approach to stochastic regressors: the sample size goes to infinity. We suppose that

$Var(x)\neq 0$ (OLS estimator existence condition)

and there is an instrument $z$ for $x$ that satisfies the usual conditions:

(2) $Cov(z,x)\neq 0$ (IV existence condition)

and

(3) $Cov(z,e)=0$ (IV consistency condition).

### What we know about OLS and IV

 OLS estimator consistency condition Consequences Valid: $Cov(x,e)=0$$Cov(x,e)=0$ Both OLS and IV are consistent but OLS is more efficient by the Gauss-Markov theorem Not valid: $Cov(x,e)\ne 0$$Cov(x,e)\ne 0$ (endogeneity problem) OLS is inconsistent and IV is consistent

### Two formulations of the null and alternative hypotheses

The next two formulations are based on Table 1.

Simple formulation. Null hypothesis: no endogeneity problem (OLS can be used; using IV is not advisable); alternative hypothesis: there is endogeneity problem (OLS cannot be used and IV can).

General formulation. We have two competing estimators: main estimator $\hat{b}_{main}$ (think OLS) and alternative estimator $\hat{b}_{alt}$ (think IV). Null hypothesis: Both are consistent but $\hat{b}_{main}$ is more efficient; alternative hypothesis: $\hat{b}_{main}$ is inconsistent and $\hat{b}_{alt}$ is consistent.

The format of the test statistic requires knowledge of matrix algebra and is skipped; in statistical packages, you need only to find the p-value of the Durbin-Wu-Hausman statistic (it is distributed as chi-square). The second formulation allows for a more general interpretation of the Durbin-Wu-Hausman test by comparing an IV estimator with a smaller set of instruments to an IV estimator with a wider set of instruments.

The test is also called a Hausman specification test, because the endogeneity problem may be a consequence of a wrong model specification (the cause may be, for example, omission of relevant variables). If the null of no endogeneity is rejected, the researcher might want to modify the model, instead of using the IV estimator.