Skip to content
Licensed Unlicensed Requires Authentication Published by De Gruyter April 2, 2009

Selecting Instrumental Variables in a Data Rich Environment

  • Serena Ng and Jushan Bai

Practitioners often have at their disposal a large number of instruments that are weakly exogenous for the parameter of interest. However, not every instrument has the same predictive power for the endogenous variable, and using too many instruments can induce bias. We consider two ways of handling these problems. The first is to form principal components from the observed instruments, and the second is to reduce the number of instruments by subset variable selection. For the latter, we consider boosting, a method that does not require an a priori ordering of the instruments. We also suggest a way to pre-order the instruments and then screen the instruments using the goodness of fit of the first stage regression and information criteria. We find that the principal components are often better instruments than the observed data except when the number of relevant instruments is small. While no single method dominates, a hard-thresholding method based on the t test generally yields estimates with small biases and small root-mean-squared errors.

Published Online: 2009-4-2

©2011 Walter de Gruyter GmbH & Co. KG, Berlin/Boston

Downloaded on 19.3.2024 from https://www.degruyter.com/document/doi/10.2202/1941-1928.1014/html
Scroll to top button