Best-subset instrumental variable selection method using mixed integer optimization with application
Wednesday, Aug 6: 2:20 PM - 2:35 PM
1710
Contributed Papers
Music City Center
The classical best-subset selection method has been demonstrated to be nondeterministic polynomial-time hard and thus presents computational challenges. This problem can now be solved via advanced mixed integer optimization (MIO) algorithms for linear regression. We extend this methodology to linear instrumental variable (IV) regression and propose the best-subset instrumental variable (BSIV) method incorporating the MIO procedure. Classical IV estimation methods assume that IVs must not directly impact the outcome variable and should remain uncorrelated with nonmeasured variables. However, in practice, IVs are likely to be invalid, and existing methods can lead to a large bias relative to standard errors in certain situations. The proposed BSIV estimator is robust in estimating causal effects in the presence of unknown IV validity. We demonstrate that the BSIV using MIO algorithms outperforms two-stage least squares, Lasso-type IVs, and two-sample analysis (median and mode estimators) through Monte Carlo simulations in terms of bias and relative efficiency. We analyze two datasets involving the health-related quality of life index and proximity and the education–wage relationship
Causal inference
Instrumental variables
Mendelian randomization
Best-subset selection
Mixed integer programming
Variable selection
Main Sponsor
Section on Statistics in Epidemiology
You have unsaved changes.