Best-subset instrumental variable selection method using mixed integer optimization with application

Kristofer Månsson Co-Author
Jönköping University
 
Narayanaswamy Balakrishnan Co-Author
McMaster University
 
Muhammad Qasim First Author
Lund university
 
Muhammad Qasim Presenting Author
Lund university
 
Wednesday, Aug 6: 2:20 PM - 2:35 PM
1710 
Contributed Papers 
Music City Center 
The classical best-subset selection method has been demonstrated to be nondeterministic polynomial-time hard and thus presents computational challenges. This problem can now be solved via advanced mixed integer optimization (MIO) algorithms for linear regression. We extend this methodology to linear instrumental variable (IV) regression and propose the best-subset instrumental variable (BSIV) method incorporating the MIO procedure. Classical IV estimation methods assume that IVs must not directly impact the outcome variable and should remain uncorrelated with nonmeasured variables. However, in practice, IVs are likely to be invalid, and existing methods can lead to a large bias relative to standard errors in certain situations. The proposed BSIV estimator is robust in estimating causal effects in the presence of unknown IV validity. We demonstrate that the BSIV using MIO algorithms outperforms two-stage least squares, Lasso-type IVs, and two-sample analysis (median and mode estimators) through Monte Carlo simulations in terms of bias and relative efficiency. We analyze two datasets involving the health-related quality of life index and proximity and the education–wage relationship

Keywords

Causal inference

Instrumental variables

Mendelian randomization

Best-subset selection

Mixed integer programming

Variable selection 

Main Sponsor

Section on Statistics in Epidemiology