Doubly regularized generalized linear models for spatial data with high-dimensional covariates

Si Cheng Co-Author
University of Washington
 
Ali Shojaie Co-Author
University of Washington
 
Arjun Sondhi First Author
 
Arjun Sondhi Presenting Author
 
Wednesday, Aug 6: 8:35 AM - 8:50 AM
1901 
Contributed Papers 
Music City Center 
A discrete spatial lattice can be cast as a network structure over which spatially-correlated outcomes are observed. A second network structure may also capture similarities among measured features, when such information is available. Incorporating the network structures when analyzing such doubly-structured data can improve predictive power, and lead to better identification of important features in the data-generating process. Motivated by applications in spatial disease mapping, we develop a new doubly regularized regression framework to incorporate these network structures for analyzing high-dimensional datasets. Our estimators can be easily implemented with standard convex optimization algorithms. In addition, we describe a procedure to obtain asymptotically valid confidence intervals and hypothesis tests for our model parameters. We show empirically that our framework provides improved predictive accuracy and inferential power compared to existing high-dimensional spatial methods. These advantages hold given fully accurate network information, and also with networks which are partially misspecified or uninformative.

Keywords

high-dimensional data

penalized regression

spatial data

networks 

Main Sponsor

Biometrics Section