Random-projection ensemble dimension reduction

Timothy Cannings Co-Author
University of Edinburgh
 
Wenxing Zhou First Author
 
Wenxing Zhou Presenting Author
 
Thursday, Aug 7: 11:50 AM - 12:05 PM
1706 
Contributed Papers 
Music City Center 
We propose a framework for dimension reduction in high-dimensional regression, by aggregating an ensemble of random projections selected based on empirical regression performance. Specifically, we consider disjoint groups of independent random projections, apply a base regression method after each projection is appied to the covariates, and retain the best-performing projection in each group. The selected projections are aggregated by taking the SVD of their empirical average, yielding the leading singular vectors. Notably, the singular values indicate the importance of the corresponding projection directions, aiding in selecting the final projection dimension. We provide recommendations on aspects of our framework, including the projection distribution, base regression method, and the number of random projections. Additionally, we explore further dimension reduction by applying our algorithm twice when the initially recommended dimension is too large. Our theoretical results show that the error of algorithm stabilises as the number of projection groups increases. We demonstrate our proposal's strong empirical performance through an extensive study using simulated and real data.

Keywords

High-dimensional

mean central subspace

random projection

singular value decomposition

sufficient dimension reduction 

Main Sponsor

Section on Statistical Learning and Data Science