High-Dimensional Matching with Genetic Algorithms

Kwonsang Lee Co-Author
Seoul National University
 
Hajoung Lee First Author
Seoul National University
 
Hajoung Lee Presenting Author
Seoul National University
 
Tuesday, Aug 5: 9:35 AM - 9:50 AM
1859 
Contributed Papers 
Music City Center 
Matching in observational studies estimates causal effects by balancing covariate distributions between treated and control groups. Traditional methods rely on pairwise distances, but in high-dimensional, low-sample size settings, the curse of dimensionality makes it difficult to distinguish observations. To address this, we propose a novel matching method using genetic algorithms, shifting focus from individual- to group-level distances. Our method improves causal effect estimation by optimizing the similarity of high-dimensional joint covariate distributions. This approach has key advantages: (1) it avoids dimension reduction, preserving full covariate information without additional modeling; (2) it maintains transparency by not relying on outcomes, akin to traditional matching; and (3) it is robust in low-sample size settings, where traditional methods may struggle. Moreover, our results show the proposed method is competitive with existing approaches even in low-dimensional cases. Through simulations and real data applications, we validate its performance, offer practical guidance, and highlight its potential as a tool for causal inference in high- and low-dimensional settings.

Keywords

Matching

High-dimensional data

Genetic algorithms

Covariate balance

Low-sample size settings

Causal inference 

Main Sponsor

Korean International Statistical Society