50: Generalized Tree-Informed Mixed Model Regression

Xin Jin Co-Author
The University of Tampa
 
Riddhi Pratim Ghosh Co-Author
Bowling Green State Universty
 
Jeremiah Allis First Author
 
Jeremiah Allis Presenting Author
 
Tuesday, Aug 5: 10:30 AM - 12:20 PM
1910 
Contributed Posters 
Music City Center 
The standard regression tree method applied to observations within clusters poses both methodological and implementation challenges. Effectively leveraging these data requires methods that account for both individual-level and sample-level effects. We propose Generalized Tree-Informed Mixed Model (GTIMM), which replaces the linear fixed effect in a generalized linear mixed model (GLMM) with the output of a regression tree. Traditional parameter estimation and prediction techniques, such as the expectation-maximization algorithm, scale poorly in high-dimensional settings, creating a computational bottleneck. To address this, we employ a quasi-likelihood framework with stochastic gradient descent for optimized parameter estimation. Additionally, we establish a theoretical bound for the mean squared prediction error. The predictive performance of our method is evaluated through simulations and compared with existing approaches. Finally, we apply our model to predict country-level GDP based on trade, foreign direct investment, unemployment, inflation, and geographic region.

Keywords

tree based regression

clustered data

mixed effects

penalized quasi-likelihood

stochastic gradient descent

prediction 

Main Sponsor

Section on Statistical Learning and Data Science