20: The Kernel Regression Tree-Exploring Aggregations Estimator for Microbiome Analysis

Y. Samuel Wang Co-Author
Cornell University
 
Martin Wells Co-Author
Cornell University
 
Sithija Manage First Author
Cornell University
 
Sithija Manage Presenting Author
Cornell University
 
Monday, Aug 4: 10:30 AM - 12:20 PM
2221 
Contributed Posters 
Music City Center 
We introduce a novel Kernel Regression estimator, Kernel Regression with Tree-Exploring Aggregations (KR TEXAS), that learns a distance metric while allowing feature aggregation along a predefined tree structure. This approach is particularly relevant for microbiome analysis, where data is often collected at multiple taxonomic levels, and determining the appropriate level of aggregation is non-trivial. Unlike traditional aggregation methods that rely on uniform taxonomic levels, KR TEXAS leverages an L1-penalized distance metric to selectively aggregate features based on their importance, leading to biologically interpretable results. Our method extends prior work on metric learning and nonparametric regression, incorporating structured feature aggregation to improve predictive accuracy and interpretability. We demonstrate the utility of KR TEXAS through both simulations and real microbiome datasets, highlighting its advantages in capturing functional relationships that may be missed by conventional aggregation techniques.

Keywords

Microbiome

Compositional Data

Kernel Regression

Metric Learning 

Abstracts


Main Sponsor

Biometrics Section