Predictive Modeling of Microbiome Data with Interaction Effects

Christian L. Müller Co-Author
helmholtz Munich
 
Jacob Bien Co-Author
University of Southern California
 
Mara Stadler First Author
Helmholtz Center Munich
 
Mara Stadler Presenting Author
Helmholtz Center Munich
 
Tuesday, Aug 6: 9:45 AM - 9:50 AM
3526 
Contributed Speed 
Oregon Convention Center 
In microbiome research, predicting an outcome of interest from microbial abundances via sparse regression models is a common task. However, models linear in the features might be too simple to capture dynamics in communities, as microbial species tend to interact with one another. To address this, we propose a framework that includes strategies for modeling interaction effects in presence-absence data of microbial species, absolute abundance data, and compositional microbial 16S rRNA sequencing data, where only relative abundance information is available. Our framework incorporates an extension of the constrained lasso for compositional data to interaction effects as well as the statistical concept of hierarchy to enhance the interpretability of interaction effects. Based on synthetic data, we demonstrate the conditions under which true effects can be statically detected, considering varying sparsity of features and varying noise levels. For a selection of real-world microbiome datasets, we show that robust interaction effects between microbial species can be detected and the predictive accuracy can be improved when modeling interaction effects compared to merely additive effects.

Keywords

interaction modeling

microbial interactions

compositional data

sparsity

lasso

hierarchical interactions 

Main Sponsor

Biometrics Section