HighDimMixedModels.jl: Robust high-dimensional mixed-effects models across omics data

Rosa Aghdam Co-Author
University of Wisconsin-Madison
 
Claudia Solis-Lemus Co-Author
University of Wisconsin-Madison
 
Evan Gorstein First Author
 
Evan Gorstein Presenting Author
 
Wednesday, Aug 6: 2:50 PM - 3:05 PM
2604 
Contributed Papers 
Music City Center 
High-dimensional mixed-effects models are an increasingly important form of regression in which the number of covariates rivals or exceeds the number of samples, which are collected in groups or clusters. The penalized likelihood approach to fitting these models relies on a coordinate descent algorithm that lacks guarantees of convergence to a global optimum. Here, we empirically study the behavior of this algorithm on simulated and real examples of three types of data that are common in modern biology: transcriptome, genome-wide association, and microbiome data. Our simulations provide new insights into the algorithm's behavior in these settings, and, comparing the performance of two popular penalties, we demonstrate that the smoothly clipped absolute deviation (SCAD) penalty consistently outperforms the least absolute shrinkage and selection operator (LASSO) penalty in terms of both variable selection and estimation accuracy across omics data. To empower researchers in biology and other fields to fit models with the SCAD penalty, we implement the algorithm in a Julia package, HighDimMixedModels.jl.

Keywords

Coordinate descent

Penalized likelihood

Mixed-effects

Omics

Variable selection 

Main Sponsor

Section on Statistics in Genomics and Genetics