Cutting-Edge Advances in Graphical Models and High-Dimensional Methods

Yin Tang Chair
 
Benjamin Roycraft Discussant
University of Florida
 
Benjamin Roycraft Organizer
University of Florida
 
Monday, Aug 4: 10:30 AM - 12:20 PM
0831 
Topic-Contributed Paper Session 
Music City Center 
Room: CC-101C 

Applied

Yes

Main Sponsor

Section on Statistical Learning and Data Science

Co Sponsors

IMS
Section on Statistical Computing

Presentations

WITHDRAWN A framework for confidence bands for signal counts in purely high dimensions

Co-Author

Peter Kramlinger, University of California Davis

Covariate-Assisted Bayesian Graph Learning for Heterogeneous Data

In a traditional Gaussian graphical model, data homogeneity is routinely assumed with no extra variables affecting the conditional independence. In modern genomic datasets, there is an abundance of auxiliary information, which often gets underutilized in determining the joint dependency structure. In this article, we consider a Bayesian approach to model undirected graphs underlying heterogeneous multivariate observations with additional assistance from covariates. Building on product partition models, we propose a novel covariate-dependent Gaussian graphical model that allows graphs to vary with covariates so that observations whose covariates are similar share a similar undirected graph. To efficiently embed Gaussian graphical models into our proposed framework, we explore both Gaussian likelihood and pseudo-likelihood functions. For Gaussian likelihood, a G-Wishart distribution is used as a natural conjugate prior, and for the pseudo-likelihood, a product of Gaussian-conditionals is used. Moreover, the proposed model has large prior support and is flexible to approximate any ν-Ḧolder conditional variance-covariance matrices with ν ∈ (0,1]. We further show that based on the theory of fractional likelihood, the rate of posterior contraction is minimax optimal assuming the true density to be a Gaussian mixture with a known number of components. The efficacy of the approach is demonstrated via simulation studies and an analysis of a protein network for a breast cancer dataset assisted by mRNA gene expression as covariates. 

Speaker

Yang Ni, Texas A&M University

WITHDRAWN/DUPLICATE Hub Estimation in Gaussian Graphical Models


Identifying Common Hubs in Multiple Gaussian Graphical Models

The Gaussian graphical model (GGM) is a useful machine learning and statistics tool to represent relationships of conditional dependence among variables. In many real-world applications, datasets often contain multiple related sub-populations, whose associated GGMs may have common structure. In such cases, it is useful to recover common hub variables, which refers to variables that are highly connected in the GGMs of all sub-populations. In this paper, we propose the Joint Inverse Components for Hub Detection (JIC-HD) method to recover common hubs across multiple GGMs without the need of estimating any GGM explicitly. To this end, we define a notion of joint minimax eigenvectors for multiple precision matrices corresponding to the GGMs, and show that these vectors can recover common hubs in GGMs. We establish theoretical guarantees of common hub recovery in terms of true positive rate (TPR) for our proposed JIC-HD method. Our numerical studies further confirm the superior performance of the JIC-HD in terms of common hub recovery and computational time when compared to several existing methods. 

Keywords

Gaussian graphical model (GGM 

Speaker

Jose Angel Sanchez Gomez, UC-Riverside