CS3c: Panel: Advanced Modeling for Spatial and Longitudinal Health and Environmental Data

Conference: Women in Statistics and Data Science 2025
11/13/2025: 11:45 AM - 1:15 PM EST
Panel 

Description

This session highlights cutting-edge statistical methods for the analysis of complex spatial, temporal, and longitudinal data, with application of these methods to address urgent public health and environmental challenges. Presentations span a range of applications, including the opioid crisis in the U.S., dengue transmission in Ecuador, sea surface temperature projections over the Great Barrier Reef, and the progression of rare immune disorders. Talks feature Bayesian hierarchical and spatio-temporal models to integrate misaligned and multi-scale data, downscale high-resolution climate projections, and uncover nuanced disease dynamics often hidden in aggregate data. Collectively, the session emphasizes how advanced modeling approaches can yield actionable insights for targeted environmental and public health interventions and policy.

Keywords

Spatial

Longitudinal

Public health

disease surveillance 

Organizer

Staci Hepler, Wake Forest University

Chair

Ashley Mullan, Vanderbilt University

Target Audience

Mid-Level

Tracks

Knowledge
Women in Statistics and Data Science 2025

Presentations

Integrating data at multiple spatial scales to estimate the local burden of the opioid syndemic

The opioid epidemic has been particularly severe in Ohio, prompting significant efforts to understand its spatial patterns, mainly using available data at the county level. However, relying solely on county-level analysis can overlook crucial information relevant to localized effects. To address this, we integrate spatially misaligned data observed at the county and ZIP code levels to explore the complex interaction of five opioid-related outcomes, providing a more detailed local understanding of the opioid epidemic. We demonstrate how to map ZIP-code level data to ZIP-code Tabulation Areas (ZCTAs) and relate the county-level and ZCTA-level outcomes to a spatially correlated latent factor. The latent factor is defined on the intersection of the misaligned areal units, which provides a more granular understanding of the opioid epidemic. Furthermore, this approach allows us to identify areas with varying levels of opioid burden and reveals local regions with relatively high burden that county-level analyses might miss. Finally, we highlight the need for careful consideration when relying solely on ZIP code level data for naloxone, as it may lead to misinterpretations, particularly in rural regions. 

Speaker

Eva Murphy, Wake Forest University

Estimating spatially-varying dengue force of infection in Ecuador using age-specific case data

Dengue is one of the most prevalent mosquito-borne diseases affecting humans. While it was typically concentrated mostly in areas of the world with tropical climates, with increasing temperature, it has started making its appearance also in the United States. One of the key parameters that epidemiologists use to characterize an infectious disease is the Force of Infection (FOI), the instantaneous rate at which susceptible individuals become infected. Force of Infection is typically represented as a function of age of the individuals and of time (e.g. calendar year). Approaches to estimate FOI from serological data have been presented and discussed in the literature since the 90's. In this talk, we discuss how to estimate FOI from prevalence data. Using prevalence data from multiple areal units in coastal Ecuador, we present a Bayesian hierarchical model to estimate parish-specific FOI's, which in turn allow us to investigate how parish-level characteristics related to urbanization influence dengue's FOI. 

Speaker

Veronica Berrocal, University of California, Irvine

Tackling Statistical Challenges in Longitudinal Analysis of Idiopathic CD4 Lymphocytopenia

Idiopathic CD4 lymphocytopenia (ICL) is a rare, enigmatic syndrome characterized by CD4 lymphopenia in the absence of any known cause of immunodeficiency. This talk addresses statistical considerations in modeling a natural history ICL study, which aimed to evaluate the clinical, genetic, immunologic, and prognostic characteristics of 91 patients with ICL over an 11- year period. I will discuss statistical challenges in modeling a rich and complex dataset including the handling of longitudinal measurements via linear mixed-models to evaluate T-cell count trajectories, and logistic regression to assess the prognostic value of T-cell counts, using tertiles derived from baseline measurements. Multiple outputation was applied to the logistic regression to handle the longitudinal nature of the data, allowing us to use methods designed for independent data in a repeated-measures context. Additionally, we compared the observed mortality and cancer prevalence in the ICL cohort to expected rates derived from the general
population in the Surveillance, Epidemiology, and End Results (SEER) Program. Our findings revealed that ICL continued to be associated with increased susceptibility to viral, encapsulated fungal, and mycobacterial diseases, as well as with a reduced response to novel antigens and an increased risk of cancer.
 

Speaker

Ana Ortega-Villa, National Institute of Allergy and Infectious Diseases

Climate Model Downscaling: Spatial Models and Uncertainty Quantification

We present a statistical downscaling framework for generating fine-resolution climate projections by integrating high-resolution remote sensing data with coarse-resolution outputs from Earth system models. The proposed multivariate spatial statistical model leverages a basis function representation to enable scalable computation while accommodating potentially nonstationary spatial dependence. We apply the method to downscale sea surface temperature (SST) projections over the Great Barrier Reef region using CMIP6 model outputs. Results demonstrate substantial reductions in mean squared predictive error relative to state-of-the-art approaches, while also providing full predictive distributions that support comprehensive uncertainty quantification. 

Speaker

Emily Kang, University of Cincinnati