A sandwich smoother for spatio-temporal functional data

Conference: Symposium on Data Science and Statistics (SDSS) 2023
05/24/2023: 3:55 PM - 4:00 PM CDT
Lightning 

Description

Statistical analysis of spatio-temporal data has evolved over time to handle increasingly large data sets. E.g., the North American CORDEX program is producing daily values of climate-related variables on spatial grids with approximately 100,000 locations over 150 years. Smoothing of such massive and noisy data is essential to understanding their spatio-temporal features. It also reduces the size of the data by representing them in terms of suitable basis functions, which facilitates further computations and statistical analysis. Traditional tensor-based methods break down under the size of such massive data. We develop a penalized spline method for representing such data using a generalization of the sandwich smoother proposed by Xiao et al. (2013). Unlike the original method, our generalization treats the spatial and temporal dimensions distinctly and allows the methodology to be directly applied to non-gridded data. Additionally, this new method can exploit parallel computing architectures. We demonstrate the practicality of the methodology using both simulated and real data. The new smoother, as well as the original sandwich smoother, are implemented in the hero R package.

Keywords

massive data

climate data

spatio-temporal data

smoothing

parallel computing 

Presenting Author

Joshua French, University of Colorado-Denver

First Author

Joshua French, University of Colorado-Denver

CoAuthor

Piotr Kokoszka, Colorado State University

Target Audience

Expert

Tracks

Computational Statistics
Symposium on Data Science and Statistics (SDSS) 2023