Federated generalized linear mixed model based on one-time shared summary statistics

Christel Faes Co-Author
Hasselt University
 
Niel Hens Co-Author
Hasselt University
 
Niel Hens Co-Author
Antwerp University
 
Marie Analiz April Limpoco First Author
Hasselt University
 
Marie Analiz April Limpoco Presenting Author
Hasselt University
 
Thursday, Aug 7: 11:20 AM - 11:50 AM
1226 
Contributed Papers 
Music City Center 

Description

Data privacy has increasingly become a daunting challenge because it limits data availability, which is essential in estimating statistical models such as generalized linear models. Access to personal data often involves considerable time, effort, and paperwork, which can impede research progress and collaboration. Federated learning has emerged as a means to estimate models without accessing individual observations from multiple data providers like hospitals or health organizations. However, this strategy requires communicating parameter estimate updates to a central server until convergence to produce a global model. In this research, we propose an approach to estimate mixed linear, logistic, and Poisson models based on summary statistics requested only once from each data provider. Our strategy involves generating pseudo-data whose summary statistics match those of the actual but unavailable data and using them in the model estimation process. The estimates we achieve are identical or at least as good as those derived from the actual data in terms of bias and coverage. Generalizability and communication efficiency distinguish our approach from the existing methods.

Keywords

federated learning

mixed effects models

data privacy

pseudo-data

aggregate data

statistical sufficiency 

Main Sponsor

Biometrics Section