Simulation-Based Software for Sample Size Calculations in Linear and Logistic Regression

Conference: Symposium on Data Science and Statistics (SDSS) 2025
05/01/2025: 1:15 PM - 2:45 PM MDT
Lightning 

Description

This study develops statistical software to perform sample size calculations in multivariate linear and logistic regression settings based on simulations. Sample size calculations in multivariate regression settings may have to be estimated without analytic calculations. Simulation studies present one manner of estimating the statistical power across repeated experiments. The software develops a searching algorithm that considers a range of sample sizes. Users can specify the data model to generate the study's variables, the regression model to implement, and the simulation's parameters. This greatly reduces the coding required to develop the simulation and to search for the minimally sufficient sample size. We demonstrate the implementation of the software on an example with multivarite regression.

Keywords

Simulation

sample size calculation

regression

statistical software 

Presenting Author

David Shilane, Columbia University

First Author

David Shilane, Columbia University

Tracks

Software & Data Science Technologies
Symposium on Data Science and Statistics (SDSS) 2025