10. Mixed Integer Programming for Feature Selection in Scalar-on-Function Regression
Conference: Women in Statistics and Data Science 2025
11/13/2025: 2:30 PM - 4:00 PM EST
Speed
Feature selection is a critical challenge in model selection, particularly for functional data, where appropriate statistical methodologies remain underdeveloped. This study investigates the application of Mixed Integer Programming (MIP) combined with information criteria for best feature subset selection in scalar-on-function regression (i.e., regression models where predictors are curves). Transforming the functional regression problem into a classic linear model framework with grouped variables allows the use of model selection criteria such as Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), and Generalized Cross Validation (GCV), in combination with MIP. In simulation studies, we compared our MIP method to alternative approaches and found that it consistently identifies truly active features.
Functional data analysis
Mixed Integer Programming
Model Selection
Presenting Author
Asha Pantula
First Author
Asha Pantula
CoAuthor(s)
Luca Frigato, Università di Torino
Ana Kenney, University of California – Irvine
Marzia Cremona, Universite Laval
Target Audience
Beginner
Tracks
Knowledge
Women in Statistics and Data Science 2025
You have unsaved changes.