10. Mixed Integer Programming for Feature Selection in Scalar-on-Function Regression

Conference: Women in Statistics and Data Science 2025
11/13/2025: 2:30 PM - 4:00 PM EST
Speed 

Description

Feature selection is a critical challenge in model selection, particularly for functional data, where appropriate statistical methodologies remain underdeveloped. This study investigates the application of Mixed Integer Programming (MIP) combined with information criteria for best feature subset selection in scalar-on-function regression (i.e., regression models where predictors are curves). Transforming the functional regression problem into a classic linear model framework with grouped variables allows the use of model selection criteria such as Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), and Generalized Cross Validation (GCV), in combination with MIP. In simulation studies, we compared our MIP method to alternative approaches and found that it consistently identifies truly active features.

Keywords

Functional data analysis

Mixed Integer Programming

Model Selection 

Presenting Author

Asha Pantula

First Author

Asha Pantula

CoAuthor(s)

Luca Frigato, Università di Torino
Ana Kenney, University of California – Irvine
Marzia Cremona, Universite Laval

Target Audience

Beginner

Tracks

Knowledge
Women in Statistics and Data Science 2025