Comparing Two Categorical Gini Correlations with Applications to Classification Problems
Abstract Number:
3739
Submission Type:
Contributed Abstract
Contributed Abstract Type:
Paper
Participants:
Sameera Hewage (1), Yongli Sang (1)
Institutions:
(1) University of Louisiana at Lafayette, N/A
Co-Author:
Speaker:
Abstract Text:
We introduce a general inferential framework for comparing predictor importance in classification models with categorical responses. Our approach is based on the categorical Gini correlation (CGC), a dependence measure between numerical and categorical variables that captures the significance of a predictor for the response. To compare the importance of two predictors with respect to the same categorical outcome, we conduct hypothesis tests on their CGCs. The framework accommodates predictors of arbitrary and unequal dimensionalities. We derive the asymptotic distribution of the test statistic for hypothesis testing and show that the test is consistent. In addition, we propose a nonparametric bootstrap procedure as an alternative to the asymptotic normal-based test. Simulation studies demonstrate the empirical performance of the proposed tests, and applications to two real datasets illustrate their practical utility.
Keywords:
categorical Gini correlation|comparing correlations|classification|Predictor importance|Categorical response|Nonparametric bootstrap
Sponsors:
Section on Statistical Learning and Data Science
Tracks:
Variable Selection
Can this be considered for alternate subtype?
Yes
Are you interested in volunteering to serve as a session chair?
No
I have read and understand that JSM participants must abide by the Participant Guidelines.
Yes
I understand that JSM participants must register and pay the appropriate registration fee by June 1, 2026. The registration fee is non-refundable.
I understand
You have unsaved changes.