Comparing Two Categorical Gini Correlations with Applications to Classification Problems

Abstract Number:

3739 

Submission Type:

Contributed Abstract 

Contributed Abstract Type:

Paper 

Participants:

Sameera Hewage (1), Yongli Sang (1)

Institutions:

(1) University of Louisiana at Lafayette, N/A

Co-Author:

Yongli Sang  
University of Louisiana at Lafayette

Speaker:

Sameera Hewage  
University of Louisiana at Lafayette

Abstract Text:

We introduce a general inferential framework for comparing predictor importance in classification models with categorical responses. Our approach is based on the categorical Gini correlation (CGC), a dependence measure between numerical and categorical variables that captures the significance of a predictor for the response. To compare the importance of two predictors with respect to the same categorical outcome, we conduct hypothesis tests on their CGCs. The framework accommodates predictors of arbitrary and unequal dimensionalities. We derive the asymptotic distribution of the test statistic for hypothesis testing and show that the test is consistent. In addition, we propose a nonparametric bootstrap procedure as an alternative to the asymptotic normal-based test. Simulation studies demonstrate the empirical performance of the proposed tests, and applications to two real datasets illustrate their practical utility.

Keywords:

categorical Gini correlation|comparing correlations|classification|Predictor importance|Categorical response|Nonparametric bootstrap

Sponsors:

Section on Statistical Learning and Data Science

Tracks:

Variable Selection

Can this be considered for alternate subtype?

Yes

Are you interested in volunteering to serve as a session chair?

No

I have read and understand that JSM participants must abide by the Participant Guidelines.

Yes

I understand that JSM participants must register and pay the appropriate registration fee by June 1, 2026. The registration fee is non-refundable.

I understand