Improving naive Bayes classifiers with high-dimensional non-Gaussian data
Abstract Number:
1191
Submission Type:
Contributed Abstract
Contributed Abstract Type:
Poster
Participants:
Mijin Jeong (1), Gyuhyeong Goh (2), Dipak Dey (3)
Institutions:
(1) Kyungpook National University, N/A, (2) Department of Statistics, Kyungpook National University, N/A, (3) University of Connecticut, N/A
Co-Author(s):
Gyuhyeong Goh
Department of Statistics, Kyungpook National University
First Author:
Presenting Author:
Abstract Text:
The naive Bayes classifier, which assumes the conditional independence of predictors, improves classification efficiency and has a great advantage in handling high-dimensional data as well as imbalanced data. However, the success of the naive Bayes classifier hinges on the normality assumption for each continuous predictor and its performance decreases considerably as many irrelevant predictor are included.
In this paper, we develop a way of improving the performance of naive Bayes classifiers when we deal with high-dimensional non-Gaussian data. To remove irrelevant predictors, we develop an efficient variable selection procedure in the context of naive Bayes classification using the notion of Bayesian Information Criteria (BIC). In addition, we adapt the naive Bayes classifier for use with non-Gaussian data via power transformation. We conduct a comparative simulation study to demonstrate the superiority of our proposed classifier over existing classification methods. We also apply our proposed classifier to real data and confirm its effectiveness.
Keywords:
Bayes classifier|Generative classifier|High-dimensional variable selection|Power transformation| |
Sponsors:
Section on Statistical Learning and Data Science
Tracks:
Machine Learning
Can this be considered for alternate subtype?
No
Are you interested in volunteering to serve as a session chair?
No
I have read and understand that JSM participants must abide by the Participant Guidelines.
Yes
I understand that JSM participants must register and pay the appropriate registration fee by June 3, 2025. The registration fee is non-refundable.
I understand
You have unsaved changes.