Print Close

Enhancing Public Health Surveillance: Fine-Tuning Large Language Models for Adverse Drug Event Classification

Presented During: CS028 Advancing Data-Driven Community Well-Being in Health and Education

Conference: Symposium on Data Science and Statistics (SDSS) 2025

05/01/2025: 4:15 PM - 4:40 PM MDT
Refereed

Description

The increasing adoption of artificial intelligence (AI) across regulatory and healthcare domains highlights its transformative potential in addressing critical public health challenges. The U.S. Food and Drug Administration (FDA) has identified adverse drug event (ADE) detection as a priority area for innovation, as outlined in its strategic initiatives. Timely and accurate identification of ADEs is critical for ensuring patient safety and informing regulatory decisions. However, leveraging the FDA Adverse Event Reporting System (FAERS) for ADE detection remains fraught with challenges, including data heterogeneity, reporting inconsistencies, and scalability issues.

Recent advances in generative AI, machine learning (ML), and large language models (LLMs) offer a promising path forward. A recent study demonstrated the efficacy of fine-tuned LLMs, such as GPT-3.5, in analyzing detailed vaccine adverse event reports in the Vaccine Adverse Event Reporting System (VAERS) (Li et al., 2024). Using 91 annotated reports, the authors developed AE-GPT, a tool for automatically extracting and categorizing adverse events, setting a new benchmark in ADE detection.

Our research builds on this precedent, aiming to enhance ADE detection by fine-tuning LLMs for FAERS datasets. FAERS contains millions of masked case reports spanning 2004 to 2024, with data fields including demographic, administrative, drug, reaction, and patient outcome information. We use embeddings from LLMs to classify case severity and identify features predictive of severity, providing a multi-strata classification scheme for ADE detection. We use logistic regression as a baseline and compare the results to standard ML models including a Random Forest classifier, DB Scan, and XGBoost. Our framework achieved notable results demonstrating the potential of LLMs in processing complex medical data and highlight the ability to enhance early ADE detection.

Keywords

health surveillance

large language models

machine learning

adverse drug events

Presenting Author

John Riddles, Westat

First Author

Joshua Turner, Westat

CoAuthor(s)

John Riddles, Westat
Julianna Lee, Westat
Jeremy Corry, Westat
Rashi Saluja
Sean Chickery, Westat
Gizem Korkmaz, Westat
Marcelo Simas, Westat
Kevin Wilson, Westat

Tracks

Practice and Applications

Symposium on Data Science and Statistics (SDSS) 2025