A Retrospective Analysis of the SmartFind COVID-19 Vaccine Chatbot: Statistical Insights

Suchita A. Patel Co-Author
CDC
 
Faisal Reza Co-Author
CDC
 
Cynthia Knighton Co-Author
CDC
 
Angela Marie Chambliss Co-Author
CDC
 
Yi Mu First Author
 
Yi Mu Presenting Author
 
Monday, Aug 5: 10:50 AM - 11:05 AM
2412 
Contributed Papers 
Oregon Convention Center 
In 2021, the Centers for Disease Control and Prevention (CDC) implemented a cloud-based chatbot called SmartFind COVID-19 Vaccine Chatbot. This Chatbot employed natural language processing (NLP) to automatically address vaccination-related inquiries. It provided high-confidence responses matched to CDC's COVID-19 frequently asked questions and answers (FAQ&As), or a "Sorry, the ChatBot couldn't find a good match" response for low-confidence matches. An analysis of system logs from August 30, 2021 to March 16, 2023 examined 64,884 visitor questions (of which about three-fifths received an NLP matched response) and 3,925 visitor feedback entries. The goal of this project is to use NLP statistical methods, including tokenization and feature extraction, to analyze question text to determine topics that the chatbot was not able to provide a matched response for, including by design for clinical and disease-related questions, for example. The results can guide improvement of vaccination content by including FAQ&As on CDC's webpages and informing development of future chatbots using more powerful large language models.

Keywords

Chatbot

Natural Language Processing

COVID-19 Vaccination 

Main Sponsor

Section on Text Analysis