Effortless Access to Household Pulse Survey Datasets with {hpsr}: An R Data Package
Abstract Number:
2616
Submission Type:
Contributed Abstract
Contributed Abstract Type:
Poster
Participants:
Prasad Bhoite (1), Christopher Clark (1), Krupa Patel (1), Rachel Clarke (1), Nana Aisha Garba (1)
Institutions:
(1) Florida International University, Miami, FL
Co-Author(s):
First Author:
Presenting Author:
Abstract Text:
This abstract highlights the significance of utilizing Household Pulse Survey data to comprehensively evaluate COVID-19's impact on the U.S. population. The project involves creating an R package using tools like usethis, Rtools, and R Studio. The methodology includes downloading CSV files from the Census website, optimizing storage via Parquet conversion, and storing on Github. Files are transformed into .rda datasets, resulting in an open-source package on GitHub for the data science community. Converting CSV data to Parquet format reduces file size without compromising integrity. Parquet files are then transformed into .rda datasets, aligning with R's native format, all managed in R Studio for a streamlined, reproducible workflow. The package encapsulates diverse insights from 63 transformed datasets, enhancing accessibility and efficiency for researchers. The optimized pipeline contributes to storage and retrieval efficiency, and the GitHub repository promotes collaboration. The discussion emphasizes the open-source contribution's role in advancing data science, with gains from CSV to Parquet conversion, .rda dataset adoption, and community engagement through the GitHub.
Keywords:
R Package Development|Household Pulse Survey|COVID-19 Impact |Open Source contribution| |
Sponsors:
Section for Statistical Programmers and Analysts
Tracks:
Open source programming
Can this be considered for alternate subtype?
Yes
Are you interested in volunteering to serve as a session chair?
Yes
I have read and understand that JSM participants must abide by the Participant Guidelines.
Yes
I understand that JSM participants must register and pay the appropriate registration fee by June 1, 2024. The registration fee is non-refundable.
I understand
You have unsaved changes.