Print Close

Exploring the big data paradox for various estimands using vaccination data from a global survey

Presented During: Big Data Initiatives in Survey Statistics

Walter Dempsey Co-Author

Peisong Han Co-Author
Gilead Sciences

Yashwant Deshmukh Co-Author
CVoter Foundation

Sylvia Richardson Co-Author
Cambridge College London

Brian Tom Co-Author
University of Cambridge

Bhramar Mukherjee Co-Author
University of Michigan

Youqi Yang First Author

Youqi Yang Presenting Author

Tuesday, Aug 6: 9:05 AM - 9:20 AM
2099
Contributed Papers

Oregon Convention Center

Presentation

Description

Selection bias poses a substantial challenge to valid statistical inference in non-probability samples. This study compared estimates of the first-dose COVID-19 vaccination rates among Indian adults in 2021 from a large non-probability sample, the COVID-19 Trends and Impact Survey (CTIS), and a small probability survey, the Center for Voting Options and Trends in Election Research (CVoter), against national benchmark data from the COVID Vaccine Intelligence Network (CoWIN). Notably, CTIS exhibits a larger estimation error on average (0.37) compared to CVoter (0.14). Additionally, we explored the accuracy of CTIS in estimating successive differences (over time) and subgroup differences in mean vaccine uptakes (for females versus males). Compared to the overall vaccination rates, targeting these alternative estimands comparing differences or relative differences in two means, increased the effective sample size. These results suggest that the Big Data Paradox can manifest in countries beyond the US and may not apply equally to every estimand of interest.

Keywords

Big Data Paradox

Non-probability sample

Selection bias

Online survey

Vaccine uptake

View Abstract 2099

Main Sponsor

Survey Research Methods Section