Estimating Counts Through an Average Rounded to an Integer and its Theoretical & Practical Effects

Axel Cortes-Cubero Co-Author
Protocol Labs
 
Israel Almodovar-Rivera Co-Author
University of Puerto Rico At Mayaguez
 
Wolfgang Rolke Co-Author
University of Puerto Rico RUM
 
Roberto Rivera First Author
University of Puerto Rico-Mayaguez
 
Roberto Rivera Presenting Author
University of Puerto Rico-Mayaguez
 
Thursday, Aug 7: 11:05 AM - 11:20 AM
2117 
Contributed Papers 
Music City Center 
In practice, the use of rounding is ubiquitous. Although researchers have looked at the implications of rounding continuous random variables, rounding may also be applied to functions of discrete random variables. For example, to infer the number of excess deaths due to falls after a national emergency, authorities may only provide a rounded average of deaths before and after the emergency started. Deaths from falling tend to be relatively low in most places, and such rounding may seriously affect inference on the change in the rate of deaths. In this paper, we study drawing inference on a parameter from the probability mass function of a non-negative discrete random variable Y , when for rounding coarsening width h we get U = h [Y /h] as a proxy for Y. We show that the probability generating function of
U , E(U ), and Var(U ) capture the effect of the coarsening of the support of Y. Theoretical properties are explored further under some probability distributions. Moreover, we introduce two relative risks of rounding metrics to aid the numerical assessment of how sensitive the results may be to rounding. Under certain conditions, rounding has little impact. However, we also find scenarios where rounding
can significantly affect statistical inference. The methods are applied to infer the probability of success of a binomial distribution and estimate the excess deaths due to Hurricane Maria. The simple methods we propose can partially counter rounding error effects.

Keywords

rounding error

binning

Sheppard’s correction

discrete Fourier transform

excess deaths

probability generating function 

Main Sponsor

Section on Statistical Learning and Data Science