Rank, Order-Statistic, and U-Statistic Methods in Nonparametric Inference

Anqi Zhao Chair
 
Tuesday, Aug 4: 4:00 PM - 5:50 PM
6224 
Contributed Papers 
Thomas M. Menino Convention & Exhibition Center 
Room: CC-208 

Main Sponsor

Section on Nonparametric Statistics

Presentations

Correlation Bounds for Judgment Order Statistics

It is known that the maximum possible correlation between two order statistics from a simple random sample of size n occurs when the distribution is uniform. Motivated by the judgment rankings used in ranked-set sampling and related sample schemes, we study what correlations are possible if one looks at judgment order statistics rather than actual order statistics. In other words, we look for bounds on the correlation when we consider all possible distributions and all possible ranking schemes. We find that the interval of possible correlations becomes much wider; indeed, for sample sizes of at least four, all correlations except a perfect negative correlation are possible. We also study the intermediate case where the judgment rankings are based on a covariate, in which case the judgment order statistics are concomitants of order statistics. We explore whether our results can be used to improve statistical inference based on ranked-set sampling data. 

Keywords

Order statistics

Ranked-set sampling

Correlation

Judgment rankings 

Speaker

Jesse Frey, Villanova University

Large Sample Properties of a U-Statistic Comparison for Biometric Individuality

A common question when deciding whether to consider a type of forensic evidence is the degree at which trace objects from different sources can be distinguished. The degree is referred to as the individuality of the forensic modality, with respect to a specified metric for comparing a large number of samples from multiple sources. Regardless of the sample size, if the metric does not give rise to an exclusionary difference between two sources, then two sources are said to have indistinguishable profiles, with respect to the metric. We use the score developed in Davis et al. (2012) for sparse categorical data arising from the FlashID system©. We will define a new level alpha test for the equality of scores, based on all pairwise comparisons within and between documents from two different writers. In this presentation, we will derive the limiting distribution of our statistic. This test compares average within-writer scores with the between-writer scores. Using the corresponding p-values between all pairs of writers, we use a mixture model to make a statement about the confidence that a given p-value arose from two writers with indistinguishable profiles. 

Keywords

Biometric Individuality

Parametric Bootstrap

U-Statistics

Pairwise Scores 

Speaker

Clarissa Giefer, South Dakota State University

Co-Author(s)

Cami Fuglsby, Augustana University
Janean Hanka, South Dakota State University
Christopher Saunders, South Dakota State University

On the connections between Chatterjee's correlation and rank distance correlation

Distance correlation and Chatterjee's correlation are two highly representative measures of dependence. Their simplicity and desirable properties have attracted much attention over the past years, leading to numerous extensions and in-depth discussions. The theoretical connections between these two measures, however, have remained largely underexplored, despite some empirical comparisons of their statistical power. In this work, we investigate some connections between Chatterjee's correlation and the rank-based distance correlation. We show that both belong to a class of dependence measures based on non-negative weighted rank differences, but with distinct weight allocations resulting in different asymptotic null distributions. Motivated by this finding, we construct a new rank distance correlation using Gaussian kernel that, quite interestingly, exhibits Chatterjee-like behavior under finite sample and specific bandwidth conditions. 

Keywords

distance correlation

Chatterjee's correlation

non-negative weighted rank difference 

Speaker

Qingyang Zhang, University of Arkansas

Performance of Permutation-Based U-Statistics: Validating Minimax Optimality via Simulation

Recent theory by Kim, Balakrishnan, and Wasserman (2022) introduces a non-asymptotic framework for the minimax power of permutation tests. This study empirically validates these theoretical guarantees for U-statistic based permutation tests through extensive simulations.

We consider two-sample testing and categorical independence testing. In the two-sample setting, we compare the Mann–Whitney U-test based on normal approximations with its permutation-based version. For independence testing, we compare the classical chi-squared test, a bootstrapped chi-squared test, and the multinomial U-statistic proposed by Kim, Balakrishnan, and Wasserman.

Our results demonstrate that while traditional tests suffer from power collapse and poor Type I error control as dimensionality increases relative to sample size, the permutation-based U-statistic consistently achieves predicted minimax separation rates. By characterizing these error rates across diverse probability distributions and real-world data, we show that permutation tests offer a practically relevant and theoretically grounded alternative for small sample testing, particularly for high-dimensional multinomial settings. 

Keywords

permutation tests

simulation

minimax optimality

U-statistics 

Speaker

Michelle Vered, Everybody Votes Campaign

The Ranked-Set Sampling Fisher’s Exact Test

We study the analog of Fisher's exact test in the case where the data come from two independent samples drawn using ranked-set sampling (RSS) rather than simple random sampling. The exactness property is preserved not just under perfect rankings, but also under imperfect rankings as long as the same ranking scheme is used for both samples. We compare the new test to the simple random sampling Fisher's exact test and to existing RSS competitors in terms of power, and we also study what happens under imperfect rankings when the ranking schemes differ between the two samples. Fisher's exact test is famous for being conservative. We show that in the RSS case, the conservativeness of the test can be significantly reduced by using a data-based tie-breaking procedure that does not require randomization. 

Keywords

Exact tests

Judgment rankings

Test of homogeneity 

Speaker

Yimin Zhang, Villanova University

Co-Author

Jesse Frey, Villanova University