Sampling from ranking models via a hit and run approach

Chenyang Zhong First Author
Department of Statistics, Columbia University
 
Chenyang Zhong Presenting Author
Department of Statistics, Columbia University
 
Monday, Aug 4: 12:05 PM - 12:10 PM
2177 
Contributed Speed 
Music City Center 
The Mallows permutation model is a prominent model for ranking data, which are prevalent in diverse fields such as recommender systems, psychology, and electoral studies. This model specifies a family of non-uniform probability distributions on permutations, defined via a distance metric on permutations. We focus on two common choices: the L1 distance (Spearman's footrule) and the L2 distance (Spearman's rank correlation). Despite their popularity, these models present a significant computational challenge due to the intractability of their normalizing constants, hindering off-the-shelf sampling and inference methods. We develop and analyze hit and run Markov chain Monte Carlo algorithms for sampling from these Mallows models. For both models, we establish order log(n) mixing time upper bounds, providing the first theoretical guarantees for efficient sampling. The convergence analysis employs novel couplings for permutations with one-sided restrictions and leverages the path coupling technique. These advancements enable efficient Monte Carlo maximum likelihood estimation, facilitating scalable inference for ranking data with statistical guarantees.

Keywords

Mallows permutation model

ranking data

hit and run algorithm

Markov chain Monte Carlo

mixing time

scalable inference 

Main Sponsor

IMS