Pairwise-Elo (P-Elo) Rating System

Kazuhiko Shinki Co-Author
Wayne State University
 
Kin Hang Wong First Author
Wayne State University
 
Kin Hang Wong Presenting Author
Wayne State University
 
Wednesday, Aug 6: 10:05 AM - 10:20 AM
1509 
Contributed Papers 
Music City Center 

Description

This paper proposes a statistical model for player chemistry by extending the de facto Elo rating system. While various rating systems have been proposed, almost all rating systems assume that players' ratings are totally ordered and transitivity holds. Such assumption precludes possibilities that a specific player plays very
well against another specific player regardless of their general ability. The proposed model consists of (i) a statistical test for the existence of pairwise player chemistry (intransitivity) for the entire group of players and (ii) estimation of winning probability for each of the pairs with the inclusion of player chemistry. We call our model P-Elo model. We will compare P-Elo model to the traditional Elo rating system on sports: Sumo Wresting (SW) and Mixed Martial Art (MMA); as well as the recently popular Large Language Model (LLM) evaluation in terms of match/comparison result prediction and probability estimation.

Keywords

Elo Ratings

Bradley-Terry model

Ranking systems

Statistical modelling

Time series

Sports
forecasting 

Main Sponsor

Section on Statistics in Sports