Confidence sequences via online learning with applications in offline contextual bandits and active preference learning

Kwang-Sung Jun Speaker
University of Arizona
 
Sunday, Aug 3: 2:25 PM - 2:45 PM
Topic-Contributed Paper Session 
Music City Center 
Confidence sequence provides ways to characterize uncertainty in stochastic environments, which is a widely-used tool for interactive machine learning algorithms and statistical problems including A/B testing, Bayesian optimization, reinforcement learning, and offline evaluation/learning. In these problems, constructing confidence sequences that are tight without losing correctness is crucial since it has a dramatic impact on the performance of downstream tasks. In this talk, I will present how to leverage results from online learning to derive confidence sequences that are provably and numerically tight. First, I will present an implicitly-defined confidence sequence for bounded random variables, which induces an empirical Bernstein-style confidence bound (thus adapts to the variance) and is provably better than the KL divergence-based confidence bound simultaneously, unlike the standard empirical Bernstein bound. Our confidence bound is never vacuous, can be efficiently computed, and provides state-of-the-art numerical performance. Furthermore, I will show that our bound can also be extended to [0,\infty)-valued random variables and obtain a lower confidence bound. We apply this to offline contextual bandits and obtain the state-of-the-art learning guarantee. Second, I will turn to generalized linear models and how leveraging online learning helps develop (i) improved confidence sets for logistic linear models and (ii) noise-adaptive confidence sets for linear models with sub-Gaussian and bounded noise respectively. These lead to improved regret bounds in (generalized) linear bandit problems. Furthermore, I will show our confidence bound can be used to improve active preference learning in large language models. I will conclude with open problems in confidence sequences and the role that online learning plays for them.