Print Close

Provable Sampling Acceleration in Diffusion Language Models via Adaptation to Low-Dimensional Structures

Presented During: Statistical Foundations for Generative Models and Bayesian Inference

Changxiao Cai Speaker
University of Michigan

Thursday, Aug 6: 10:55 AM - 11:15 AM
Topic-Contributed Paper Session

Thomas M. Menino Convention & Exhibition Center

Diffusion language models (DLMs) have emerged as a compelling alternative to autoregressive (AR) models by enabling parallel, non-sequential token generation. Despite their strong empirical performance, the theoretical understanding of how decoding strategies affect sampling efficiency remains limited. In this talk, we present recent theoretical advances showing that DLM sampling can be substantially accelerated through decoding strategies that exploit low-dimensional structure in the target data distribution. We establish convergence guarantees for both uniform and confidence-based decoding strategies, proving that high-quality samples can be generated in a sublinear number of iterations. In particular, the iteration complexity depends on information-theoretic quantities that capture the intrinsic complexity of the target distribution. These results provide a theoretical foundation for efficient diffusion-based language generation and offer principled insights into decoding strategy design.