publications
publications by category, in reverse chronological order.
2026
- ICML DEMO
AsyncOPD: How Stale Can On-Policy Distillation Be?
ICML 2026 DEMO Workshop, Extended version under review, 2026
- ICML AdaptFM
EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts
ICML 2026 AdaptFM Workshop, Extended version under review, 2026
- ICML AdaptFM
MAGE: All-[MASK] Block Already Knows Where to Look in Diffusion LLM
ICML 2026 AdaptFM Workshop, Extended version under review, 2026
- ICLR DeLTa
CATS: Inference-aligned SFT for Diffusion LLMs via Context-sensitivity Aware Trajectory Sampling
ICLR 2026 DeLTa Workshop, Extended version under review, 2026
- ISCA MLArchSys
LLM Inference in a Flash!
ISCA 2026 MLArchSys Workshop (Oral), 2026
2025
-
Beyond Next-Token Prediction: A Performance Characterization of Diffusion versus Autoregressive Language Models
arXiv Preprint, 2025