2026

  • Prioritize the Process, Not Just the Outcome: Rewarding Latent Thought Trajectories Improves Reasoning in Looped Language Models
    Under Review at ICML 2026 (Mean review score = 4.5/6)
    Jonathan Williams, Esin Tureci
    Paper

2025

  • Test-Time Alignment via Hypothesis Reweighting
    TMLR, ICML 2025 Workshop PUT *Y. Lee, J. Williams, Henrik Marklund, Archit Sharma, Eric Mitchell, Anikait Singh, Chelsea Finn
    Paper