Papers
2026
- Prioritize the Process, Not Just the Outcome: Rewarding Latent Thought Trajectories Improves Reasoning in Looped Language Models
Under Review at ICML 2026 (Mean review score = 4.5/6)
Jonathan Williams, Esin Tureci
Paper
2025
- Test-Time Alignment via Hypothesis Reweighting
TMLR, ICML 2025 Workshop PUT
*Y. Lee, J. Williams, Henrik Marklund, Archit Sharma, Eric Mitchell, Anikait Singh, Chelsea Finn
Paper