Papers

Prioritize the Process, Not Just the Outcome: Rewarding Latent Thought Trajectories Improves Reasoning in Looped Language Models
Arxiv
Jonathan Williams, Esin Tureci
Paper

Test-Time Alignment via Hypothesis Reweighting
ICML 2025 Workshop PUT · Y. Lee, J. Williams, Henrik Marklund, Archit Sharma, Eric Mitchell, Anikait Singh, Chelsea Finn
Paper