Papers
2026
- Prioritize the Process, Not Just the Outcome: Rewarding Latent Thought Trajectories Improves Reasoning in Looped Language Models
arXiv
Jonathan Williams, Esin Tureci
Paper
2025
- Test-Time Alignment via Hypothesis Reweighting
ICML 2025 Workshop PUT · Y. Lee, J. Williams, Henrik Marklund, Archit Sharma, Eric Mitchell, Anikait Singh, Chelsea Finn
Paper