Parallel Test-Time Scaling for Latent Reasoning Models Paper • 2510.07745 • Published Oct 9, 2025 • 5
Towards Harmless Multimodal Assistants with Blind Preference Optimization Paper • 2503.14189 • Published Mar 18, 2025