view article Article Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models By nvidia and 3 others • 9 days ago • 15
view article Article OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models By nvidia and 3 others • Jul 18 • 50
Scoring Verifiers Collection Benchmarks for evaluating synthetic verifiers like test case generation and code reward models (as found in https://www.arxiv.org/abs/2502.13820). • 2 items • Updated 8 days ago • 1
Scoring Verifiers Collection Benchmarks for evaluating synthetic verifiers like test case generation and code reward models (as found in https://www.arxiv.org/abs/2502.13820). • 2 items • Updated 8 days ago • 1