kumitang commited on
Commit
8d50cf2
·
verified ·
1 Parent(s): d00556c

Upload index.html

Browse files
Files changed (1) hide show
  1. index.html +1 -1
index.html CHANGED
@@ -297,7 +297,7 @@
297
 
298
  <div class="content has-text-justified">
299
  <p>
300
- We evaluate CarBoN on MATH-500 and AIME-2024 benchmarks across multiple language models. The results demonstrate that CarBoN consistently improves upon standard Best-of-\(N\) sampling, with calibrated accuracy at \(N=64\) matching or exceeding uncalibrated results at \(N=256\)—a \(4\times\) reduction in rollout budget.
301
  </p>
302
  </div>
303
 
 
297
 
298
  <div class="content has-text-justified">
299
  <p>
300
+ We evaluate CarBoN on MATH-500 and AIME-2024 benchmarks across multiple language models. The results demonstrate that CarBoN consistently improves upon standard Best-of-\(N\) sampling, with calibrated accuracy at \(N=64\) matching or exceeding uncalibrated results at \(N=256\) (\(4\times\) reduction in rollout budget).
301
  </p>
302
  </div>
303