Upload index.html
index.html +1 -1
index.html
CHANGED
@@ -297,7 +297,7 @@
 
 <div class="content has-text-justified">
 <p>
-We evaluate CarBoN on MATH-500 and AIME-2024 benchmarks across multiple language models. The results demonstrate that CarBoN consistently improves upon standard Best-of-\(N\) sampling, with calibrated accuracy at \(N=64\) matching or exceeding uncalibrated results at \(N=256\)
+We evaluate CarBoN on MATH-500 and AIME-2024 benchmarks across multiple language models. The results demonstrate that CarBoN consistently improves upon standard Best-of-\(N\) sampling, with calibrated accuracy at \(N=64\) matching or exceeding uncalibrated results at \(N=256\) (\(4\times\) reduction in rollout budget).
 </p>
 </div>
 