Update README.md
Browse files
README.md
CHANGED
|
@@ -131,14 +131,14 @@ tags:
|
|
| 131 |
<h3>Available Quantization Sizes</h3>
|
| 132 |
<p>To use resources optimally, quants are created in a fixed range of sizes. Custom sizes will only be considered if there is a high community demand and/or available compute.</p>
|
| 133 |
<ul>
|
| 134 |
-
<li><code class="inline-code-dark">2.5bpw_H6</code></li>
|
| 135 |
-
<li><code class="inline-code-dark">3.0bpw_H6</code></li>
|
| 136 |
-
<li><code class="inline-code-dark">3.5bpw_H6</code></li>
|
| 137 |
-
<li><code class="inline-code-dark">4.0bpw_H6</code></li>
|
| 138 |
-
<li><code class="inline-code-dark">4.5bpw_H6
|
| 139 |
-
<li><code class="inline-code-dark">5.0bpw_H6</code></li>
|
| 140 |
-
<li><code class="inline-code-dark">6.0bpw_H6</code></li>
|
| 141 |
-
<li><code class="inline-code-dark">8.0bpw_H8</code></li>
|
| 142 |
</ul>
|
| 143 |
</div>
|
| 144 |
|
|
|
|
| 131 |
<h3>Available Quantization Sizes</h3>
|
| 132 |
<p>To use resources optimally, quants are created in a fixed range of sizes. Custom sizes will only be considered if there is a high community demand and/or available compute.</p>
|
| 133 |
<ul>
|
| 134 |
+
<li><code class="inline-code-dark"><b>2.5bpw_H6</b></code></li>
|
| 135 |
+
<li><code class="inline-code-dark"><b>3.0bpw_H6</b></code></li>
|
| 136 |
+
<li><code class="inline-code-dark"><b>3.5bpw_H6</b></code></li>
|
| 137 |
+
<li><code class="inline-code-dark"><b>4.0bpw_H6</b></code></li>
|
| 138 |
+
<li><code class="inline-code-dark"><b>4.5bpw_H6</b></code> / <code class="inline-code-dark"><b>4.25bpw_H6</b></code> (for 70b and above)</li>
|
| 139 |
+
<li><code class="inline-code-dark"><b>5.0bpw_H6</b></code></li>
|
| 140 |
+
<li><code class="inline-code-dark"><b>6.0bpw_H6</b></code></li>
|
| 141 |
+
<li><code class="inline-code-dark"><b>8.0bpw_H8</b></code></li>
|
| 142 |
</ul>
|
| 143 |
</div>
|
| 144 |
|