ArtusDev committed on
Commit 3200499 · verified · 1 Parent(s): 79ba367

Update README.md

Files changed (1)
  1. README.md +8 -8
README.md CHANGED
@@ -131,14 +131,14 @@ tags:
  <h3>Available Quantization Sizes</h3>
  <p>To use resources optimally, quants are created in a fixed range of sizes. Custom sizes will only be considered if there is a high community demand and/or available compute.</p>
  <ul>
- <li><code class="inline-code-dark">2.5bpw_H6</code></li>
- <li><code class="inline-code-dark">3.0bpw_H6</code></li>
- <li><code class="inline-code-dark">3.5bpw_H6</code></li>
- <li><code class="inline-code-dark">4.0bpw_H6</code></li>
- <li><code class="inline-code-dark">4.5bpw_H6 (4.25bpw_H6 for 70b and above)</code></li>
- <li><code class="inline-code-dark">5.0bpw_H6</code></li>
- <li><code class="inline-code-dark">6.0bpw_H6</code></li>
- <li><code class="inline-code-dark">8.0bpw_H8</code></li>
+ <li><code class="inline-code-dark"><b>2.5bpw_H6</b></code></li>
+ <li><code class="inline-code-dark"><b>3.0bpw_H6</b></code></li>
+ <li><code class="inline-code-dark"><b>3.5bpw_H6</b></code></li>
+ <li><code class="inline-code-dark"><b>4.0bpw_H6</b></code></li>
+ <li><code class="inline-code-dark"><b>4.5bpw_H6</b></code> / <code class="inline-code-dark"><b>4.25bpw_H6</b></code> (for 70b and above)</li>
+ <li><code class="inline-code-dark"><b>5.0bpw_H6</b></code></li>
+ <li><code class="inline-code-dark"><b>6.0bpw_H6</b></code></li>
+ <li><code class="inline-code-dark"><b>8.0bpw_H8</b></code></li>
  </ul>
  </div>
 
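
The size rule in the list above can be sketched in Python. This is only an illustration, not part of the commit: the helper name `available_sizes` and the constant `BASE_SIZES` are hypothetical, and it assumes that "70b and above" means a parameter count of at least 70 billion and that the 4.25bpw variant replaces the 4.5bpw one for such models (the README wording suggests this but does not state it outright).

```python
# Hypothetical helper: the labels are copied verbatim from the README list.
BASE_SIZES = [
    "2.5bpw_H6", "3.0bpw_H6", "3.5bpw_H6", "4.0bpw_H6",
    "4.5bpw_H6", "5.0bpw_H6", "6.0bpw_H6", "8.0bpw_H8",
]


def available_sizes(params_billion: float) -> list[str]:
    """Return the quant labels expected for a model of the given size (in billions of parameters)."""
    # Assumption: for models at or above 70B, 4.25bpw_H6 takes the place of 4.5bpw_H6.
    if params_billion >= 70:
        return ["4.25bpw_H6" if s == "4.5bpw_H6" else s for s in BASE_SIZES]
    # Smaller models get the base list unchanged (returned as a copy).
    return list(BASE_SIZES)
```

Calling `available_sizes(8)` would give the base list, while `available_sizes(70)` would swap in the 4.25bpw entry.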