update

dist/index.html  +1 -1  CHANGED
@@ -464,7 +464,7 @@ machinery is the <code>attention mask</code>, cause of confusion. Thankfully, we
       <h3>π CUDA Warmup Efficiency Benchmark</h3>
     </div>
     <div class=demo-content>
-      <iframe src=https://molbap-cuda-warmup-transformers.hf.space width=100% height=
+      <iframe src=https://molbap-cuda-warmup-transformers.hf.space width=100% height=800px frameborder=0 style="border-radius: 8px; background: white;"></iframe>
     </div>
     <div class=demo-footer>
       Real CUDA warmup benchmarking with actual Transformers models. Measure the performance impact of the <code>caching_allocator_warmup</code> function at <code>transformers/src/transformers/modeling_utils.py:6186</code>. This interactive tool loads models twice - once with warmup disabled and once with warmup enabled - to demonstrate the significant loading time improvements.
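
The embedded Space drives the comparison described in the demo footer: load a model with the warmup left in place, then again with it disabled, and compare loading times. Below is a minimal sketch of that idea, not the Space's actual code; the model id is an arbitrary example, and the warmup is toggled here by monkeypatching `transformers.modeling_utils.caching_allocator_warmup` (the function named in the footer) to a no-op, which may differ from how the Space implements the toggle.

```python
# Minimal sketch (assumptions: "gpt2" as an example checkpoint; monkeypatching
# caching_allocator_warmup as the way to disable the warmup). Requires
# transformers, torch, and accelerate (for device_map="auto").
import time

import torch
from transformers import AutoModelForCausalLM
import transformers.modeling_utils as modeling_utils

MODEL_ID = "gpt2"  # assumption: substitute any checkpoint you want to benchmark


def timed_load(disable_warmup: bool) -> float:
    """Load MODEL_ID once and return the wall-clock loading time in seconds."""
    original = modeling_utils.caching_allocator_warmup
    if disable_warmup:
        # Swap the warmup for a no-op so the CUDA caching allocator stays cold.
        modeling_utils.caching_allocator_warmup = lambda *args, **kwargs: None
    try:
        start = time.perf_counter()
        AutoModelForCausalLM.from_pretrained(
            MODEL_ID, torch_dtype=torch.float16, device_map="auto"
        )
        return time.perf_counter() - start
    finally:
        modeling_utils.caching_allocator_warmup = original


if __name__ == "__main__":
    # For a fair comparison, run each case in a fresh process; back-to-back
    # loads in one process share the disk cache and CUDA context initialization.
    no_warmup = timed_load(disable_warmup=True)
    with_warmup = timed_load(disable_warmup=False)
    print(f"warmup disabled: {no_warmup:.2f}s | warmup enabled: {with_warmup:.2f}s")
```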