amd
/

PARD-Llama-3.2-1B

@@ -31,7 +31,7 @@ PARD is a high-performance speculative decoding method that also enables low-cos
 <p align="center">
   <figure style="display: inline-block; text-align: center;">
-    <img src="https://cdn-uploads.huggingface.co/production/uploads/630cb01cc169245d78fe76b6/Dh-7wE-l0YAfU9lXWssKf.png" width="90%">
     <figcaption style="font-style: italic; margin-top: 2px;">
       AR and AR+ represent baseline auto-regressive generation using Transformers and Transformers+, respectively. VSD denotes vanilla speculative decoding. PARD refers to the proposed method in this work.
     </figcaption>

 <p align="center">
   <figure style="display: inline-block; text-align: center;">
+    <img src="https://cdn-uploads.huggingface.co/production/uploads/630cb01cc169245d78fe76b6/Dh-7wE-l0YAfU9lXWssKf.png" width="100%">
     <figcaption style="font-style: italic; margin-top: 2px;">
       AR and AR+ represent baseline auto-regressive generation using Transformers and Transformers+, respectively. VSD denotes vanilla speculative decoding. PARD refers to the proposed method in this work.
     </figcaption>