Upload folder using huggingface_hub
Browse files
    	
        README.md
    CHANGED
    
    | @@ -31,7 +31,7 @@ PARD is a high-performance speculative decoding method that also enables low-cos | |
| 31 |  | 
| 32 | 
             
            <p align="center">
         | 
| 33 | 
             
              <figure style="display: inline-block; text-align: center;">
         | 
| 34 | 
            -
                <img src="https://cdn-uploads.huggingface.co/production/uploads/630cb01cc169245d78fe76b6/Dh-7wE-l0YAfU9lXWssKf.png" width=" | 
| 35 | 
             
                <figcaption style="font-style: italic; margin-top: 2px;">
         | 
| 36 | 
             
                  AR and AR+ represent baseline auto-regressive generation using Transformers and Transformers+, respectively. VSD denotes vanilla speculative decoding. PARD refers to the proposed method in this work.
         | 
| 37 | 
             
                </figcaption>
         | 
|  | |
| 31 |  | 
| 32 | 
             
            <p align="center">
         | 
| 33 | 
             
              <figure style="display: inline-block; text-align: center;">
         | 
| 34 | 
            +
                <img src="https://cdn-uploads.huggingface.co/production/uploads/630cb01cc169245d78fe76b6/Dh-7wE-l0YAfU9lXWssKf.png" width="100%">
         | 
| 35 | 
             
                <figcaption style="font-style: italic; margin-top: 2px;">
         | 
| 36 | 
             
                  AR and AR+ represent baseline auto-regressive generation using Transformers and Transformers+, respectively. VSD denotes vanilla speculative decoding. PARD refers to the proposed method in this work.
         | 
| 37 | 
             
                </figcaption>
         | 
