Commit 
							
							·
						
						11a7a28
	
1
								Parent(s):
							
							f517097
								
Adding Evaluation Results (#5)
Browse files- Adding Evaluation Results (d51cb0405256ff61b3f51a445484a5d3c5a2dc4d)
Co-authored-by: Open LLM Leaderboard PR Bot <leaderboard-pr-bot@users.noreply.huggingface.co>
    	
        README.md
    CHANGED
    
    | @@ -156,4 +156,17 @@ To cite this model: | |
| 156 | 
             
            ```
         | 
| 157 |  | 
| 158 | 
             
            ## Contact
         | 
| 159 | 
            -
            Hello@writer.com
         | 
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | 
|  | |
| 156 | 
             
            ```
         | 
| 157 |  | 
| 158 | 
             
            ## Contact
         | 
| 159 | 
            +
            Hello@writer.com
         | 
| 160 | 
            +
            # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
         | 
| 161 | 
            +
            Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Writer__palmyra-med-20b)
         | 
| 162 | 
            +
             | 
| 163 | 
            +
            | Metric                | Value                     |
         | 
| 164 | 
            +
            |-----------------------|---------------------------|
         | 
| 165 | 
            +
            | Avg.                  | 40.02   |
         | 
| 166 | 
            +
            | ARC (25-shot)         | 46.93          |
         | 
| 167 | 
            +
            | HellaSwag (10-shot)   | 73.51    |
         | 
| 168 | 
            +
            | MMLU (5-shot)         | 44.34         |
         | 
| 169 | 
            +
            | TruthfulQA (0-shot)   | 35.47   |
         | 
| 170 | 
            +
            | Winogrande (5-shot)   | 65.35   |
         | 
| 171 | 
            +
            | GSM8K (5-shot)        | 2.65        |
         | 
| 172 | 
            +
            | DROP (3-shot)         | 11.88         |
         | 

