Spaces:

allenai
/

ZebraLogic

Running

App Files Files Community

ZebraLogic / _header.md

yuchenlin's picture

inti commit

1c919b3 over 1 year ago

|

554 Bytes

🦁 WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild

📑 Paper | 💻 GitHub | 🤗 HuggingFace | 🐦 X | 💬 Discussion | ⚙️ Version: V2 | # Models: {model_num} | Updated: {LAST_UPDATED}