Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
sdiazlor 's Collections
Leaderboards
Instruction Models
Computer Vision Models
Audio Models
Data Related Tools
Utilities
Favorite Demos

Leaderboards

updated Jul 14

Meaningful leaderboards showcasing LLM evaluation results across various tasks and dimensions

Upvote
-

  • Running
    15
    15

    InferBench

    🥇

    A cost/quality/speed Leaderboard for Inference Providers!


  • Running on CPU Upgrade
    6.61k
    6.61k

    MTEB Leaderboard

    🥇

    Embedding Leaderboard


  • Running on CPU Upgrade
    13.6k
    13.6k

    Open LLM Leaderboard

    🏆

    Track, rank and evaluate open LLMs and chatbots


  • Running
    4.65k
    4.65k

    LMArena Leaderboard

    🏆

    Display LMArena Leaderboard


  • Running on CPU Upgrade
    74
    74

    La Leaderboard

    🌸

    Evaluate open LLMs in the languages of LATAM and Spain.


  • Running
    108
    108

    Judge Arena

    💻

    Vote on AI responses to rank models


  • Running
    569
    569

    LLM-Perf Leaderboard

    🏆

    Explore hardware performance for LLMs


  • Running
    178
    178

    Vidore Leaderboard

    🥇

    Explore visual document retrieval benchmark results


  • Running on CPU Upgrade
    921
    921

    Open VLM Leaderboard

    🌎

    VLMEvalKit Evaluation Results Collection


  • Running
    85
    85

    SEED-Bench Leaderboard

    🏆

    Submit model evaluation results to leaderboard


  • Running
    23
    23

    MM-UPD Leaderboard

    🥇

    Submit and evaluate model results on MM-UPD benchmarks


  • Running
    24
    24

    MMBench Leaderboard

    🚀

    Explore MMBench Leaderboard data

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs