Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
sdiazlor 's Collections
Leaderboards
Instruction Models
Computer Vision Models
Audio Models
Data Related Tools
Utilities
Favorite Demos

Leaderboards

updated Jul 14

Meaningful leaderboards showcasing LLM evaluation results across various tasks and dimensions

Upvote
-

  • Running
    12
    12

    InferBench

    🥇

    A cost/quality/speed Leaderboard for Inference Providers!


  • Running on CPU Upgrade
    6.26k
    6.26k

    MTEB Leaderboard

    🥇

    Embedding Leaderboard


  • Running on CPU Upgrade
    13.5k
    13.5k

    Open LLM Leaderboard

    🏆

    Track, rank and evaluate open LLMs and chatbots


  • Running
    4.59k
    4.59k

    LMArena Leaderboard

    🏆

    Display LMArena Leaderboard


  • Running on CPU Upgrade
    74
    74

    La Leaderboard

    🌸

    Evaluate open LLMs in the languages of LATAM and Spain.


  • Running
    106
    106

    Judge Arena

    💻

    Vote on AI responses to rank models


  • Running
    552
    552

    LLM-Perf Leaderboard

    🏆

    Explore hardware performance for LLMs


  • Running
    168
    168

    Vidore Leaderboard

    🥇

    Explore visual document retrieval benchmark results


  • Running on CPU Upgrade
    862
    862

    Open VLM Leaderboard

    🌎

    VLMEvalKit Evaluation Results Collection


  • Running
    85
    85

    SEED-Bench Leaderboard

    🏆


  • Running
    23
    23

    MM-UPD Leaderboard

    🥇

    Submit and evaluate model results for the MM-AAD leaderboard


  • Running
    22
    22

    MMBench Leaderboard

    🚀

    View and filter MMBench leaderboard data

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs