Running on CPU Upgrade 390 390 GAIA Leaderboard ๐ฆพ Submit models for evaluation and view leaderboard results
Running on CPU Upgrade 13k 13k Open LLM Leaderboard ๐ Track, rank and evaluate open LLMs and chatbots
Running 92 92 Nexus Function Calling Leaderboard ๐ Visualize model performance on function calling tasks