-
197
MMLU-Pro Leaderboard
More advanced and challenging multi-task evaluation
-
43
Stick To Your Role! Leaderboard
Benchmarking LLMs on the stability of simulated populations
-
50
ZeroEval Leaderboard
Embed and use ZeroEval for evaluation tasks
-
24
Decentralized Arena Leaderboard
Display model leaderboard evaluations
Hristo Panev
hppdqdq
AI & ML interests
None yet
Recent Activity
liked a model 1 day ago
bartowski/google_gemma-3-27b-it-qat-GGUF
liked a model 1 day ago
bartowski/nvidia_Llama-3.1-8B-UltraLong-4M-Instruct-GGUF
liked a model 10 days ago
unsloth/Llama-4-Scout-17B-16E-Instruct-GGUF
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet