view article Article Judge Arena: Benchmarking LLMs as Evaluators By kaikaidai and 7 others • Nov 19, 2024 • 58
view article Article Vibe coding for data science: how to label a dataset with Kimi K2 By dvilasuero • 9 days ago • 19
view article Article An Introduction to AI Secure LLM Safety Leaderboard By danielz01 and 4 others • Jan 26, 2024 • 6
view article Article Gemma 3n fully available in the open-source ecosystem! By ariG23498 and 7 others • Jun 26 • 113
view article Article Teaching Data Literacy with Hugging Face's AI Sheets By ParulPandey • Jun 30 • 23
view article Article Featherless AI on Hugging Face Inference Providers 🔥 By sbrandeis and 5 others • Jun 12 • 45
view article Article Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs By davidberenstein1957 and 1 other • May 7 • 40
MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets Paper • 2403.03194 • Published Mar 5, 2024 • 15
view article Article NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets By mingyuliutw and 4 others • Mar 18 • 41
view article Article Hugging Face to sell open-source robots thanks to Pollen Robotics acquisition 🤖 By thomwolf and 2 others • Apr 14 • 48
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others • Mar 12 • 447
view article Article Welcome Llama 4 Maverick & Scout on Hugging Face! By burtenshaw and 6 others • Apr 5 • 146