LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs Paper โข 2504.14655 โข Published 3 days ago โข 16
PaperBench: Evaluating AI's Ability to Replicate AI Research Paper โข 2504.01848 โข Published 21 days ago โข 36
view article Article LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone! Mar 7 โข 53
PaSa: An LLM Agent for Comprehensive Academic Paper Search Paper โข 2501.10120 โข Published Jan 17 โข 49
view article Article Docmatix - a huge dataset for Document Visual Question Answering Jul 18, 2024 โข 72