Article Introducing Wikipedia Monthly: Fresh, Clean Wikipedia Dumps for NLP & AI Research By omarkamali • Jul 19 • 3
Article Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset By sdiazlor • Feb 10 • 58
Article Fine-tuning MMS Adapter Models for Multi-Lingual ASR By patrickvonplaten • Jun 19, 2023 • 20
Open-Sourcing Highly Capable Foundation Models: An evaluation of risks, benefits, and alternative methods for pursuing open-source objectives Paper • 2311.09227 • Published Sep 29, 2023 • 8
Orca 2: Teaching Small Language Models How to Reason Paper • 2311.11045 • Published Nov 18, 2023 • 77
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model Paper • 2311.13231 • Published Nov 22, 2023 • 29
Decaf: Monocular Deformation Capture for Face and Hand Interactions Paper • 2309.16670 • Published Sep 28, 2023 • 5
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback Paper • 2309.00267 • Published Sep 1, 2023 • 51
Textbooks Are All You Need II: phi-1.5 technical report Paper • 2309.05463 • Published Sep 11, 2023 • 88
Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts Paper • 2309.07430 • Published Sep 14, 2023 • 27
Chain-of-Verification Reduces Hallucination in Large Language Models Paper • 2309.11495 • Published Sep 20, 2023 • 39
Boolformer: Symbolic Regression of Logic Functions with Transformers Paper • 2309.12207 • Published Sep 21, 2023 • 11
A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models Paper • 2309.11674 • Published Sep 20, 2023 • 32