A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning Paper • 2507.08267 • Published Jul 11 • 10 • 2
A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning Paper • 2507.08267 • Published Jul 11 • 10
view article Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • Feb 11 • 58
view article Article Hugging Face x LangChain : A new partner package in LangChain May 14, 2024 • 153
view article Article Releasing Common Corpus: the largest public domain dataset for training LLMs By Pclanglais • Mar 20, 2024 • 25