lmstudio-community/Mistral-Small-3.1-24B-Instruct-2503-GGUF Text Generation • 24B • Updated May 2 • 727 • 37
view article Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth By mlabonne • Jul 29, 2024 • 354
MaziyarPanahi/Meta-Llama-3.1-70B-Instruct-GGUF Text Generation • 71B • Updated Jul 29, 2024 • 161k • 39
Comparing DPO with IPO and KTO Collection A collection of chat models to explore the differences between three alignment techniques: DPO, IPO, and KTO. • 56 items • Updated Jan 8 • 32