pinned
Running
2.86k
The Ultra-Scale Playbook
🌌
The ultimate guide to training LLM on large GPU Clusters
Large scale distributed AI model training, model parallelisation, low-level GPU acceleration, make GPUs go brrrrr
(*If you experience issues downloading the PDF with Chrome try restarting/updating or use a different browser)
The Nanotron team focus on sharing open knowledge and developping open-source libraries for efficient distributed training of large-scale AI models.
Some of its contributions are: