Nanotron Research

community

AI & ML interests

Large-scale distributed AI model training, model parallelisation, low-level GPU acceleration, make GPUs go brrrrr

Organization Card

HF PRESS

The Ultra-Scale Playbook: Training LLMs on GPU Clusters

Essential reading for anyone scaling ML infrastructure

Knowledge of how to scale training efficiently to large GPU clusters has been closely held within a handful of big industry labs. With this book, we set out to lift the veil and release a comprehensive resource on distributed training.

AUTHORS
Nouamane Tazi, Ferdinand Mom, Haolun Zhao, Phuc Nguyen, Mohamed Mekkouri, Leandro Werra, Thomas Wolf

AFFILIATION
Hugging Face

PUBLISHED
Jul 30, 2025

This book PDF is accessible with a PRO subscription.

(*If you experience issues downloading the PDF with Chrome try restarting/updating or use a different browser)

The Nanotron team focuses on sharing open knowledge and developing open-source libraries for efficient distributed training of large-scale AI models.

Some of its contributions are:

the Nanotron library
the Picotron library
the Ultra-Scale Playbook, a comprehensive book covering the distributed training, parallelisation, and low-level techniques used to train models efficiently at the largest scales.

spaces 2

The Ultra-Scale Playbook

The ultimate guide to training LLMs on large GPU clusters

Predict Memory

Analyze and visualize memory usage from model configurations

models 14

nanotron/temp_for_pr_review

Updated Sep 24, 2024

nanotron/fp8_for_nanotron

Updated Sep 21, 2024

nanotron/llama3-8b-infini-attention

Updated Aug 5, 2024 • 3 • 3

nanotron/bench_cluster_epfl

Updated Jul 12, 2024

nanotron/bench_cluster

Updated Jul 6, 2024

nanotron/test

Updated Jul 6, 2024

nanotron/old_bench

Updated Jul 6, 2024 • 3

nanotron/minicpm-nanotron

Updated Apr 11, 2024 • 6

nanotron/doremi-llama-2.5b-optimized-weights

Updated Feb 22, 2024

nanotron/doremi-llama-2.5b-reference

Updated Feb 22, 2024

datasets 15

nanotron/book

Updated 1 day ago • 7 • 1

nanotron/ultrascale-playbook-data

Updated Mar 12 • 153 • 5

nanotron/picotron_bench

Viewer • Updated Dec 17, 2024 • 740 • 305 • 1

nanotron/minipile_100_samples

Viewer • Updated Jul 10, 2024 • 100 • 12 • 1

nanotron/llama3-1024-passkey-retrieval-eval

Viewer • Updated Jul 4, 2024 • 12.6k • 19

nanotron/llama3-16k-passkey-retrieval-finetuning

Viewer • Updated Jun 20, 2024 • 77.3k • 16

nanotron/llama3-16k-passkey-retrieval-eval

Viewer • Updated Jun 19, 2024 • 712 • 12

nanotron/llama3_needle_16k_finetuning

Viewer • Updated Jun 15, 2024 • 3.57k • 7

nanotron/needle_32k_eval_dataset

Viewer • Updated May 29, 2024 • 1.79k • 15 • 1

nanotron/needle_32k_finetuning_dataset

Viewer • Updated May 16, 2024 • 35.5k • 15
