File size: 208 Bytes
5fa1a76
 
 
 
 
 
1
2
3
4
5
6
Or use multiple GPUs instead

First you need to install deepspeed: pip install deepspeed

Here we use a 3B "bigscience/T0_3B" model which needs about 15GB GPU RAM - so 1 largish or 2
small GPUs can handle it.