Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
It isn't necessary to explicitly set this value if you only have 1 GPU because DeepSpeed deploys all GPUs it can see on a given node.