Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Set the allgather_bucket_size and reduce_bucket_size values to 2e8 in the ZeRO-2 configuration file to get better performance on a single GPU.