When used with NVMe offload, sub_group_size determines when model states are moved in and out of CPU memory during the optimization step.
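As a rough illustration of where this setting lives, the sketch below builds a ZeRO stage 3 configuration dictionary with the optimizer states offloaded to NVMe. The helper name build_zero3_nvme_config, the path /local_nvme, and the value 1e9 are assumptions chosen for the example, not recommendations from the source; a suitable sub_group_size depends on the model and the available CPU memory.

```python
import json


def build_zero3_nvme_config(nvme_path: str, sub_group_size: int) -> dict:
    """Hypothetical helper: return a minimal DeepSpeed ZeRO-3 config dict.

    Only the keys relevant to NVMe offload and sub_group_size are set;
    a real config would also need batch size, optimizer, precision, etc.
    """
    return {
        "zero_optimization": {
            "stage": 3,
            # Offload optimizer states to an NVMe drive (path is a placeholder).
            "offload_optimizer": {
                "device": "nvme",
                "nvme_path": nvme_path,
                "pin_memory": True,
            },
            # Number of parameter elements handled per sub-group during the
            # optimizer step; with NVMe offload, states are moved through
            # CPU memory one sub-group at a time.
            "sub_group_size": sub_group_size,
        }
    }


# Assumed example values, not tuned for any particular model.
ds_config = build_zero3_nvme_config("/local_nvme", 1_000_000_000)
ds_config_json = json.dumps(ds_config, indent=2)
```

The resulting dictionary (or its JSON form written to a file) could then be passed wherever a DeepSpeed configuration is expected. If the optimizer step runs out of CPU memory, lowering sub_group_size is one adjustment to try.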