Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Depending on the CPU and/or NVMe memory available, you can offload both the optimizer states and parameters, just one of them, or none.