DeepSpeed
DeepSpeed, powered by the Zero Redundancy Optimizer (ZeRO), is an optimization library for training very large models and fitting them onto a GPU.
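As a rough illustration of how ZeRO is typically switched on, the sketch below builds a minimal DeepSpeed configuration as a Python dictionary and serializes it to JSON. The specific values (batch size, ZeRO stage 2, fp16) are assumptions chosen for the example, not settings taken from this document, and a real setup will likely need more fields.

```python
import json

# Illustrative DeepSpeed configuration (assumed values, not recommendations).
# "zero_optimization.stage" selects the ZeRO stage; stage 2 partitions
# optimizer states and gradients across data-parallel processes.
ds_config = {
    "train_micro_batch_size_per_gpu": 8,   # assumed per-GPU batch size
    "gradient_accumulation_steps": 1,      # assumed: no accumulation
    "fp16": {"enabled": True},             # assumed: mixed-precision training
    "zero_optimization": {
        "stage": 2,
        "overlap_comm": True,              # overlap communication with compute
        "contiguous_gradients": True,      # reduce memory fragmentation
    },
}

# Serialize to JSON, the format DeepSpeed config files use.
ds_config_json = json.dumps(ds_config, indent=2)
print(ds_config_json)
```

The resulting JSON could be saved to a file (for example a hypothetical `ds_config.json`) and handed to the training setup; in Transformers, the `deepspeed` argument of `TrainingArguments` accepts either such a file path or the dictionary itself.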