Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Quantization
Quantization techniques reduces memory and computational costs by representing weights and activations with lower-precision data types like 8-bit integers (int8).