Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Benchmarks
To compare the speed, throughput, and latency of each quantization scheme, check the following benchmarks obtained from the optimum-benchmark library.