"num_key_value_heads": The number of key-value heads used to implement Grouped Query Attention (GQA).
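To make the grouping concrete, here is a minimal sketch of how query heads could be assigned to shared key-value heads. The helper names (`attention_variant`, `kv_head_for_query_head`) are hypothetical and not part of any library, and the sketch assumes the common convention that consecutive query heads share one key-value head and that the number of attention heads is divisible by "num_key_value_heads".

```python
def attention_variant(num_attention_heads: int, num_key_value_heads: int) -> str:
    """Name the attention scheme implied by the two head counts (illustrative only)."""
    if num_attention_heads % num_key_value_heads != 0:
        raise ValueError("num_attention_heads must be divisible by num_key_value_heads")
    if num_key_value_heads == num_attention_heads:
        return "MHA"  # every query head has its own key-value head
    if num_key_value_heads == 1:
        return "MQA"  # all query heads share a single key-value head
    return "GQA"      # query heads are split into groups that share a key-value head


def kv_head_for_query_head(query_head: int,
                           num_attention_heads: int,
                           num_key_value_heads: int) -> int:
    """Index of the key-value head a query head reads from.

    Assumption: consecutive query heads form a group.
    """
    group_size = num_attention_heads // num_key_value_heads
    return query_head // group_size


# Example: 8 query heads sharing 2 key-value heads, so each group has 4 query heads.
mapping = [kv_head_for_query_head(h, 8, 2) for h in range(8)]
print(attention_variant(8, 2), mapping)
```

With fewer key-value heads than query heads, the key-value cache shrinks by the factor `num_attention_heads / num_key_value_heads`, which is the usual motivation for choosing GQA.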