"num_key_value_heads": The number of key value heads that should be used to implement Grouped Query Attention (GQA).