M3_smoothquant096 / recipe.yaml
HAissa's picture
Upload Qwen3ForCausalLM
df77321 verified
raw
history blame contribute delete
171 Bytes
default_stage:
default_modifiers:
SmoothQuantModifier: {smoothing_strength: 0.96}
GPTQModifier:
scheme: W4A8
targets: Linear
ignore: [lm_head]