DavidAU / Qwen3-53B-A3B-2507-THINKING-TOTAL-RECALL-v2-MASTER-CODER

DavidAU committed 28 days ago

Commit 52e603e · verified · 1 Parent(s): 2648102

Update README.md

Files changed (1)

README.md +3 -0

@@ -73,6 +73,7 @@ For coding, programming set expert to:
 - 10 for moderate work.
 - 12-16 for complex work, long projects, complex coding.
 - And for longer context, and/or multi-turn -> increase experts by 1-2 to help with longer context/multi turn understanding.
+- Suggest min context 8k-16k for thinking/output.
 Recommended settings - general:
 - Rep pen 1.05 to 1.1 ; however rep pen of 1 will work well (may need to raise it for lower quants/fewer activated experts)
@@ -80,12 +81,14 @@ Recommended settings - general:
 - Topk of 20, 40 or 100
 - Topp of .95 / min p of .05
 - System prompt (optional) to focus the model better.
+- Suggest min context 8k-16k for thinking/output.
 Creative Use Cases:
 - Rep pen of 1.09 or higher, especially if using a lower quant / lower temps.
 - Temps of .8 to 2 suggested.
 - Also use rep pen of 1.1 or higher with very short prompts.
 - You can set active experts as low as "4" for creative use cases.
+- Suggest min context 8k-16k for thinking/output.
 - NOTE: The 20x/42B version may be better for creative use cases.
 This is the refined version -V1.4- from this project (see this repo for all settings, details, system prompts, example generations etc etc):
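
For reference, the settings touched by this diff can be gathered into one small helper. The following is only a sketch, not part of the README: the names `Profile`, `suggest`, and `to_sampler_args` are hypothetical, and each value is one pick from inside the ranges quoted above (for example 12 from "12-16", 40 from "20, 40 or 100", 8192 from "8k-16k"). The flag spellings follow llama.cpp's command-line options as an assumption about the runtime; other front ends name these settings differently. How the active expert count is applied also depends on the runtime, so it is returned as a plain number rather than rendered as a flag.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Profile:
    """One concrete choice from the ranges listed in the README (hypothetical type)."""
    experts: int
    repeat_penalty: float
    top_k: int
    top_p: float
    min_p: float
    ctx_size: int
    temperature: Optional[float] = None  # only the creative section gives a temp range


def suggest(use_case: str, long_context: bool = False) -> Profile:
    """Pick starting values for 'moderate', 'complex', or 'creative' use."""
    if use_case == "moderate":
        experts, rep_pen, temp = 10, 1.05, None   # "10 for moderate work"
    elif use_case == "complex":
        experts, rep_pen, temp = 12, 1.05, None   # low end of "12-16"
    elif use_case == "creative":
        experts, rep_pen, temp = 4, 1.1, 0.8      # "as low as 4", rep pen >= 1.09
    else:
        raise ValueError(f"unknown use case: {use_case!r}")

    # "increase experts by 1-2" appears under the coding settings,
    # so this sketch applies it to the coding profiles only.
    if long_context and use_case != "creative":
        experts += 1

    return Profile(
        experts=experts,
        repeat_penalty=rep_pen,
        top_k=40,
        top_p=0.95,
        min_p=0.05,
        # "Suggest min context 8k-16k": 8k by default, 16k for long sessions.
        ctx_size=16384 if long_context else 8192,
        temperature=temp,
    )


def to_sampler_args(p: Profile) -> List[str]:
    """Render the sampling settings as llama.cpp-style flags (assumed runtime)."""
    args = [
        "--ctx-size", str(p.ctx_size),
        "--repeat-penalty", str(p.repeat_penalty),
        "--top-k", str(p.top_k),
        "--top-p", str(p.top_p),
        "--min-p", str(p.min_p),
    ]
    if p.temperature is not None:
        args += ["--temp", str(p.temperature)]
    return args
```

Calling `suggest("complex", long_context=True)` gives 13 active experts and a 16384-token context; these are starting points to adjust within the ranges above, not tuned values.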