Update README.md
README.md
CHANGED
@@ -14,6 +14,10 @@ A DL-worthy model, merge of Nemotron Abliterated and Llama 3.3 Instruct, with L3
 And it's quite smart and verbose, albeit quite a bit prudish.
 One of my favorites ATM.
 
+Note: if someone could reproduce the recipe with huihui's Llama 3.3 70B Instruct abliterated (the normal version, not the "finetuned" one) instead of Llama 3.3 70B Instruct, it would be worth a try.
+
+I don't have access to a mergekit rig at the moment, now that Arcee's space is down.
+
 ---
 # merge
 
@@ -44,7 +48,7 @@ models:
   - model: meta-llama/Llama-3.3-70B-Instruct
     parameters:
       weight: 1.0
-dtype: bfloat16
+dtype: bfloat16 # replace with float32 if possible
 out_dtype: bfloat16
 parameters:
   int8_mask: true
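
On the dtype suggestion in the second hunk: in mergekit, `dtype` is the precision the merge is computed in, while `out_dtype` is what the merged weights are saved as, so the suggested change would look roughly like this (output left at bf16 so the checkpoint stays the same size):

```yaml
# Sketch: do the merge arithmetic in fp32 if RAM and disk allow,
# but still write the merged checkpoint in bf16.
dtype: float32
out_dtype: bfloat16
parameters:
  int8_mask: true
```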