GGUF
unsloth
trl
sft
conversational
ertghiu256 commited on
Commit
47129b5
·
verified ·
1 Parent(s): 7003482

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +21 -0
README.md ADDED
@@ -0,0 +1,21 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - unsloth
5
+ - trl
6
+ - sft
7
+ datasets:
8
+ - nvidia/OpenCodeReasoning
9
+ - nvidia/OpenMathReasoning
10
+ base_model:
11
+ - Qwen/Qwen3-14B
12
+ ---
13
+
14
+ Qwen 3 14b a great medium sized model. This qwen3 model is trained on 20.000 samples taken from Nvidia/OpenCodeReasoning and Nvidia/OpenMathReasoning.
15
+
16
+ ## Model use cases:
17
+ - Coding (python and c++)
18
+ - Solving math questions
19
+ - General reasoning
20
+
21
+ **Use thinking mode using "/think" for the best results**