Winmodel commited on
Commit
16680e6
·
verified ·
1 Parent(s): 0e57511

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +29 -3
README.md CHANGED
@@ -1,3 +1,29 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ license: apache-2.0
4
+ base_model:
5
+ - Qwen/Qwen2.5-0.5B-Instruct
6
+ tags:
7
+ - llama-factory
8
+ - full
9
+ - generated_from_trainer
10
+ model-index:
11
+ - name: QwenThinker0.5B
12
+ datasets:
13
+ - open-thoughts/open-thoughts-114k
14
+
15
+ # QwenThinker0.5B
16
+
17
+ This model is a fine-tuned version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) on the
18
+ [OpenThoughts-114k](https://huggingface.co/datasets/open-thoughts/OpenThoughts-114k) dataset.
19
+
20
+ The dataset is derived by distilling DeepSeek-R1 using the [data pipeline available on github](https://github.com/open-thoughts/open-thoughts).
21
+ More info about the dataset can be found on the dataset card at [OpenThoughts-114k dataset](https://huggingface.co/datasets/open-thoughts/open-thoughts-114k).
22
+
23
+ Trained with [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory)
24
+
25
+ ### Training hyperparameters
26
+ - 288 global batch size
27
+ - learning_rate: 1e-05
28
+ - num_epochs: 1.0
29
+ - learning_rate: 1e-05