hamishivi commited on
Commit
e2d48e4
·
verified ·
1 Parent(s): 1274943

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +28 -0
README.md ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - hamishivi/gsm8k-symbolic
5
+ language:
6
+ - en
7
+ base_model:
8
+ - hamishivi/tess2_base
9
+ ---
10
+ # TESS 2 - A Generalist Instruction Tuned Diffusion LM
11
+
12
+ This model is the TESS 2 model trained on GSM8k symbolic data found [here](https://huggingface.co/datasets/hamishivi/gsm8k-symbolic), adapted from [here](https://github.com/HKUNLP/diffusion-of-thoughts). This model is a simplex-based diffusion model adapted from Mistral v0.1 7B, further trained on Dolma 1.7 and Tulu 2 SFT data.
13
+ For more details, please check out our paper [TESS-2: A Large-Scale, Generalist Diffusion Language Model](https://todo).
14
+
15
+ This model will only work with our custom codebase found [here](https://github.com/armancohan/simplex-diffusion) -- please go there to see details on how to run training and inference.
16
+
17
+
18
+ ## Using this model
19
+
20
+ To run this model, first clone https://github.com/armancohan/simplex-diffusion.
21
+
22
+ Then, after creating a python environment with the correct packages, you can run inference via a ui with:
23
+ ```sh
24
+ ./shell_scripts/run_interactive_demo.sh hamishivi/tess2
25
+ ```
26
+
27
+ This allows you to directly interact with the model, and shows the diffusion generation process.
28
+ For training or other evaluations, please see our main repository.