Update README.md
README.md
@@ -50,7 +50,7 @@ print(outputs[0]["generated_text"][-1])
 ## Model Card
 
 * Base model: [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct)
-* Continued Pretraining: 1B tokens on 4M Per-source upsampled
+* Continued Pretraining: 1B tokens on 4M Per-source upsampled Pretraining data.
 * Supervised fine-tuning (SFT): 1B tokens on open-source instruction datasets across general, mathematics, and code domains.
 * Maximum context window: 4M tokens
 