Update README.md
README.md (changed)
@@ -14,6 +14,8 @@ datasets:
- open-r1/Mixture-of-Thoughts
---

+
+
# **Theta-Crucis-0.6B-Turbo1**

> **Theta-Crucis-0.6B-Turbo1** is a compact, high-performance model designed for **code generation**, **technical reasoning**, and **structured output tasks**. Fine-tuned from **Qwen3-0.6B** using the **Mixture of Thoughts (MoT)** dataset with an emphasis on **code expert clusters**, this model delivers agile and accurate coding assistance in low-resource environments. At only **0.6B parameters**, it offers strong fluency in programming, structured syntax, and technical language generation.
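The card's full usage snippet (the `print(response)` visible in the second hunk's context) is not included in this diff. As background, Qwen-family models such as the Qwen3-0.6B base use a ChatML-style chat template; the sketch below shows how such a prompt is assembled by hand. The `build_chatml_prompt` helper and the exact template details are illustrative assumptions, not the card's own code — in practice the tokenizer's `apply_chat_template` handles this.

```python
def build_chatml_prompt(messages):
    # ChatML-style template (illustrative): each turn is wrapped in
    # <|im_start|>role ... <|im_end|> markers, and the prompt ends with
    # an opened assistant turn to cue the model's reply.
    parts = []
    for m in messages:
        parts.append(f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>")
    parts.append("<|im_start|>assistant\n")
    return "\n".join(parts)

prompt = build_chatml_prompt([
    {"role": "system", "content": "You are a helpful coding assistant."},
    {"role": "user", "content": "Write a Python function that reverses a string."},
])
print(prompt)
```

With the actual tokenizer, `tokenizer.apply_chat_template(messages, add_generation_prompt=True)` produces the model's canonical version of this string.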
@@ -108,4 +110,4 @@ print(response)
1. [Qwen2.5 Technical Report (2024)](https://arxiv.org/pdf/2412.15115)
2. [YaRN: Efficient Context Window Extension of Large Language Models](https://arxiv.org/pdf/2309.00071)
-
+3. [open-r1/Mixture-of-Thoughts](https://huggingface.co/datasets/open-r1/Mixture-of-Thoughts)