--- library_name: transformers tags: - unsloth - trl - sft license: mit datasets: - FreedomIntelligence/medical-o1-reasoning-SFT language: - en base_model: - deepseek-ai/DeepSeek-R1-Distill-Llama-8B pipeline_tag: question-answering --- # Model Card for Model ID DeepSeek-R1-Distill-Llama-8B medical CoT model. ## Model Details ### Model Description DeepSeek-R1-Distill-Llama-8B model fine tuned on the FreedomIntelligence/medical-o1-reasoning-SFT. - **Developed by:** Vignesh - **Funded by [optional]:** [More Information Needed] - **Shared by [optional]:** [More Information Needed] - **Model type:** DeepSeek-R1 - **Language(s) (NLP):** En - **License:** MIT - **Finetuned from model [optional]:** deepseek-ai/DeepSeek-R1-Distill-Llama-8B find the quantized models here: https://huggingface.co/mradermacher/DeepSeek-R1-Distill-Llama-8B-Medical-Expert-GGUF ### Training Data FreedomIntelligence/medical-o1-reasoning-SFT