---
library_name: transformers
tags:
- unsloth
- trl
- sft
license: mit
datasets:
- FreedomIntelligence/medical-o1-reasoning-SFT
language:
- en
base_model:
- deepseek-ai/DeepSeek-R1-Distill-Llama-8B
pipeline_tag: question-answering
---

# Model Card for DeepSeek-R1-Distill-Llama-8B-Medical-Expert
A medical chain-of-thought (CoT) reasoning model based on DeepSeek-R1-Distill-Llama-8B.


## Model Details

### Model Description

DeepSeek-R1-Distill-Llama-8B fine-tuned on the FreedomIntelligence/medical-o1-reasoning-SFT dataset.

- **Developed by:** Vignesh
- **Model type:** Causal language model (Llama architecture, distilled from DeepSeek-R1)
- **Language(s) (NLP):** English
- **License:** MIT
- **Finetuned from model:** deepseek-ai/DeepSeek-R1-Distill-Llama-8B
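
A minimal inference sketch with `transformers` might look like the following. The `MODEL_ID` value is a hypothetical placeholder (substitute this model's actual Hub path), and the prompt template, including the opening `<think>` tag, is an assumption for illustration rather than the template used during fine-tuning.

```python
# Hypothetical placeholder: replace with this model's actual Hub repository path.
MODEL_ID = "your-username/your-model-id"

# Assumed prompt layout; the template used during fine-tuning may differ.
PROMPT_TEMPLATE = (
    "Below is a medical question. Think through it step by step, "
    "then give a final answer.\n\n"
    "### Question:\n{question}\n\n"
    "### Response:\n<think>\n"
)


def build_prompt(question: str) -> str:
    """Insert the (whitespace-trimmed) question into the assumed template."""
    return PROMPT_TEMPLATE.format(question=question.strip())


def generate_answer(question: str, model_id: str = MODEL_ID,
                    max_new_tokens: int = 512) -> str:
    """Load the model and generate a completion for one question."""
    # Imported lazily so build_prompt works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # device_map="auto" requires the accelerate package.
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype="auto", device_map="auto"
    )
    inputs = tokenizer(build_prompt(question), return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Keep only the newly generated tokens, dropping the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate_answer("What are common causes of iron-deficiency anemia?")` downloads the weights on first use, so an 8B model needs a GPU with sufficient memory or a quantized variant.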

Quantized GGUF versions are available at: https://huggingface.co/mradermacher/DeepSeek-R1-Distill-Llama-8B-Medical-Expert-GGUF

### Training Data
The model was trained with supervised fine-tuning (SFT) on the FreedomIntelligence/medical-o1-reasoning-SFT dataset.
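
As a rough illustration of how records from this dataset could be turned into SFT training text, here is a sketch. It assumes the `en` configuration and the column names `Question`, `Complex_CoT`, and `Response`; check these against the dataset card. The text layout and the `format_example` and `load_formatted_dataset` helpers are hypothetical and not necessarily what was used to train this model.

```python
def format_example(example: dict, eos_token: str = "") -> str:
    """Join question, reasoning chain, and answer into one training string.

    Column names are assumed from the dataset card; the layout is illustrative.
    """
    return (
        f"### Question:\n{example['Question']}\n\n"
        f"### Response:\n<think>\n{example['Complex_CoT']}\n</think>\n"
        f"{example['Response']}{eos_token}"
    )


def load_formatted_dataset(split: str = "train[:100]", eos_token: str = ""):
    """Load a slice of the dataset and add a 'text' column for SFT."""
    # Imported lazily so format_example works without datasets installed.
    from datasets import load_dataset

    ds = load_dataset(
        "FreedomIntelligence/medical-o1-reasoning-SFT", "en", split=split
    )
    return ds.map(lambda ex: {"text": format_example(ex, eos_token)})
```

Passing the tokenizer's `eos_token` to `format_example` marks the end of each sample, which helps the fine-tuned model learn where to stop generating.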