---
license: other
pipeline_tag: text-generation
tags:
- causal_lm
datasets:
- teknium/openhermes
- unalignment/toxic-dpo-v0.1
base_model: stabilityai/stablelm-zephyr-3b
inference: false
metrics:
- bleu
- rouge
---
<div style="border-left: 4px solid #f1c40f; background-color: #fffbea; color: #000000; padding: 12px; margin: 10px 0;">
<strong style="color: #000000">Removal:</strong> This model will soon be fully removed and no longer available for download. (07/10/2025)
</div>
# Model Description
This is a fine-tune of [StableLM-Zephyr-3B](https://huggingface.co/stabilityai/stablelm-zephyr-3b) on two datasets, toxic-dpo and openhermes, using 10,000 samples. It was fine-tuned at a context length of 1024 tokens. For the 4K-context version, see https://huggingface.co/Walmart-the-bag/zephyr-quiklang-3b-4K.
# Training Parameters
- 1xA6000-48GB
- batch_size: 6
- learning_rate: 5e-5
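
For a rough sense of scale, the parameters above can be collected into a small config object. This is only a sketch, not the training script that was actually used: `FinetuneConfig`, `num_samples`, and `grad_accum_steps` are hypothetical names, and it assumes the 10,000 samples are the total training set and that no gradient accumulation was applied (neither is stated in this card).

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class FinetuneConfig:
    """Hypothetical container for the hyperparameters listed in this card."""
    batch_size: int = 6           # from the parameter list
    learning_rate: float = 5e-5   # from the parameter list
    context_length: int = 1024    # from the model description
    num_samples: int = 10_000     # assumption: total number of training samples
    grad_accum_steps: int = 1     # assumption: not stated in this card

    def steps_per_epoch(self) -> int:
        # Optimizer steps needed to see every sample once.
        effective_batch = self.batch_size * self.grad_accum_steps
        return math.ceil(self.num_samples / effective_batch)


config = FinetuneConfig()
print(config.steps_per_epoch())
```

Under those assumptions, one pass over the data takes about 1,667 optimizer steps.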
# Datasets:
- unalignment/toxic-dpo-v0.1
- teknium/openhermes
# Metrics/Basic Eval:
- `predict_bleu-4`: 31.594155
- `predict_rouge-1`: 44.092935
- `predict_rouge-2`: 22.276081
- `predict_rouge-l`: 34.506909
- `predict_runtime`: 121.7549
- `predict_samples_per_second`: 0.821
- `predict_steps_per_second`: 0.107
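
The size of the evaluation set is not stated here, but it can be estimated from the runtime and throughput figures above. The following is a back-of-the-envelope sketch; the variable names are illustrative, and the rounding assumes the reported rates are simple averages over the whole prediction run.

```python
import math

# Values copied from the metrics above.
predict_runtime = 121.7549    # seconds
samples_per_second = 0.821
steps_per_second = 0.107

# samples = rate * time; the reported rates are rounded, so round the result.
est_samples = round(predict_runtime * samples_per_second)
est_steps = round(predict_runtime * steps_per_second)

# Smallest batch size that covers est_samples in est_steps batches.
est_batch_size = math.ceil(est_samples / est_steps)

print(est_samples, est_steps, est_batch_size)
```

This suggests the scores were computed on roughly 100 samples in about 13 batches (a batch size of around 8), so they are best read as a quick sanity check rather than a full benchmark.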