Walmart-the-bag
/

zephyr-quiklang-3b

Text Generation

feature-extraction

Model card Files Files and versions

zephyr-quiklang-3b / README.md

Walmart-the-bag's picture

Walmart-the-bag

Update README.md

3c8784e verified 14 days ago

|

history blame contribute delete

1.21 kB

	---
	license: other
	pipeline_tag: text-generation
	tags:
	- causal_lm
	datasets:
	- teknium/openhermes
	- unalignment/toxic-dpo-v0.1
	base_model: stabilityai/stablelm-zephyr-3b
	inference: false
	metrics:
	- bleu
	- rouge
	---
	<div style="border-left: 4px solid #f1c40f; background-color: #fffbea; color: #000000; padding: 12px; margin: 10px 0;">
	<strong style="color: #000000">Removal:</strong> This model will soon be fully removed and no longer available for download. (07/10/2025)
	</div>

	# Model Description
	This is a finetune of [StableLM-Zephyr-3B](https://huggingface.co/stabilityai/stablelm-zephyr-3b) with 2 datasets, toxic-dpo and openhermes with 10000 samples. This was finetuned at 1024 context, for 4k version, go here: https://huggingface.co/Walmart-the-bag/zephyr-quiklang-3b-4K.

	# Training Parameters
	- 1xA6000-48GB
	- batch_size: 6
	- learning_rate: 5e-5

	# Datasets:
	- unalignment/toxic-dpo-v0.1
	- teknium/openhermes

	# Metrics/Basic Eval:
	"predict_bleu-4": 31.594154999999997,
	"predict_rouge-1": 44.092935,
	"predict_rouge-2": 22.276081000000005,
	"predict_rouge-l": 34.506909,
	"predict_runtime": 121.7549,
	"predict_samples_per_second": 0.821,
	"predict_steps_per_second": 0.107