Qwen2.5-Smooth-Coder-14B-Instruct / README.md

Update README.md

4c5ae46 verified 3 months ago

4.7 kB

	---
	base_model:
	- allknowingroger/Qwenslerp4-14B
	- djuna/Q2.5-Veltha-14B-0.5
	- jpacifico/Chocolatine-2-14B-Instruct-v2.0b3
	- JungZoona/T3Q-qwen2.5-14b-v1.0-e3
	- deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
	- Triangle104/Herodotos-14B
	- arcee-ai/Virtuoso-Small-v2
	- suayptalha/Lamarckvergence-14B
	- allknowingroger/QwenStock3-14B
	- tanliboy/lambda-qwen2.5-14b-dpo-test
	- CultriX/Qwen2.5-14B-Verged
	- prithivMLmods/Galactic-Qwen-14B-Exp2
	- sometimesanotion/Qwen2.5-14B-Vimarckoso-v3
	- deepcogito/cogito-v1-preview-qwen-14B
	- mergekit-community/VirtuosoSmall-InstructModelStock
	- spacematt/Qwen2.5-Casual-Coder-14B-Instruct
	- sometimesanotion/Lamarck-14B-v0.7
	- sometimesanotion/Qwenvergence-14B-v13-Prose-DS
	- YOYO-AI/Qwen2.5-14B-YOYO-V5
	- CultriX/Qwen2.5-14B-Wernicke
	- CultriX/Qwen2.5-14B-Ultimav2
	- allura-org/TQ2.5-14B-Aletheia-v1
	- wanlige/li-14b-v0.4
	library_name: transformers
	tags:
	- mergekit
	- merge
	license: apache-2.0
	---
	# merge

	This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

	## Merge Details
	### Merge Method

	This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [spacematt/Qwen2.5-Casual-Coder-14B-Instruct](https://huggingface.co/spacematt/Qwen2.5-Casual-Coder-14B-Instruct) as a base.

	### Models Merged

	The following models were included in the merge:
	* [allknowingroger/Qwenslerp4-14B](https://huggingface.co/allknowingroger/Qwenslerp4-14B)
	* [djuna/Q2.5-Veltha-14B-0.5](https://huggingface.co/djuna/Q2.5-Veltha-14B-0.5)
	* [jpacifico/Chocolatine-2-14B-Instruct-v2.0b3](https://huggingface.co/jpacifico/Chocolatine-2-14B-Instruct-v2.0b3)
	* [JungZoona/T3Q-qwen2.5-14b-v1.0-e3](https://huggingface.co/JungZoona/T3Q-qwen2.5-14b-v1.0-e3)
	* [deepseek-ai/DeepSeek-R1-Distill-Qwen-14B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B)
	* [Triangle104/Herodotos-14B](https://huggingface.co/Triangle104/Herodotos-14B)
	* [arcee-ai/Virtuoso-Small-v2](https://huggingface.co/arcee-ai/Virtuoso-Small-v2)
	* [suayptalha/Lamarckvergence-14B](https://huggingface.co/suayptalha/Lamarckvergence-14B)
	* [allknowingroger/QwenStock3-14B](https://huggingface.co/allknowingroger/QwenStock3-14B)
	* [tanliboy/lambda-qwen2.5-14b-dpo-test](https://huggingface.co/tanliboy/lambda-qwen2.5-14b-dpo-test)
	* [CultriX/Qwen2.5-14B-Verged](https://huggingface.co/CultriX/Qwen2.5-14B-Verged)
	* [prithivMLmods/Galactic-Qwen-14B-Exp2](https://huggingface.co/prithivMLmods/Galactic-Qwen-14B-Exp2)
	* [sometimesanotion/Qwen2.5-14B-Vimarckoso-v3](https://huggingface.co/sometimesanotion/Qwen2.5-14B-Vimarckoso-v3)
	* [deepcogito/cogito-v1-preview-qwen-14B](https://huggingface.co/deepcogito/cogito-v1-preview-qwen-14B)
	* [mergekit-community/VirtuosoSmall-InstructModelStock](https://huggingface.co/mergekit-community/VirtuosoSmall-InstructModelStock)
	* [sometimesanotion/Lamarck-14B-v0.7](https://huggingface.co/sometimesanotion/Lamarck-14B-v0.7)
	* [sometimesanotion/Qwenvergence-14B-v13-Prose-DS](https://huggingface.co/sometimesanotion/Qwenvergence-14B-v13-Prose-DS)
	* [YOYO-AI/Qwen2.5-14B-YOYO-V5](https://huggingface.co/YOYO-AI/Qwen2.5-14B-YOYO-V5)
	* [CultriX/Qwen2.5-14B-Wernicke](https://huggingface.co/CultriX/Qwen2.5-14B-Wernicke)
	* [CultriX/Qwen2.5-14B-Ultimav2](https://huggingface.co/CultriX/Qwen2.5-14B-Ultimav2)
	* [allura-org/TQ2.5-14B-Aletheia-v1](https://huggingface.co/allura-org/TQ2.5-14B-Aletheia-v1)
	* [wanlige/li-14b-v0.4](https://huggingface.co/wanlige/li-14b-v0.4)

	### Configuration

	The following YAML configuration was used to produce this model:

	```yaml
	models:
	- model: sometimesanotion/Qwen2.5-14B-Vimarckoso-v3
	- model: JungZoona/T3Q-qwen2.5-14b-v1.0-e3
	- model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
	- model: prithivMLmods/Galactic-Qwen-14B-Exp2
	- model: CultriX/Qwen2.5-14B-Wernicke
	- model: CultriX/Qwen2.5-14B-Ultimav2
	- model: allknowingroger/Qwenslerp4-14B
	- model: wanlige/li-14b-v0.4
	- model: allknowingroger/QwenStock3-14B
	- model: tanliboy/lambda-qwen2.5-14b-dpo-test
	- model: CultriX/Qwen2.5-14B-Verged
	- model: arcee-ai/Virtuoso-Small-v2
	- model: mergekit-community/VirtuosoSmall-InstructModelStock
	- model: sometimesanotion/Lamarck-14B-v0.7
	- model: suayptalha/Lamarckvergence-14B
	- model: allura-org/TQ2.5-14B-Aletheia-v1
	- model: YOYO-AI/Qwen2.5-14B-YOYO-V5
	- model: deepcogito/cogito-v1-preview-qwen-14B
	- model: sometimesanotion/Qwenvergence-14B-v13-Prose-DS
	- model: Triangle104/Herodotos-14B
	- model: jpacifico/Chocolatine-2-14B-Instruct-v2.0b3
	- model: djuna/Q2.5-Veltha-14B-0.5
	merge_method: model_stock
	base_model: spacematt/Qwen2.5-Casual-Coder-14B-Instruct
	normalize: true
	int8_mask: true
	dtype: bfloat16
	```