---
base_model:
- sometimesanotion/Lamarck-14B-v0.7
- mergekit-community/mergekit-sce-vjeombg
- CultriX/Qwen2.5-14B-Hyperionv5
- allura-org/TQ2.5-14B-Neon-v1
- oxyapi/oxy-1-small
- sometimesanotion/Lamarck-14B-v0.7-Fusion
- CultriX/Qwen2.5-14B-ReasoningMerge
- spacematt/Qwen2.5-Agile-Coder-14B-Instruct
- netease-youdao/Confucius-o1-14B
- arcee-ai/SuperNova-Medius
- v000000/Qwen2.5-14B-Gutenberg-Instruct-Slerpeno
- agentica-org/DeepCoder-14B-Preview
library_name: transformers
tags:
- mergekit
- merge
license: apache-2.0
---
# merge

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details
### Merge Method

This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [spacematt/Qwen2.5-Agile-Coder-14B-Instruct](https://huggingface.co/spacematt/Qwen2.5-Agile-Coder-14B-Instruct) as a base.

### Models Merged

The following models were included in the merge:
* [sometimesanotion/Lamarck-14B-v0.7](https://huggingface.co/sometimesanotion/Lamarck-14B-v0.7)
* [mergekit-community/mergekit-sce-vjeombg](https://huggingface.co/mergekit-community/mergekit-sce-vjeombg)
* [CultriX/Qwen2.5-14B-Hyperionv5](https://huggingface.co/CultriX/Qwen2.5-14B-Hyperionv5)
* [allura-org/TQ2.5-14B-Neon-v1](https://huggingface.co/allura-org/TQ2.5-14B-Neon-v1)
* [oxyapi/oxy-1-small](https://huggingface.co/oxyapi/oxy-1-small)
* [sometimesanotion/Lamarck-14B-v0.7-Fusion](https://huggingface.co/sometimesanotion/Lamarck-14B-v0.7-Fusion)
* [CultriX/Qwen2.5-14B-ReasoningMerge](https://huggingface.co/CultriX/Qwen2.5-14B-ReasoningMerge)
* [netease-youdao/Confucius-o1-14B](https://huggingface.co/netease-youdao/Confucius-o1-14B)
* [arcee-ai/SuperNova-Medius](https://huggingface.co/arcee-ai/SuperNova-Medius)
* [v000000/Qwen2.5-14B-Gutenberg-Instruct-Slerpeno](https://huggingface.co/v000000/Qwen2.5-14B-Gutenberg-Instruct-Slerpeno)
* [agentica-org/DeepCoder-14B-Preview](https://huggingface.co/agentica-org/DeepCoder-14B-Preview)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
models:
  - model: allura-org/TQ2.5-14B-Neon-v1
  - model: v000000/Qwen2.5-14B-Gutenberg-Instruct-Slerpeno
  - model: CultriX/Qwen2.5-14B-ReasoningMerge
  - model: sometimesanotion/Lamarck-14B-v0.7-Fusion
  - model: arcee-ai/SuperNova-Medius
  - model: netease-youdao/Confucius-o1-14B
  - model: mergekit-community/mergekit-sce-vjeombg
  - model: oxyapi/oxy-1-small
  - model: CultriX/Qwen2.5-14B-Hyperionv5
  - model: sometimesanotion/Lamarck-14B-v0.7
  - model: agentica-org/DeepCoder-14B-Preview
  - model: spacematt/Qwen2.5-Agile-Coder-14B-Instruct
merge_method: model_stock
base_model: spacematt/Qwen2.5-Agile-Coder-14B-Instruct
normalize: true
int8_mask: true
dtype: bfloat16
```
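
For readers unfamiliar with the Model Stock method named under Merge Method, the following is a minimal sketch of the interpolation rule from the linked paper, applied to a single weight tensor. It is an illustration only and not mergekit's actual implementation: the function name `model_stock_layer`, the per-tensor treatment, and the use of the mean pairwise cosine are assumptions made for this example.

```python
import numpy as np


def model_stock_layer(base, finetuned):
    """Hypothetical helper: merge one weight tensor with the Model Stock rule.

    base:      np.ndarray, the base model's weights for this tensor
    finetuned: list of same-shaped np.ndarray, one per fine-tuned model
    Returns (merged_weights, t), where t is the interpolation ratio.
    """
    n = len(finetuned)
    if n < 2:
        raise ValueError("need at least two fine-tuned models to estimate the angle")

    # Task vectors: how far each fine-tuned model moved away from the base.
    deltas = [(w - base).ravel() for w in finetuned]

    # Mean pairwise cosine similarity between task vectors (an assumption of
    # this sketch; zero-norm pairs are treated as orthogonal).
    cosines = []
    for i in range(n):
        for j in range(i + 1, n):
            denom = np.linalg.norm(deltas[i]) * np.linalg.norm(deltas[j])
            cosines.append(float(deltas[i] @ deltas[j]) / denom if denom > 0 else 0.0)
    cos_theta = float(np.mean(cosines))

    # Interpolation ratio from the paper: t = N cos(theta) / (1 + (N - 1) cos(theta)).
    t = n * cos_theta / (1 + (n - 1) * cos_theta)

    # Move from the base toward the average of the fine-tuned weights by t.
    w_avg = np.mean(finetuned, axis=0)
    return t * w_avg + (1 - t) * base, t
```

Under this rule, task vectors that point the same way give `t` close to 1, so the result is close to the plain average of the fine-tuned weights. Mutually orthogonal task vectors give `t` close to 0, so the result stays near the base model.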