---
base_model:
- GreenerPastures/Basically-Human-4B
- hardlyworking/Sugma4B
- Hastagaras/Qibil-4B-v0.1-RP
library_name: transformers
tags:
- mergekit
- merge
- not-for-all-audiences
license: apache-2.0
datasets:
- nothingiisreal/Reddit-Dirty-And-WritingPrompts
---
# MiniButler-4B
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
## Merge Details
### Merge Method
This model was merged using the [DELLA](https://arxiv.org/abs/2406.11617) merge method, with **Qwen3/4B/full/apollo_mini_writing/checkpoint-33758** as the base.
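For intuition, here is a simplified, hypothetical PyTorch sketch of the DELLA idea: each fine-tuned model's delta from the base is pruned with magnitude-dependent drop probabilities, survivors are rescaled, and the weighted sum is scaled by `lambda` before being added back to the base. This is illustrative only, not mergekit's implementation, and it omits the paper's sign-election step; all names are made up.
```python
import torch

def della_sketch(base, tuned, weights, density=0.7, epsilon=0.25, lam=1.05):
    """Illustrative only: magnitude-ranked drop, rescale, weighted fuse."""
    merged_delta = torch.zeros_like(base)
    for ft, w in zip(tuned, weights):
        delta = (ft - base).flatten()
        # Rank deltas by magnitude, normalized to [0, 1].
        ranks = delta.abs().argsort().argsort().float()
        ranks = ranks / max(ranks.numel() - 1, 1)
        # Keep probability spans [density - epsilon, density + epsilon];
        # larger-magnitude deltas are more likely to survive.
        keep_prob = (density - epsilon) + 2.0 * epsilon * ranks
        mask = torch.bernoulli(keep_prob)
        # Rescale survivors by 1 / p so the expected delta is unchanged.
        merged_delta += w * (delta * mask / keep_prob).reshape(base.shape)
    return base + lam * merged_delta

# Toy example: one weight matrix, three fine-tuned variants.
base = torch.randn(4, 4)
tuned = [base + 0.1 * torch.randn(4, 4) for _ in range(3)]
merged = della_sketch(base, tuned, weights=[0.3, 0.4, 0.3])
```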
### Models Merged
The following models were included in the merge:
* [GreenerPastures/Basically-Human-4B](https://huggingface.co/GreenerPastures/Basically-Human-4B)
* [hardlyworking/Sugma4B](https://huggingface.co/hardlyworking/Sugma4B)
* [Hastagaras/Qibil-4B-v0.1-RP](https://huggingface.co/Hastagaras/Qibil-4B-v0.1-RP)
### Configuration
The following YAML configuration was used to produce this model:
```yaml
models:
  - model: hardlyworking/Sugma4B
    parameters:
      density: 0.7
      weight: 0.3
      epsilon: 0.25
  - model: GreenerPastures/Basically-Human-4B
    parameters:
      density: 0.7
      weight: 0.4
      epsilon: 0.25
  - model: Hastagaras/Qibil-4B-v0.1-RP
    parameters:
      density: 0.7
      weight: 0.3
      epsilon: 0.25
merge_method: della
base_model: Qwen3/4B/full/apollo_mini_writing/checkpoint-33758
dtype: bfloat16
parameters:
  normalize: false
  int8_mask: false
  lambda: 1.05
```
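To reproduce the merge, save this configuration to a file and run mergekit's `mergekit-yaml` CLI on it; note that `base_model` here is a local checkpoint path, so reproduction requires access to that checkpoint. The result loads like any other Qwen3-architecture causal LM. Below is a minimal inference sketch with 🤗 Transformers, using a placeholder repo id (`your-username/MiniButler-4B` is hypothetical):
```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "your-username/MiniButler-4B"  # placeholder; substitute the real repo id
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.bfloat16,  # matches the merge's `dtype: bfloat16`
    device_map="auto",
)

messages = [{"role": "user", "content": "Introduce yourself in one sentence."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output_ids = model.generate(input_ids, max_new_tokens=64)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```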