This is a Mistral model with ChatML tokens added to the tokenizer.

merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the TIES merge method using shisa-ai/shisa-v2-mistral-nemo-12b as a base.

Models Merged

The following models were included in the merge:

Elizezen/Himeyuri-v0.1-12B
inflatebot/MN-12B-Mag-Mell-R1

Configuration

The following YAML configuration was used to produce this model:

base_model: shisa-ai/shisa-v2-mistral-nemo-12b
models:
  - model: Elizezen/Himeyuri-v0.1-12B
    parameters:
      weight: [0, 0.25, 0.5, 0.75, 1]
  - model: inflatebot/MN-12B-Mag-Mell-R1
    parameters:
      weight: [0.25, 0.3, 0.5, 0.3, 0.25]
merge_method: ties
dtype: bfloat16
parameters:
  normalize: true
  density: 0.5
tokenizer:
  source: union

Model tree for yamatazen/StarrySky-12B