merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the Multiplicative Model Merger merge method.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

merge_method: mult
models:
  - model: failspy/Llama-3-8B-Instruct-MopeyMule
  - model: NousResearch/DeepHermes-3-Llama-3-8B-Preview
parameters:
  scale: 3 # adjust as needed
  normalize: true

Open LLM Leaderboard Evaluation Results

Detailed results can be found here! Summarized results can be found here!

Metric	Value (%)
Average	4.77
IFEval (0-Shot)	23.18
BBH (3-Shot)	1.21
MATH Lvl 5 (4-Shot)	0.00
GPQA (0-shot)	0.45
MuSR (0-shot)	2.73
MMLU-PRO (5-shot)	1.04

Model tree for stupidity-ai/Llama-3-8B-Instruct-MultiMoose

Evaluation results

averaged accuracy on IFEval (0-Shot)
Open LLM Leaderboard

23.180
normalized accuracy on BBH (3-Shot)
test set Open LLM Leaderboard

1.210
exact match on MATH Lvl 5 (4-Shot)
test set Open LLM Leaderboard

0.000
acc_norm on GPQA (0-shot)
Open LLM Leaderboard

0.450
acc_norm on MuSR (0-shot)
Open LLM Leaderboard

2.730
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard

1.040

View on Papers With Code