merge
This is a merge of pre-trained language models created using mergekit.
Merge Details
Merge Method
This model was merged using the Multiplicative Model Merger merge method.
Models Merged
The following models were included in the merge:
Configuration
The following YAML configuration was used to produce this model:
merge_method: mult
models:
- model: failspy/Llama-3-8B-Instruct-MopeyMule
- model: NousResearch/DeepHermes-3-Llama-3-8B-Preview
parameters:
scale: 3 # adjust as needed
normalize: true
Open LLM Leaderboard Evaluation Results
Detailed results can be found here! Summarized results can be found here!
Metric | Value (%) |
---|---|
Average | 4.77 |
IFEval (0-Shot) | 23.18 |
BBH (3-Shot) | 1.21 |
MATH Lvl 5 (4-Shot) | 0.00 |
GPQA (0-shot) | 0.45 |
MuSR (0-shot) | 2.73 |
MMLU-PRO (5-shot) | 1.04 |
- Downloads last month
- 1
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for stupidity-ai/Llama-3-8B-Instruct-MultiMoose
Merge model
this model
Evaluation results
- averaged accuracy on IFEval (0-Shot)Open LLM Leaderboard23.180
- normalized accuracy on BBH (3-Shot)test set Open LLM Leaderboard1.210
- exact match on MATH Lvl 5 (4-Shot)test set Open LLM Leaderboard0.000
- acc_norm on GPQA (0-shot)Open LLM Leaderboard0.450
- acc_norm on MuSR (0-shot)Open LLM Leaderboard2.730
- accuracy on MMLU-PRO (5-shot)test set Open LLM Leaderboard1.040