Update README.md
README.md
CHANGED
@@ -27,9 +27,6 @@ model-index:
     - type: acc_norm
       value: 51.88
       name: normalized accuracy
-    source:
-      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=wandb/gemma-2b-zephyr-sft
-      name: Open LLM Leaderboard
   - task:
       type: text-generation
       name: Text Generation
@@ -43,9 +40,6 @@ model-index:
     - type: acc_norm
       value: 72.63
       name: normalized accuracy
-    source:
-      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=wandb/gemma-2b-zephyr-sft
-      name: Open LLM Leaderboard
   - task:
       type: text-generation
       name: Text Generation
@@ -60,9 +54,6 @@ model-index:
     - type: acc
       value: 42.20
       name: accuracy
-    source:
-      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=wandb/gemma-2b-zephyr-sft
-      name: Open LLM Leaderboard
   - task:
       type: text-generation
       name: Text Generation
@@ -76,9 +67,6 @@ model-index:
     metrics:
     - type: mc2
      value: 41.96
-    source:
-      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=wandb/gemma-2b-zephyr-sft
-      name: Open LLM Leaderboard
   - task:
       type: text-generation
       name: Text Generation
@@ -93,9 +81,6 @@ model-index:
     - type: acc
       value: 63.85
       name: accuracy
-    source:
-      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=wandb/gemma-2b-zephyr-sft
-      name: Open LLM Leaderboard
   - task:
       type: text-generation
       name: Text Generation
@@ -110,12 +95,37 @@ model-index:
     - type: acc
       value: 20.09
       name: accuracy
-    source:
-      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=wandb/gemma-2b-zephyr-sft
-      name: Open LLM Leaderboard
 ---
 
-#
-
-<!-- Provide a quick summary of what the model is/does. -->
+# Model Card for Gemma 2B Zephyr SFT
+
+We fine-tuned [google/gemma-2b](https://huggingface.co/google/gemma-2b) on the [deita-10k-v0-sft](https://huggingface.co/datasets/HuggingFaceH4/deita-10k-v0-sft) dataset.
+We carefully selected the hyperparameters and masked the user tokens during training to achieve the best supervised fine-tuning performance.
+
+## Model description
+
+- **Model type:** A 2.5B parameter GPT-like model fine-tuned on a mix of publicly available, synthetic datasets.
+- **Language(s) (NLP):** Primarily English
+- **License:** Gemma Terms of Use
+- **Finetuned from model:** [google/gemma-2b](https://huggingface.co/google/gemma-2b)
+
+## License
+
+This model has the same license as the [original Gemma model collection](https://ai.google.dev/gemma/terms).
+
+## Open LLM Leaderboard Performance
+
+| Models                               | Avg.  | ARC-C | HellaSwag | MMLU  | TruthfulQA | Winogrande | GSM8k |
+|--------------------------------------|-------|-------|-----------|-------|------------|------------|-------|
+| google/gemma-2b                      | 46.37 | 48.38 | 71.77     | 41.77 | 33.08      | 34.42      | 16.91 |
+| wandb/gemma-2b-zephyr-sft            | 47.18 | 49.74 | 72.38     | 41.37 | 34.42      | 66.93      | 18.27 |
+| wandb/gemma-2b-zephyr-dpo            | 46.92 | 49.66 | 72.23     | 41.13 | 34.47      | 66.54      | 17.51 |
+| **Columbia-NLP/gemma-2b-zephyr-sft** | 48.75 | 51.88 | 72.63     | 42.20 | 41.96      | 63.85      | 20.09 |
+| Columbia-NLP/gemma-2b-zephyr-dpo     | 49.14 | 52.22 | 73.11     | 42.55 | 42.64      | 64.40      | 19.94 |
+
+## MT-Bench
+
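The card states that user tokens were masked during SFT, i.e. the cross-entropy loss is computed only over the assistant's reply. A minimal sketch of that masking idea, assuming a tokenizer that ships a chat template and the usual `-100` ignore-index convention; `build_sft_example` is an illustrative helper, not the authors' training code:

```python
# Sketch: mask user/prompt tokens so the SFT loss covers only the assistant reply.
# -100 is the ignore_index of PyTorch's CrossEntropyLoss, so masked positions
# contribute nothing to the loss.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("google/gemma-2b")

def build_sft_example(user_msg: str, assistant_msg: str) -> dict:
    # Token ids for the prompt alone (user turn plus the assistant header)...
    prompt_ids = tokenizer.apply_chat_template(
        [{"role": "user", "content": user_msg}],
        add_generation_prompt=True,
    )
    # ...and for the full conversation including the assistant reply.
    full_ids = tokenizer.apply_chat_template(
        [
            {"role": "user", "content": user_msg},
            {"role": "assistant", "content": assistant_msg},
        ]
    )
    labels = list(full_ids)
    # Mask every position that belongs to the user/prompt portion.
    labels[: len(prompt_ids)] = [-100] * len(prompt_ids)
    return {"input_ids": full_ids, "labels": labels}
```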
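For reference, querying the fine-tuned checkpoint follows the standard `transformers` pattern. A minimal sketch, assuming the `wandb/gemma-2b-zephyr-sft` checkpoint named in the leaderboard links above and greedy decoding; adjust `max_new_tokens` and sampling to taste:

```python
# Minimal inference sketch for the SFT checkpoint referenced in this card.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "wandb/gemma-2b-zephyr-sft"  # checkpoint named in the leaderboard links
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

messages = [{"role": "user", "content": "Explain supervised fine-tuning in one sentence."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)
output_ids = model.generate(input_ids, max_new_tokens=128, do_sample=False)
# Strip the prompt tokens before decoding so only the reply is printed.
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```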