Prince-1 committed on
Commit 47f3a01 · verified · 1 Parent(s): 345ea92

Add files using upload-large-folder tool

.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ model.onnx.data filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,431 @@
1
+ ---
2
+ pipeline_tag: text-generation
3
+ inference: false
4
+ license: apache-2.0
5
+ library_name: onnxruntime_genai
6
+ tags:
7
+ - language
8
+ - granite-3.3
9
+ - onnxruntime_genai
10
+ base_model:
11
+ - Prince-1/Granite-3.3-8B-Instruct-Onnx
12
+ ---
13
+
14
+ # Granite-3.3-8B-Instruct
15
+
16
+ **Model Summary:**
17
+ Granite-3.3-8B-Instruct is an 8-billion-parameter, 128K-context-length language model fine-tuned for improved reasoning and instruction-following capabilities. Built on top of Granite-3.3-8B-Base, the model delivers significant gains on benchmarks measuring general performance, including AlpacaEval-2.0 and Arena-Hard, as well as improvements in mathematics, coding, and instruction following. It supports structured reasoning through \<think\>\<\/think\> and \<response\>\<\/response\> tags, providing clear separation between internal thoughts and final outputs. The model has been trained on a carefully balanced combination of permissively licensed data and curated synthetic tasks.
18
+
19
+ - **Developers:** Granite Team, IBM
20
+ - **Website**: [Granite Docs](https://www.ibm.com/granite/docs/)
21
+ - **Release Date**: April 16th, 2025
22
+ - **License:** [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0)
23
+
24
+ **Supported Languages:**
25
+ English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech, Italian, Korean, Dutch, and Chinese. However, users may fine-tune this Granite model for languages beyond these 12.
26
+
27
+ **Intended Use:**
28
+ This model is designed to handle general instruction-following tasks and can be integrated into AI assistants across various domains, including business applications.
29
+
30
+ **Capabilities**
31
+ * Thinking
32
+ * Summarization
33
+ * Text classification
34
+ * Text extraction
35
+ * Question-answering
36
+ * Retrieval Augmented Generation (RAG)
37
+ * Code related tasks
38
+ * Function-calling tasks
39
+ * Multilingual dialog use cases
40
+ <!-- * Fill-in-the-middle -->
41
+ * Long-context tasks including long document/meeting summarization, long document QA, etc.
42
+
43
+
44
+ **Generation:**
45
+ This is a simple example of how to use the Granite-3.3-8B-Instruct model.
46
+
47
+ Install the following libraries:
48
+
49
+ ```shell
50
+ pip install torch torchvision torchaudio
51
+ pip install accelerate
52
+ pip install transformers
53
+ ```
54
+ Then, copy the snippet from the section that is relevant for your use case.
55
+
56
+ ```python
57
+ from transformers import AutoModelForCausalLM, AutoTokenizer, set_seed
58
+ import torch
59
+
60
+ model_path="ibm-granite/granite-3.3-8b-instruct"
61
+ device="cuda"
62
+ model = AutoModelForCausalLM.from_pretrained(
63
+     model_path,
64
+     device_map=device,
65
+     torch_dtype=torch.bfloat16,
66
+ )
67
+ tokenizer = AutoTokenizer.from_pretrained(
68
+     model_path
69
+ )
70
+
71
+ conv = [{"role": "user", "content":"Redesign a common household item to make it more sustainable and user-friendly. Explain the changes and their benefits."}]
72
+
73
+ input_ids = tokenizer.apply_chat_template(conv, return_tensors="pt", thinking=True, return_dict=True, add_generation_prompt=True).to(device)
74
+
75
+ set_seed(42)
76
+ output = model.generate(
77
+     **input_ids,
78
+     max_new_tokens=8192,
79
+ )
80
+
81
+ prediction = tokenizer.decode(output[0, input_ids["input_ids"].shape[1]:], skip_special_tokens=True)
82
+ print(prediction)
83
+ ```
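+
+ Since this repository ships the ONNX Runtime GenAI export of the model (`model.onnx`, `model.onnx.data`, and `genai_config.json`), it can also be run with the `onnxruntime_genai` library listed in the metadata above instead of PyTorch. The snippet below is a minimal sketch rather than an official recipe: it assumes a recent `onnxruntime-genai` release (install with `pip install onnxruntime-genai`, or the CUDA variant) whose Python API provides `Model`, `Tokenizer`, `Generator`, and `append_tokens`, and it assumes the repository has been downloaded to a placeholder local folder `./Granite-3.3-8B-Instruct-Onnx`. The prompt string is a hand-rolled approximation of the chat template in `tokenizer_config.json`.
+
+ ```python
+ import onnxruntime_genai as og
+
+ # Placeholder path: a local download of this repository.
+ model_dir = "./Granite-3.3-8B-Instruct-Onnx"
+
+ model = og.Model(model_dir)        # reads genai_config.json and the ONNX weights
+ tokenizer = og.Tokenizer(model)
+ stream = tokenizer.create_stream()
+
+ # Minimal Granite chat format, built from the role tokens in added_tokens.json.
+ prompt = (
+     "<|start_of_role|>user<|end_of_role|>"
+     "Redesign a common household item to make it more sustainable and user-friendly. "
+     "Explain the changes and their benefits.<|end_of_text|>\n"
+     "<|start_of_role|>assistant<|end_of_role|>"
+ )
+
+ params = og.GeneratorParams(model)
+ params.set_search_options(max_length=8192)   # genai_config.json defaults to greedy search
+
+ generator = og.Generator(model, params)
+ generator.append_tokens(tokenizer.encode(prompt))
+
+ # Stream tokens as they are generated.
+ while not generator.is_done():
+     generator.generate_next_token()
+     print(stream.decode(generator.get_next_tokens()[0]), end="", flush=True)
+ print()
+ ```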
84
+
85
+ **Example Outputs**
86
+ - thinking=True
87
+ ```md
88
+ <think>To address this prompt, I'll focus on redesigning a common household item: the plastic water bottle. The goal is to make it more sustainable and user-friendly. Here are the proposed changes and their benefits:
89
+
90
+ 1. **Material**: Replace plastic with biodegradable or recyclable materials like glass or plant-based plastics (PLA).
91
+ 2. **Design**: Introduce a modular design for easy refilling and cleaning.
92
+ 3. **Functionality**: Incorporate a built-in water filter for purification.
93
+ 4. **Durability**: Enhance the bottle's lifespan through robust construction.
94
+ 5. **User-friendliness**: Simplify the cap mechanism for easy opening and closing.
95
+
96
+ These changes would significantly reduce plastic waste, promote recycling, ensure access to clean drinking water, and enhance the overall user experience.
97
+
98
+ Now, I will detail each change and its benefits in a structured format.</think>
99
+
100
+ <response>### Redesign of the Plastic Water Bottle for Sustainability and User-Friendliness
101
+
102
+ #### 1. Material: Biodegradable or Recyclable Alternatives
103
+
104
+ **Change**: Transition from traditional PET plastic to glass or plant-based plastics (Polylactic Acid - PLA).
105
+
106
+ **Benefits**:
107
+ - **Sustainability**: Glass and PLA are biodegradable or highly recyclable, drastically reducing plastic pollution.
108
+ - **Health**: Unlike PET, these materials do not leach chemicals into the water, ensuring safer consumption.
109
+
110
+ #### 2. Modular Design for Easy Refilling and Cleaning
111
+
112
+ **Change**: Implement a detachable, interchangeable inner liner within a reusable outer casing.
113
+
114
+ **Benefits**:
115
+ - **User-Friendliness**: Users can easily remove the inner liner for thorough cleaning, reducing the risk of bacterial growth.
116
+ - **Sustainability**: The outer casing remains intact, encouraging reuse, while the inner liner can be replaced or recycled when worn out.
117
+
118
+ #### 3. Built-in Water Filter
119
+
120
+ **Change**: Integrate a removable, replaceable water filter into the bottle cap.
121
+
122
+ **Benefits**:
123
+ - **Accessibility**: Provides clean drinking water directly from any source (tap, river, etc.), eliminating the need for additional filtration devices.
124
+ - **Health**: Removes contaminants, bacteria, and impurities, ensuring safe hydration.
125
+ - **Sustainability**: Reduces the use of single-use plastic water bottles by offering a purification solution within a reusable container.
126
+
127
+ #### 4. Enhanced Durability
128
+
129
+ **Change**: Use high-quality, durable materials and robust construction techniques for both the outer casing and inner liner.
130
+
131
+ **Benefits**:
132
+ - **Longevity**: A more robust design increases the bottle's lifespan, reducing the frequency of replacements.
133
+ - **Cost-effectiveness**: Although the initial cost might be higher, the long-term savings from reduced replacements make it economical.
134
+ - **Sustainability**: Less frequent disposal means lower environmental impact.
135
+
136
+ #### 5. Simplified Cap Mechanism
137
+
138
+ **Change**: Adopt a one-touch, easy-to-open and close cap design, possibly inspired by sports bottle caps.
139
+
140
+ **Benefits**:
141
+ - **User-Friendliness**: Especially beneficial for children, elderly, or those with limited hand dexterity.
142
+ - **Convenience**: Quick access to water encourages more frequent hydration, promoting healthier habits.
143
+
144
+ ### Conclusion
145
+
146
+ This redesigned water bottle addresses key sustainability concerns by minimizing plastic use and waste, while also enhancing user-friendliness through improved functionality, durability, and accessibility. By integrating biodegradable materials, a modular design, a built-in filter, robust construction, and a simple cap mechanism, the bottle not only reduces environmental footprint but also offers a superior user experience.</response>
147
+ ```
148
+ - thinking=False
149
+ ```md
150
+ Item for Redesign: Plastic Bottle (e.g., water or beverage bottle)
151
+
152
+ Redesign Concept: Eco-Friendly, Modular, and Multi-Functional Reusable Bottle
153
+
154
+ Changes:
155
+
156
+ 1. Material: Replace the conventional plastic with food-grade, BPA-free, and recyclable materials such as stainless steel or glass. These materials are durable, non-toxic, and have a longer lifespan, reducing waste.
157
+
158
+ 2. Modular Design: Introduce a modular design that allows users to customize their bottle according to their needs. This could include interchangeable caps, mouthpiece sizes, and carrying loops. Users can purchase additional modules as needed, promoting a more sustainable approach by reducing the need to buy an entirely new bottle for different purposes.
159
+
160
+ 3. Integrated Filter: Incorporate a built-in, washable, and reusable filter that can remove impurities and improve the taste of water. This eliminates the need for single-use disposable filters or bottled water, further reducing plastic waste.
161
+
162
+ 4. Smart Cap: Develop a smart cap with a built-in digital display and temperature sensor. This feature allows users to track their daily water intake, set hydration goals, and monitor the temperature of their beverage. The smart cap can be synced with a mobile app for additional functionality, such as reminders and progress tracking.
163
+
164
+ 5. Easy-to-Clean Design: Ensure the bottle has a wide mouth and smooth interior surfaces for easy cleaning. Include a brush for hard-to-reach areas, making maintenance simple and encouraging regular use.
165
+
166
+ 6. Collapsible Structure: Implement a collapsible design that reduces the bottle's volume when not in use, making it more portable and convenient for storage.
167
+
168
+ Benefits:
169
+
170
+ 1. Sustainability: By using recyclable materials and reducing plastic waste, this redesigned bottle significantly contributes to a more sustainable lifestyle. The modular design and reusable filter also minimize single-use plastic consumption.
171
+
172
+ 2. User-Friendly: The smart cap, easy-to-clean design, and collapsible structure make the bottle convenient and user-friendly. Users can customize their bottle to suit their needs, ensuring a better overall experience.
173
+
174
+ 3. Healthier Option: Using food-grade, BPA-free materials and an integrated filter ensures that the beverages consumed are free from harmful chemicals and impurities, promoting a healthier lifestyle.
175
+
176
+ 4. Cost-Effective: Although the initial investment might be higher, the long-term savings from reduced purchases of single-use plastic bottles and disposable filters make this reusable bottle a cost-effective choice.
177
+
178
+ 5. Encourages Hydration: The smart cap's features, such as hydration tracking and temperature monitoring, can motivate users to stay hydrated and develop healthier habits.
179
+
180
+ By redesigning a common household item like the plastic bottle, we can create a more sustainable, user-friendly, and health-conscious alternative that benefits both individuals and the environment.
181
+ ```
182
+
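+ The capabilities above also include RAG and function calling. The chat template in this repository's `tokenizer_config.json` accepts `documents` (a list of dicts with `doc_id` and `text` keys) and `tools`, in addition to the `thinking` flag used earlier. The snippet below is an illustrative sketch of document-grounded generation with the Transformers tokenizer; it assumes a `transformers` version recent enough to forward `documents` to the chat template, and the document contents are placeholders drawn from this card.
+
+ ```python
+ from transformers import AutoModelForCausalLM, AutoTokenizer
+ import torch
+
+ model_path = "ibm-granite/granite-3.3-8b-instruct"
+ device = "cuda"
+ model = AutoModelForCausalLM.from_pretrained(
+     model_path,
+     device_map=device,
+     torch_dtype=torch.bfloat16,
+ )
+ tokenizer = AutoTokenizer.from_pretrained(model_path)
+
+ # Placeholder documents; the chat template expects 'doc_id' and 'text' keys.
+ documents = [
+     {"doc_id": "1", "text": "Granite 3.3 models support a 128K token context length."},
+     {"doc_id": "2", "text": "Granite 3.3 instruct models are released under the Apache 2.0 license."},
+ ]
+ conv = [{"role": "user", "content": "What context length does Granite 3.3 support, and under what license is it released?"}]
+
+ input_ids = tokenizer.apply_chat_template(
+     conv,
+     documents=documents,   # rendered as <|start_of_role|>document ...<|end_of_role|> blocks
+     add_generation_prompt=True,
+     return_tensors="pt",
+     return_dict=True,
+ ).to(device)
+
+ output = model.generate(**input_ids, max_new_tokens=512)
+ print(tokenizer.decode(output[0, input_ids["input_ids"].shape[1]:], skip_special_tokens=True))
+ ```
+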
183
+ **Evaluation Results:**
184
+ <table>
185
+ <thead>
186
+ <caption style="text-align:center"><b>Comparison with different models over various benchmarks<sup id="fnref1"><a href="#fn1">1</a></sup>. Scores of AlpacaEval-2.0 and Arena-Hard are calculated with thinking=True</b></caption>
187
+ <tr>
188
+ <th style="text-align:left; background-color: #001d6c; color: white;">Models</th>
189
+ <th style="text-align:center; background-color: #001d6c; color: white;">Arena-Hard</th>
190
+ <th style="text-align:center; background-color: #001d6c; color: white;">AlpacaEval-2.0</th>
191
+ <th style="text-align:center; background-color: #001d6c; color: white;">MMLU</th>
192
+ <th style="text-align:center; background-color: #001d6c; color: white;">PopQA</th>
193
+ <th style="text-align:center; background-color: #001d6c; color: white;">TruthfulQA</th>
194
+ <th style="text-align:center; background-color: #001d6c; color: white;">BigBenchHard<sup id="fnref2"><a href="#fn2">2</a></sup></th>
195
+ <th style="text-align:center; background-color: #001d6c; color: white;">DROP<sup id="fnref3"><a href="#fn3">3</a></sup></th>
196
+ <th style="text-align:center; background-color: #001d6c; color: white;">GSM8K</th>
197
+ <th style="text-align:center; background-color: #001d6c; color: white;">HumanEval</th>
198
+ <th style="text-align:center; background-color: #001d6c; color: white;">HumanEval+</th>
199
+ <th style="text-align:center; background-color: #001d6c; color: white;">IFEval</th>
200
+ <th style="text-align:center; background-color: #001d6c; color: white;">AttaQ</th>
201
+ </tr></thead>
202
+ <tbody>
203
+ <tr>
204
+ <td style="text-align:left; background-color: #FFFFFF; color: #2D2D2D;">Granite-3.1-2B-Instruct</td>
205
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">23.3</td>
206
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">27.17</td>
207
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">57.11</td>
208
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">20.55</td>
209
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">59.79</td>
210
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">61.82</td>
211
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">20.99</td>
212
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">67.55</td>
213
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">79.45</td>
214
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">75.26</td>
215
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">63.59</td>
216
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">84.7</td>
217
+ </tr>
218
+ <tr>
219
+ <td style="text-align:left; background-color: #FFFFFF; color: #2D2D2D;">Granite-3.2-2B-Instruct</td>
220
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">24.86</td>
221
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">34.51</td>
222
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">57.18</td>
223
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">20.56</td>
224
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">59.8</td>
225
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">61.39</td>
226
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">23.84</td>
227
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">67.02</td>
228
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">80.13</td>
229
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">73.39</td>
230
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">61.55</td>
231
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">83.23</td>
232
+ </tr>
233
+ <tr>
234
+ <td style="text-align:left; background-color: #DAE8FF; color: black;"><b>Granite-3.3-2B-Instruct</b></td>
235
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 28.86 </td>
236
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 43.45 </td>
237
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 55.88 </td>
238
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 18.4 </td>
239
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 58.97 </td>
240
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 63.91 </td>
241
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 44.33 </td>
242
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 72.48 </td>
243
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 80.51 </td>
244
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 75.68 </td>
245
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 65.8 </td>
246
+ <td style="text-align:center; background-color: #DAE8FF; color: black;">87.47</td>
247
+ </tr>
248
+
249
+ <tr>
250
+ <td style="text-align:left; background-color: #FFFFFF; color: #2D2D2D;">Llama-3.1-8B-Instruct</td>
251
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">36.43</td>
252
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">27.22</td>
253
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">69.15</td>
254
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">28.79</td>
255
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">52.79</td>
256
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">73.43</td>
257
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">71.23</td>
258
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">83.24</td>
259
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">85.32</td>
260
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">80.15</td>
261
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">79.10</td>
262
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">83.43</td>
263
+ </tr>
264
+
265
+ <tr>
266
+ <td style="text-align:left; background-color: #FFFFFF; color: #2D2D2D;">DeepSeek-R1-Distill-Llama-8B</td>
267
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">17.17</td>
268
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">21.85</td>
269
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">45.80</td>
270
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">13.25</td>
271
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">47.43</td>
272
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">67.39</td>
273
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">49.73</td>
274
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">72.18</td>
275
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">67.54</td>
276
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">62.91</td>
277
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">66.50</td>
278
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">42.87</td>
279
+ </tr>
280
+
281
+ <tr>
282
+ <td style="text-align:left; background-color: #FFFFFF; color: #2D2D2D;">Qwen-2.5-7B-Instruct</td>
283
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">25.44</td>
284
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">30.34</td>
285
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">74.30</td>
286
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">18.12</td>
287
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">63.06</td>
288
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">69.19</td>
289
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">64.06</td>
290
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">84.46</td>
291
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">93.35</td>
292
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">89.91</td>
293
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">74.90</td>
294
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">81.90</td>
295
+ </tr>
296
+
297
+ <tr>
298
+ <td style="text-align:left; background-color: #FFFFFF; color: #2D2D2D;">DeepSeek-R1-Distill-Qwen-7B</td>
299
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">10.36</td>
300
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">15.35</td>
301
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">50.72</td>
302
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">9.94</td>
303
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">47.14</td>
304
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">67.38</td>
305
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">51.78</td>
306
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">78.47</td>
307
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">79.89</td>
308
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">78.43</td>
309
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">59.10</td>
310
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">42.45</td>
311
+ </tr>
312
+ <tr>
313
+ <td style="text-align:left; background-color: #FFFFFF; color: #2D2D2D;">Granite-3.1-8B-Instruct</td>
314
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">37.58</td>
315
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">30.34</td>
316
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">66.77</td>
317
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">28.7</td>
318
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">65.84</td>
319
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">69.87</td>
320
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">58.57</td>
321
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">79.15</td>
322
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">89.63</td>
323
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">85.79</td>
324
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">73.20</td>
325
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">85.73</td>
326
+ </tr>
327
+
328
+ <tr>
329
+ <td style="text-align:left; background-color: #FFFFFF; color: #2D2D2D;">Granite-3.2-8B-Instruct</td>
330
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">55.25</td>
331
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">61.19</td>
332
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">66.79</td>
333
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">28.04</td>
334
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">66.92</td>
335
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">71.86</td>
336
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">58.29</td>
337
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">81.65</td>
338
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">89.35</td>
339
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">85.72</td>
340
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">74.31</td>
341
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;">84.7</td>
342
+ </tr>
343
+ <tr>
344
+ <td style="text-align:left; background-color: #DAE8FF; color: black;"><b>Granite-3.3-8B-Instruct</b></td>
345
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 57.56 </td>
346
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 62.68 </td>
347
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 65.54 </td>
348
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 26.17 </td>
349
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 66.86 </td>
350
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 69.13 </td>
351
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 59.36 </td>
352
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 80.89 </td>
353
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 89.73 </td>
354
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 86.09 </td>
355
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 74.82 </td>
356
+ <td style="text-align:center; background-color: #DAE8FF; color: black;">88.5</td>
357
+ </tr>
358
+ </tbody></table>
359
+
360
+ <table>
361
+ <caption style="text-align:center"><b>Math Benchmarks</b></caption>
362
+ <thead>
363
+ <tr>
364
+ <th style="text-align:left; background-color: #001d6c; color: white;">Models</th>
365
+ <th style="text-align:center; background-color: #001d6c; color: white;">AIME24</th>
366
+ <th style="text-align:center; background-color: #001d6c; color: white;">MATH-500</th>
367
+ </tr></thead>
368
+ <tbody>
369
+ <tr>
370
+ <td style="text-align:left; background-color: #FFFFFF; color: #2D2D2D;">Granite-3.1-2B-Instruct</td>
371
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;"> 0.89 </td>
372
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;"> 35.07 </td>
373
+ </tr>
374
+ <tr>
375
+ <td style="text-align:left; background-color: #FFFFFF; color: #2D2D2D;">Granite-3.2-2B-Instruct</td>
376
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;"> 0.89 </td>
377
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;"> 35.54 </td>
378
+ </tr>
379
+ <tr>
380
+ <td style="text-align:left; background-color: #DAE8FF; color: black;"><b>Granite-3.3-2B-Instruct</b></td>
381
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 3.28 </td>
382
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 58.09 </td>
383
+ </tr>
384
+ <tr>
385
+ <td style="text-align:left; background-color: #FFFFFF; color: #2D2D2D;">Granite-3.1-8B-Instruct</td>
386
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;"> 1.97 </td>
387
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;"> 48.73 </td>
388
+ </tr>
389
+ <tr>
390
+ <td style="text-align:left; background-color: #FFFFFF; color: #2D2D2D;">Granite-3.2-8B-Instruct</td>
391
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;"> 2.43 </td>
392
+ <td style="text-align:center; background-color: #FFFFFF; color: #2D2D2D;"> 52.8 </td>
393
+ </tr>
394
+ <tr>
395
+ <td style="text-align:left; background-color: #DAE8FF; color: black;"><b>Granite-3.3-8B-Instruct</b></td>
396
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 8.12 </td>
397
+ <td style="text-align:center; background-color: #DAE8FF; color: black;"> 69.02 </td>
398
+ </tr>
399
+ </tbody></table>
400
+
401
+ **Training Data:**
402
+ Overall, our training data largely comprises two key sources: (1) publicly available datasets with permissive licenses, and (2) internal synthetically generated data targeted at enhancing reasoning capabilities.
403
+ <!-- A detailed attribution of datasets can be found in [Granite 3.2 Technical Report (coming soon)](#), and [Accompanying Author List](https://github.com/ibm-granite/granite-3.0-language-models/blob/main/author-ack.pdf). -->
404
+
405
+ **Infrastructure:**
406
+ We train Granite-3.3-8B-Instruct using IBM's supercomputing cluster, Blue Vela, which is outfitted with NVIDIA H100 GPUs. This cluster provides a scalable and efficient infrastructure for training our models across thousands of GPUs.
407
+
408
+ **Ethical Considerations and Limitations:**
409
+ Granite-3.3-8B-Instruct builds upon Granite-3.3-8B-Base, leveraging both permissively licensed open-source and select proprietary data for enhanced performance. Since it inherits its foundation from the previous model, all ethical considerations and limitations applicable to [Granite-3.3-8B-Base](https://huggingface.co/ibm-granite/granite-3.3-8b-base) remain relevant.
410
+
411
+
412
+ **Resources**
413
+ - ⭐️ Learn about the latest updates with Granite: https://www.ibm.com/granite
414
+ - 📄 Get started with tutorials, best practices, and prompt engineering advice: https://www.ibm.com/granite/docs/
415
+ - 💡 Learn about the latest Granite learning resources: https://github.com/ibm-granite-community/
416
+
417
+ <p><a href="#fnref1" title="Jump back to reference">[1]</a> Evaluated using <a href="https://github.com/allenai/olmes">OLMES</a> (except AttaQ and Arena-Hard scores)</p>
418
+ <p><a href="#fnref2" title="Jump back to reference">[2]</a> Added regex for more efficient answer extraction.</p>
419
+ <p><a href="#fnref3" title="Jump back to reference">[3]</a> Modified the implementation to handle some of the issues mentioned <a href="https://huggingface.co/blog/open-llm-leaderboard-drop">here</a></p>
420
+ <!-- ## Citation
422
+ ```
423
+ @misc{granite-models,
424
+ author = {author 1, author2, ...},
425
+ title = {},
426
+ journal = {},
427
+ volume = {},
428
+ year = {2024},
429
+ url = {https://arxiv.org/abs/0000.00000},
430
+ }
431
+ ``` -->
added_tokens.json ADDED
@@ -0,0 +1,9 @@
1
+ {
2
+ "<|end_of_cite|>": 49156,
3
+ "<|end_of_plugin|>": 49158,
4
+ "<|end_of_role|>": 49153,
5
+ "<|start_of_cite|>": 49155,
6
+ "<|start_of_plugin|>": 49157,
7
+ "<|start_of_role|>": 49152,
8
+ "<|tool_call|>": 49154
9
+ }
genai_config.json ADDED
@@ -0,0 +1,50 @@
1
+ {
2
+ "model": {
3
+ "bos_token_id": 0,
4
+ "context_length": 131072,
5
+ "decoder": {
6
+ "session_options": {
7
+ "log_id": "onnxruntime-genai",
8
+ "provider_options": []
9
+ },
10
+ "filename": "model.onnx",
11
+ "head_size": 128,
12
+ "hidden_size": 4096,
13
+ "inputs": {
14
+ "input_ids": "input_ids",
15
+ "attention_mask": "attention_mask",
16
+ "position_ids": "position_ids",
17
+ "past_key_names": "past_key_values.%d.key",
18
+ "past_value_names": "past_key_values.%d.value"
19
+ },
20
+ "outputs": {
21
+ "logits": "logits",
22
+ "present_key_names": "present.%d.key",
23
+ "present_value_names": "present.%d.value"
24
+ },
25
+ "num_attention_heads": 32,
26
+ "num_hidden_layers": 40,
27
+ "num_key_value_heads": 8
28
+ },
29
+ "eos_token_id": 0,
30
+ "pad_token_id": 0,
31
+ "type": "granite",
32
+ "vocab_size": 49159
33
+ },
34
+ "search": {
35
+ "diversity_penalty": 0.0,
36
+ "do_sample": false,
37
+ "early_stopping": true,
38
+ "length_penalty": 1.0,
39
+ "max_length": 131072,
40
+ "min_length": 0,
41
+ "no_repeat_ngram_size": 0,
42
+ "num_beams": 1,
43
+ "num_return_sequences": 1,
44
+ "past_present_share_buffer": false,
45
+ "repetition_penalty": 1.0,
46
+ "temperature": 1.0,
47
+ "top_k": 1,
48
+ "top_p": 1.0
49
+ }
50
+ }
merges.txt ADDED
The diff for this file is too large to render. See raw diff
 
model.onnx ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f4d8ddc424d677ffeaac314abeb53fd3b3859f1d200a09b62a25fb49256fd4a4
3
+ size 952122
model.onnx.data ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a0d6b289613807231e23e2370bde15428d31da9f603065e3bed1fa9b027a4e58
3
+ size 16777994240
special_tokens_map.json ADDED
@@ -0,0 +1,39 @@
1
+ {
2
+ "additional_special_tokens": [
3
+ "<|start_of_role|>",
4
+ "<|end_of_role|>",
5
+ "<|tool_call|>",
6
+ "<|start_of_cite|>",
7
+ "<|end_of_cite|>",
8
+ "<|start_of_plugin|>",
9
+ "<|end_of_plugin|>"
10
+ ],
11
+ "bos_token": {
12
+ "content": "<|end_of_text|>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false
17
+ },
18
+ "eos_token": {
19
+ "content": "<|end_of_text|>",
20
+ "lstrip": false,
21
+ "normalized": false,
22
+ "rstrip": false,
23
+ "single_word": false
24
+ },
25
+ "pad_token": {
26
+ "content": "<|end_of_text|>",
27
+ "lstrip": false,
28
+ "normalized": false,
29
+ "rstrip": false,
30
+ "single_word": false
31
+ },
32
+ "unk_token": {
33
+ "content": "<|end_of_text|>",
34
+ "lstrip": false,
35
+ "normalized": false,
36
+ "rstrip": false,
37
+ "single_word": false
38
+ }
39
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,235 @@
1
+ {
2
+ "add_bos_token": false,
3
+ "add_prefix_space": false,
4
+ "added_tokens_decoder": {
5
+ "0": {
6
+ "content": "<|end_of_text|>",
7
+ "lstrip": false,
8
+ "normalized": false,
9
+ "rstrip": false,
10
+ "single_word": false,
11
+ "special": true
12
+ },
13
+ "1": {
14
+ "content": "<fim_prefix>",
15
+ "lstrip": false,
16
+ "normalized": false,
17
+ "rstrip": false,
18
+ "single_word": false,
19
+ "special": true
20
+ },
21
+ "2": {
22
+ "content": "<fim_middle>",
23
+ "lstrip": false,
24
+ "normalized": false,
25
+ "rstrip": false,
26
+ "single_word": false,
27
+ "special": true
28
+ },
29
+ "3": {
30
+ "content": "<fim_suffix>",
31
+ "lstrip": false,
32
+ "normalized": false,
33
+ "rstrip": false,
34
+ "single_word": false,
35
+ "special": true
36
+ },
37
+ "4": {
38
+ "content": "<fim_pad>",
39
+ "lstrip": false,
40
+ "normalized": false,
41
+ "rstrip": false,
42
+ "single_word": false,
43
+ "special": true
44
+ },
45
+ "5": {
46
+ "content": "<filename>",
47
+ "lstrip": false,
48
+ "normalized": false,
49
+ "rstrip": false,
50
+ "single_word": false,
51
+ "special": true
52
+ },
53
+ "6": {
54
+ "content": "<gh_stars>",
55
+ "lstrip": false,
56
+ "normalized": false,
57
+ "rstrip": false,
58
+ "single_word": false,
59
+ "special": true
60
+ },
61
+ "7": {
62
+ "content": "<issue_start>",
63
+ "lstrip": false,
64
+ "normalized": false,
65
+ "rstrip": false,
66
+ "single_word": false,
67
+ "special": true
68
+ },
69
+ "8": {
70
+ "content": "<issue_comment>",
71
+ "lstrip": false,
72
+ "normalized": false,
73
+ "rstrip": false,
74
+ "single_word": false,
75
+ "special": true
76
+ },
77
+ "9": {
78
+ "content": "<issue_closed>",
79
+ "lstrip": false,
80
+ "normalized": false,
81
+ "rstrip": false,
82
+ "single_word": false,
83
+ "special": true
84
+ },
85
+ "10": {
86
+ "content": "<jupyter_start>",
87
+ "lstrip": false,
88
+ "normalized": false,
89
+ "rstrip": false,
90
+ "single_word": false,
91
+ "special": true
92
+ },
93
+ "11": {
94
+ "content": "<jupyter_text>",
95
+ "lstrip": false,
96
+ "normalized": false,
97
+ "rstrip": false,
98
+ "single_word": false,
99
+ "special": true
100
+ },
101
+ "12": {
102
+ "content": "<jupyter_code>",
103
+ "lstrip": false,
104
+ "normalized": false,
105
+ "rstrip": false,
106
+ "single_word": false,
107
+ "special": true
108
+ },
109
+ "13": {
110
+ "content": "<jupyter_output>",
111
+ "lstrip": false,
112
+ "normalized": false,
113
+ "rstrip": false,
114
+ "single_word": false,
115
+ "special": true
116
+ },
117
+ "14": {
118
+ "content": "<empty_output>",
119
+ "lstrip": false,
120
+ "normalized": false,
121
+ "rstrip": false,
122
+ "single_word": false,
123
+ "special": true
124
+ },
125
+ "15": {
126
+ "content": "<commit_before>",
127
+ "lstrip": false,
128
+ "normalized": false,
129
+ "rstrip": false,
130
+ "single_word": false,
131
+ "special": true
132
+ },
133
+ "16": {
134
+ "content": "<commit_msg>",
135
+ "lstrip": false,
136
+ "normalized": false,
137
+ "rstrip": false,
138
+ "single_word": false,
139
+ "special": true
140
+ },
141
+ "17": {
142
+ "content": "<commit_after>",
143
+ "lstrip": false,
144
+ "normalized": false,
145
+ "rstrip": false,
146
+ "single_word": false,
147
+ "special": true
148
+ },
149
+ "18": {
150
+ "content": "<reponame>",
151
+ "lstrip": false,
152
+ "normalized": false,
153
+ "rstrip": false,
154
+ "single_word": false,
155
+ "special": true
156
+ },
157
+ "49152": {
158
+ "content": "<|start_of_role|>",
159
+ "lstrip": false,
160
+ "normalized": false,
161
+ "rstrip": false,
162
+ "single_word": false,
163
+ "special": true
164
+ },
165
+ "49153": {
166
+ "content": "<|end_of_role|>",
167
+ "lstrip": false,
168
+ "normalized": false,
169
+ "rstrip": false,
170
+ "single_word": false,
171
+ "special": true
172
+ },
173
+ "49154": {
174
+ "content": "<|tool_call|>",
175
+ "lstrip": false,
176
+ "normalized": false,
177
+ "rstrip": false,
178
+ "single_word": false,
179
+ "special": true
180
+ },
181
+ "49155": {
182
+ "content": "<|start_of_cite|>",
183
+ "lstrip": false,
184
+ "normalized": false,
185
+ "rstrip": false,
186
+ "single_word": false,
187
+ "special": true
188
+ },
189
+ "49156": {
190
+ "content": "<|end_of_cite|>",
191
+ "lstrip": false,
192
+ "normalized": false,
193
+ "rstrip": false,
194
+ "single_word": false,
195
+ "special": true
196
+ },
197
+ "49157": {
198
+ "content": "<|start_of_plugin|>",
199
+ "lstrip": false,
200
+ "normalized": false,
201
+ "rstrip": false,
202
+ "single_word": false,
203
+ "special": true
204
+ },
205
+ "49158": {
206
+ "content": "<|end_of_plugin|>",
207
+ "lstrip": false,
208
+ "normalized": false,
209
+ "rstrip": false,
210
+ "single_word": false,
211
+ "special": true
212
+ }
213
+ },
214
+ "additional_special_tokens": [
215
+ "<|start_of_role|>",
216
+ "<|end_of_role|>",
217
+ "<|tool_call|>",
218
+ "<|start_of_cite|>",
219
+ "<|end_of_cite|>",
220
+ "<|start_of_plugin|>",
221
+ "<|end_of_plugin|>"
222
+ ],
223
+ "bos_token": "<|end_of_text|>",
224
+ "chat_template": "{# Alias tools -> available_tools #}\n{%- if tools and not available_tools -%}\n {%- set available_tools = tools -%}\n{%- endif -%}\n{%- if messages[0]['role'] == 'system' %}\n {%- set system_message = messages[0]['content'] %}\n {%- set loop_messages = messages[1:] %}\n {%- else %}\n {%- set system_message = \"Knowledge Cutoff Date: April 2024.\nToday's Date: \" + strftime_now('%B %d, %Y') + \".\nYou are Granite, developed by IBM.\" %}\n {%- if available_tools and documents %}\n {%- set system_message = system_message + \" You are a helpful assistant with access to the following tools. When a tool is required to answer the user's query, respond only with <|tool_call|> followed by a JSON list of tools used. If a tool does not exist in the provided list of tools, notify the user that you do not have the ability to fulfill the request.\nWrite the response to the user's input by strictly aligning with the facts in the provided documents. If the information needed to answer the question is not available in the documents, inform the user that the question cannot be answered based on the available data.\" %}\n {%- elif available_tools %}\n {%- set system_message = system_message + \" You are a helpful assistant with access to the following tools. When a tool is required to answer the user's query, respond only with <|tool_call|> followed by a JSON list of tools used. If a tool does not exist in the provided list of tools, notify the user that you do not have the ability to fulfill the request.\" %}\n {%- elif documents %}\n {%- set system_message = system_message + \" Write the response to the user's input by strictly aligning with the facts in the provided documents. If the information needed to answer the question is not available in the documents, inform the user that the question cannot be answered based on the available data.\" %}\n {%- elif thinking %}\n {%- set system_message = system_message + \" You are a helpful AI assistant.\nRespond to every user query in a comprehensive and detailed way. You can write down your thoughts and reasoning process before responding. In the thought process, engage in a comprehensive cycle of analysis, summarization, exploration, reassessment, reflection, backtracing, and iteration to develop well-considered thinking process. In the response section, based on various attempts, explorations, and reflections from the thoughts section, systematically present the final solution that you deem correct. The response should summarize the thought process. Write your thoughts between <think></think> and write your response between <response></response> for each user query.\" %}\n {%- else %}\n {%- set system_message = system_message + \" You are a helpful AI assistant.\" %}\n {%- endif %}\n {%- if 'citations' in controls and documents %}\n {%- set system_message = system_message + '\nUse the symbols <|start_of_cite|> and <|end_of_cite|> to indicate when a fact comes from a document in the search result, e.g <|start_of_cite|> {document_id: 1}my fact <|end_of_cite|> for a fact from document 1. Afterwards, list all the citations with their corresponding documents in an ordered list.' %}\n {%- endif %}\n {%- if 'hallucinations' in controls and documents %}\n {%- set system_message = system_message + '\nFinally, after the response is written, include a numbered list of sentences from the response with a corresponding risk value that are hallucinated and not based in the documents.' 
%}\n {%- endif %}\n {%- set loop_messages = messages %}\n {%- endif %}\n {{- '<|start_of_role|>system<|end_of_role|>' + system_message + '<|end_of_text|>\n' }}\n {%- if available_tools %}\n {{- '<|start_of_role|>available_tools<|end_of_role|>' }}\n {{- available_tools | tojson(indent=4) }}\n {{- '<|end_of_text|>\n' }}\n {%- endif %}\n {%- if documents %}\n {%- for document in documents %}\n {{- '<|start_of_role|>document {\"document_id\": \"' + document['doc_id'] | string + '\"}<|end_of_role|>\n' }}\n {{- document['text'] }}\n {{- '<|end_of_text|>\n' }}\n {%- endfor %}\n {%- endif %}\n {%- for message in loop_messages %}\n {{- '<|start_of_role|>' + message['role'] + '<|end_of_role|>' + message['content'] + '<|end_of_text|>\n' }}\n {%- if loop.last and add_generation_prompt %}\n {{- '<|start_of_role|>assistant' }}\n {%- if controls %}\n {{- ' ' + controls | tojson()}}\n {%- endif %}\n {{- '<|end_of_role|>' }}\n {%- endif %}\n {%- endfor %}",
225
+ "clean_up_tokenization_spaces": true,
226
+ "eos_token": "<|end_of_text|>",
227
+ "errors": "replace",
228
+ "extra_special_tokens": {},
229
+ "model_max_length": 9223372036854775807,
230
+ "pad_token": "<|end_of_text|>",
231
+ "padding_side": "left",
232
+ "tokenizer_class": "GPT2Tokenizer",
233
+ "unk_token": "<|end_of_text|>",
234
+ "vocab_size": 49152
235
+ }
vocab.json ADDED
The diff for this file is too large to render. See raw diff