Update README.md
README.md
CHANGED
@@ -19,7 +19,7 @@ This model is a fine-tuned version of [meta-llama/Llama-2-7b-chat-hf](https://hu
This model has been generated in the context of the [Araft](https://github.com/FDeRubeis/Araft) project. The Araft project consists of fine-tuning a Llama2-7B model to adapt it to use the ReAct pattern for Wikipedia-augmented question-answering. This model is the product of the first training step: SFT training.

- In the SFT training step, the trajectories from the [Araft dataset](https://huggingface.co/datasets/FDeRubeis/araft) have been used to fine-tune the model, using each step as a desired output for the previous part of the trajectory. The model
+ In the SFT training step, the trajectories from the [Araft dataset](https://huggingface.co/datasets/FDeRubeis/araft) have been used to fine-tune the model, using each step as a desired output for the previous part of the trajectory. The model achieves an F1 score of 16% on the HotpotQA dataset.

For further information, please see the [Araft](https://github.com/FDeRubeis/Araft) GitHub repo.
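The added line above summarizes the SFT objective: every step of a trajectory serves as the target completion for the trajectory prefix that precedes it. Below is a minimal sketch of that pairing logic, following the sentence literally; the record fields (`question`, `steps`) and the example trajectory are illustrative assumptions, not the actual schema of the Araft dataset, and details such as whether Observation steps are kept as targets or masked are defined in the Araft repo.

```python
# Illustrative sketch: turn one ReAct trajectory into supervised (prompt, completion)
# pairs, where the prompt is the trajectory so far and the completion is the next step.
# The field names ("question", "steps") and the example record are assumed for
# illustration; they are not the actual columns of the FDeRubeis/araft dataset.

def trajectory_to_sft_pairs(record):
    pairs = []
    context = f"Question: {record['question']}\n"
    for step in record["steps"]:              # Thought / Action / Observation strings
        pairs.append({"prompt": context, "completion": step})
        context += step + "\n"                # the step becomes part of the next prompt
    return pairs

example = {
    "question": "Which country hosted the 1992 Summer Olympics?",
    "steps": [
        "Thought: I need to look up the 1992 Summer Olympics.",
        "Action: search[1992 Summer Olympics]",
        "Observation: The 1992 Summer Olympics were held in Barcelona, Spain.",
        "Thought: The host country is Spain.",
        "Action: finish[Spain]",
    ],
}

for pair in trajectory_to_sft_pairs(example):
    print(repr(pair["completion"]))
```

Pairs in this form can then be fed to any standard causal-LM supervised fine-tuning loop.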
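Since the model is trained to follow the ReAct pattern (interleaved Thought, Action, and Observation steps, with Wikipedia lookups supplying the observations), it is meant to be driven by an agent loop rather than used as a plain chat model. The snippet below is a hedged sketch of a single generation call with the `transformers` API; the repo id `FDeRubeis/araft-sft`, the prompt layout, and the generation settings are assumptions for illustration, not the conventions of the Araft repo.

```python
# Minimal sketch (not the official Araft agent loop): query the SFT checkpoint
# with a ReAct-style prompt. The repo id, prompt layout, and stop handling are
# illustrative assumptions; see the Araft GitHub repo for the real implementation.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "FDeRubeis/araft-sft"  # hypothetical id; replace with this model's repo id

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID, torch_dtype=torch.float16, device_map="auto"
)

prompt = (
    "Answer the question by interleaving Thought, Action and Observation steps.\n"
    "Question: Which country hosted the 1992 Summer Olympics?\n"
    "Thought:"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64, do_sample=False)
print(tokenizer.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```

In the full agent loop, generation would stop after each Action, the Wikipedia query would be executed, and its result would be appended to the prompt as an Observation before generating again.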