kreasof-ai/nllb-200-600M-bem2eng-bigc-tatoeba

@@ -18,9 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1743
-- Bleu: 27.72
-- Chrf: 51.94
+- Loss: 0.1749
+- Bleu: 27.41
+- Chrf: 51.88
 ## Model description
@@ -53,9 +53,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss | Bleu  | Chrf  |
 |:-------------:|:-----:|:-----:|:---------------:|:-----:|:-----:|
-| 0.1605        | 1.0   | 6556  | 0.1821          | 26.26 | 51.26 |
-| 0.1397        | 2.0   | 13112 | 0.1744          | 27.1  | 51.72 |
-| 0.1197        | 3.0   | 19668 | 0.1743          | 27.72 | 51.94 |
+| 0.1608        | 1.0   | 6556  | 0.1825          | 26.23 | 50.96 |
+| 0.1406        | 2.0   | 13112 | 0.1749          | 26.94 | 51.61 |
+| 0.1198        | 3.0   | 19668 | 0.1749          | 27.41 | 51.88 |
 ### Framework versions

generation_config.json CHANGED

@@ -1,5 +1,4 @@
 {
-  "_from_model_config": true,
   "bos_token_id": 0,
   "decoder_start_token_id": 2,
   "eos_token_id": 2,