embc25_finetuned_30000_en_es-ipa

This model is a fine-tuned version of Kyungjin-Kim/mmc_roberta_500000_en_es-ipa on an unspecified dataset. It achieves the following results on the evaluation set:

  • Loss: 0.4821
  • Accuracy: 0.836
  • Precision: 0.8389
  • Recall: 0.8317
  • F1: 0.8353
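
The reported accuracy/precision/recall/F1 suggest a sequence-classification task, although the task, label set, and input format are not documented here. The snippet below is a minimal loading sketch under those assumptions, not a confirmed usage example:

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

model_id = "Kyungjin-Kim/embc25_finetuned_30000_en_es-ipa"

# Load the tokenizer and the fine-tuned model from the Hub.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(model_id)
model.eval()

# Placeholder input -- the expected input format (e.g., IPA transcriptions)
# is an assumption and should be replaced with data matching the fine-tuning setup.
text = "..."
inputs = tokenizer(text, return_tensors="pt", truncation=True)

with torch.no_grad():
    logits = model(**inputs).logits

predicted_class = logits.argmax(dim=-1).item()
print(predicted_class)
```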

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training (a TrainingArguments sketch follows the list):

  • learning_rate: 2e-05
  • train_batch_size: 16
  • eval_batch_size: 8
  • seed: 42
  • gradient_accumulation_steps: 4
  • total_train_batch_size: 64
  • optimizer: adamw_torch with betas=(0.9, 0.999), epsilon=1e-08, and no additional optimizer arguments
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 500
  • num_epochs: 10
  • mixed_precision_training: Native AMP
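
For reference, these settings map onto Hugging Face TrainingArguments roughly as shown below; the output_dir is an illustrative assumption, not taken from the original training script:

```python
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="embc25_finetuned_30000_en_es-ipa",  # assumed name
    learning_rate=2e-5,
    per_device_train_batch_size=16,
    per_device_eval_batch_size=8,
    seed=42,
    gradient_accumulation_steps=4,   # effective train batch size: 16 * 4 = 64
    optim="adamw_torch",
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    lr_scheduler_type="linear",
    warmup_steps=500,
    num_train_epochs=10,
    fp16=True,                       # native AMP mixed-precision training
)
```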

Training results

| Training Loss | Epoch  | Step | Validation Loss | Accuracy | Precision | Recall | F1     |
|:-------------:|:------:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
| 0.4869        | 0.5926 | 500  | 0.4969          | 0.75     | 0.7040    | 0.8627 | 0.7753 |
| 0.3874        | 1.1849 | 1000 | 0.4351          | 0.7955   | 0.7548    | 0.8753 | 0.8106 |
| 0.3566        | 1.7775 | 1500 | 0.4139          | 0.8162   | 0.8519    | 0.7653 | 0.8063 |
| 0.2936        | 2.3698 | 2000 | 0.3982          | 0.8213   | 0.8473    | 0.784  | 0.8144 |
| 0.2993        | 2.9624 | 2500 | 0.4063          | 0.8255   | 0.7985    | 0.8707 | 0.8330 |
| 0.2421        | 3.5547 | 3000 | 0.4422          | 0.8302   | 0.8213    | 0.844  | 0.8325 |
| 0.176         | 4.1470 | 3500 | 0.4823          | 0.8287   | 0.8058    | 0.866  | 0.8348 |
| 0.1747        | 4.7396 | 4000 | 0.4821          | 0.836    | 0.8389    | 0.8317 | 0.8353 |
| 0.129         | 5.3319 | 4500 | 0.5636          | 0.8325   | 0.8198    | 0.8523 | 0.8358 |
| 0.1333        | 5.9244 | 5000 | 0.5687          | 0.8287   | 0.8041    | 0.869  | 0.8353 |
| 0.112         | 6.5167 | 5500 | 0.6131          | 0.8313   | 0.8502    | 0.8043 | 0.8267 |
| 0.0705        | 7.1090 | 6000 | 0.7031          | 0.8327   | 0.8338    | 0.831  | 0.8324 |
| 0.078         | 7.7016 | 6500 | 0.7070          | 0.8323   | 0.8339    | 0.83   | 0.8319 |
| 0.0658        | 8.2939 | 7000 | 0.7818          | 0.8287   | 0.8077    | 0.8627 | 0.8343 |
| 0.0693        | 8.8865 | 7500 | 0.7682          | 0.8332   | 0.8337    | 0.8323 | 0.8330 |
| 0.0579        | 9.4788 | 8000 | 0.7984          | 0.832    | 0.8266    | 0.8403 | 0.8334 |
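
The evaluation metrics reported at the top of this card match the step-4000 row (epoch ≈ 4.74). These metric columns are consistent with a standard compute_metrics callback passed to the Trainer; the following is a minimal sketch, assuming binary labels and scikit-learn, not the exact function used for this run:

```python
import numpy as np
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

def compute_metrics(eval_pred):
    """Illustrative metric function; binary labels are an assumption."""
    logits, labels = eval_pred
    preds = np.argmax(logits, axis=-1)
    precision, recall, f1, _ = precision_recall_fscore_support(
        labels, preds, average="binary"
    )
    return {
        "accuracy": accuracy_score(labels, preds),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }
```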

Framework versions

  • Transformers 4.48.1
  • Pytorch 2.3.1
  • Datasets 3.2.0
  • Tokenizers 0.21.0