mbart-en-np-seqtoseq-sentence-translation

This model is a fine-tuned version of facebook/mbart-large-50-many-to-many-mmt on an unknown dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss	Bleu	Gen Len
1.0147	1.0	1250	0.9876	40.1501	9.885
0.6038	2.0	2500	1.0122	40.728	10.113
0.3557	3.0	3750	1.0809	35.9297	10.844
0.2071	4.0	5000	1.1502	40.4318	10.28
0.1241	5.0	6250	1.1896	40.4595	10.288

Safetensors

Model size

0.6B params

Tensor type

F32

Base model

Finetuned

(254)

this model