Translation
Transformers
Safetensors
English
Dutch
marian
text2text-generation
machine-translation
low-resource
creativity
Eval Results (legacy)
Instructions to use Muniekstache/EN-FR_to_EN-NL_Non-Creative_MarianMT_LRL_Finetuned with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Muniekstache/EN-FR_to_EN-NL_Non-Creative_MarianMT_LRL_Finetuned with Transformers:
# Use a pipeline as a high-level helper # Warning: Pipeline type "translation" is no longer supported in transformers v5. # You must load the model directly (see below) or downgrade to v4.x with: # 'pip install "transformers<5.0.0' from transformers import pipeline pipe = pipeline("translation", model="Muniekstache/EN-FR_to_EN-NL_Non-Creative_MarianMT_LRL_Finetuned")# Load model directly from transformers import AutoTokenizer, AutoModelForMultimodalLM tokenizer = AutoTokenizer.from_pretrained("Muniekstache/EN-FR_to_EN-NL_Non-Creative_MarianMT_LRL_Finetuned") model = AutoModelForMultimodalLM.from_pretrained("Muniekstache/EN-FR_to_EN-NL_Non-Creative_MarianMT_LRL_Finetuned") - Notebooks
- Google Colab
- Kaggle
# Load model directly
from transformers import AutoTokenizer, AutoModelForMultimodalLM
tokenizer = AutoTokenizer.from_pretrained("Muniekstache/EN-FR_to_EN-NL_Non-Creative_MarianMT_LRL_Finetuned")
model = AutoModelForMultimodalLM.from_pretrained("Muniekstache/EN-FR_to_EN-NL_Non-Creative_MarianMT_LRL_Finetuned")Quick Links
EN-DE parent ➜ EN-NL fine-tuned on creative corpus
Authors: Niek Holter
Thesis: “Transferring Creativity”
Summary
This model starts from Helsinki-NLP’s MarianMT opus-mt-en-fr and is fine-tuned on a 10k-sentence non-creative English–Dutch corpus (Journalistic texts).
It is one of four systems trained for my bachelor’s thesis to study how transfer-learning settings affect MT creativity.
| Parent model | Fine-tune data | BLEU | COMET | Transformer Creativity Score |
|---|---|---|---|---|
| en-de | Creative | 9.950 | 0.574 | 0.34 |
Intended use
- Research on creative MT and low-resource transfer learning
Training details
- Hardware : NVIDIA GTX 1070 (CUDA 12.1)
- Epochs : Early-stopped ≤ 200 (patience 5)
- LR / batch : 2 e-5 / 16
- Script :
finetuning.py - Env :
environment.yml
Data
- Non-Creative corpus 10k sentences from DPC Journalistic texts.
- Sentence-level 1:1 alignments; deduplicated to avoid leakage.
See https://github.com/muniekstache/Transfer-Creativity.git for full pipeline.
- Downloads last month
- 3
Evaluation results
- SacreBLEU on Dutch Parallel Corpus Journalistic textstest set self-reported9.950
# Use a pipeline as a high-level helper # Warning: Pipeline type "translation" is no longer supported in transformers v5. # You must load the model directly (see below) or downgrade to v4.x with: # 'pip install "transformers<5.0.0' from transformers import pipeline pipe = pipeline("translation", model="Muniekstache/EN-FR_to_EN-NL_Non-Creative_MarianMT_LRL_Finetuned")