M2M100 Model 418M for Lopatnov.Translate

This repository contains an ONNX-converted version of the M2M100 418M model, optimized for the Lopatnov.Translate service. These files are converted from the original weights in the facebook/m2m100_418M repository.

Conversion recipe

conda create -n model_export python=3.10 -y
conda activate model_export
pip install "optimum[onnxruntime,export,openvino]" transformers sentencepiece
optimum-cli export onnx --model facebook/m2m100_418M --task seq2seq-lm --dtype fp32 --opset 18 ./m2m100_418M

Configuration

To configure model for the service, please modify appsettings.json:

Example configuration:

{
  "Models": {
    "m2m100_418M": {
      "Type": "M2M100",
      "Path": "../../models/translate/m2m100_418M",
      "EncoderFile": "encoder_model.onnx",
      "DecoderFile": "decoder_model.onnx",
      "TokenizerFile": "sentencepiece.bpe.model",
      "TokenizerConfigFile": "added_tokens.json",
      "MaxTokens": 512,
      "VocabFile": "vocab.json"
    }
  },
  // These configurations are optional
  "Translation": {
    "AllowedModels": [ "m2m100_418M" ], // Whitelist allowed models if needed
    "DefaultModel": "m2m100_418M", // Set this model as the default
    "WarmUp": [ "m2m100_418M" ] // Load the model into memory on startup
  }
}

Note: Make sure the Path correctly points to the location of your downloaded .onnx files relative to application.

Documentation

For advanced configuration options, visit the Official Documentation.

Downloads last month: 21

Model tree for lopatnov/m2m100_418M-onnx

Base model

facebook/m2m100_418M

Quantized

(4)

this model