Kyrgyz MRPC mBERT

This repository contains a fine-tuned mBERT checkpoint for Kyrgyz paraphrase detection.

Dataset

Model

  • Base model: bert-base-multilingual-cased
  • Role: multilingual baseline/reference checkpoint
  • Framework: Hugging Face Transformers

Reported Results

Metric Score
F1 0.8134
Accuracy 0.7472

Training Summary

Setting Value
Epochs 3
Batch size 64
Learning rate 2e-5
Training time 11.9 seconds

Notes

This is a multilingual reference checkpoint for the Kyrgyz MRPC task.

Intended Use

This checkpoint is intended for baseline/reference evaluation for Kyrgyz paraphrase detection. It is intended for research, reproducibility, and educational use by the Kyrgyz NLP community. It should not be used for high-stakes decisions or production deployment without separate validation for the target domain.

License and Usage

License metadata is set to other. The checkpoint is released for research and reproducibility. Downstream datasets and base models may have their own licenses or usage terms; users are responsible for following the corresponding dataset cards and upstream model licenses. The checkpoint is provided without warranty.

Downloads last month
6
Safetensors
Model size
0.2B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for metinovadilet/kyrgyz-mrpc-mbert

Finetuned
(998)
this model