metadata
title: Kabyle POS Tagger
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 6.15.2
app_file: app.py
pinned: false
license: apache-2.0

Kabyle POS Tagger Demo

Interactive demo for the boffire/kabyle-pos model, a part-of-speech (POS) tagger for Kabyle (kab), a Berber language spoken in Algeria.

Model Details

  • Base: XLM-RoBERTa-base
  • Task: Token Classification (POS tagging)
  • Test F1: 87.5%
  • Tagset: Universal Dependencies (17 tags)
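
The 17 tags are the universal POS (UPOS) categories defined by Universal Dependencies. As a rough sketch, they can be written out and used to sanity-check model output. The names `UPOS_TAGS` and `unknown_tags` are hypothetical and not taken from the demo's code, and the model's own label strings may differ (for example in casing).

```python
# The 17 universal POS tags (UPOS) defined by Universal Dependencies.
UPOS_TAGS = (
    "ADJ", "ADP", "ADV", "AUX", "CCONJ", "DET", "INTJ", "NOUN", "NUM",
    "PART", "PRON", "PROPN", "PUNCT", "SCONJ", "SYM", "VERB", "X",
)


def unknown_tags(predicted):
    """Return the predicted tags that are not UPOS categories, sorted."""
    return sorted(set(predicted) - set(UPOS_TAGS))
```

An empty list from `unknown_tags` means every predicted label belongs to the UPOS inventory.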

Usage

Type or paste a Kabyle sentence and click Submit to see predicted POS tags with confidence scores.
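
Outside the web interface, the same kind of output might be reproduced with the Hugging Face `transformers` token-classification pipeline. This is a minimal sketch, not the demo's actual `app.py`: it assumes the boffire/kabyle-pos repository loads with the standard pipeline, and the function names `tag_sentence` and `format_predictions` are hypothetical.

```python
def tag_sentence(sentence, model_id="boffire/kabyle-pos"):
    """Run the tagger and return the pipeline's raw per-token predictions."""
    # Imported lazily so format_predictions works without transformers installed.
    from transformers import pipeline

    # Assumption: the model repository is compatible with the standard pipeline.
    tagger = pipeline("token-classification", model=model_id)
    return tagger(sentence)


def format_predictions(predictions):
    """Render predictions as tab-separated 'piece, tag, confidence' lines."""
    lines = []
    for p in predictions:
        # With the default (no aggregation) setting, each dict describes one
        # sub-word piece and carries 'word', 'entity', and 'score' keys.
        lines.append(f"{p['word']}\t{p['entity']}\t{p['score']:.2f}")
    return "\n".join(lines)
```

Calling `print(format_predictions(tag_sentence(text)))` on a Kabyle sentence would list one line per sub-word piece. Merging pieces back into whole words is left out of this sketch.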

Limitations

  • Trained on ~1,200 sentences (small dataset)
  • Struggles with complex cliticized verb forms
  • Domain bias toward short translated sentences (Tatoeba corpus)
  • No diacritic normalization
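
One practical consequence of the last point: a letter typed as a single precomposed character and the same letter typed as a base letter plus a combining mark reach the model as different strings. A possible workaround, sketched here with Python's standard library, is to normalize text to Unicode NFC before tagging. The helper name `normalize_input` is hypothetical, and whether the training data is itself in NFC is an assumption to verify against the model card.

```python
import unicodedata


def normalize_input(text):
    """Compose base letters and combining marks into single code points (NFC)."""
    return unicodedata.normalize("NFC", text)
```

For example, `normalize_input("d\u0323")` returns the single code point U+1E0D (d with dot below), so both ways of typing the letter produce identical input.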

Citation

This project is a side contribution to the Masakhane initiative for African NLP. See the model card for full citation details.