musaw
docs(seo): add intent pages, topics checklist, backlink plan, and release drafts
9003457
|
Raw
History Blame
2.91 kB
metadata
layout: default
title: Pashto Language Resources Hub
description: >-
  Open-source Pashto (Pukhto/Pashto) datasets, models, benchmarks, ASR, TTS,
  NLP, and MT resources.

Pashto Language Resources Hub

pashto-language-resources is a community-led open-source project focused on making Pashto (also written as Pukhto/Pushto) a first-class language in speech and language AI.

Mission

  • Build open Pashto datasets and resource indexes for ASR, TTS, NLP, and MT.
  • Publish reproducible baseline models, benchmark schemas, and evaluation workflows.
  • Keep progress transparent, contributor-friendly, and public-benefit oriented.

What Is In This Repository

  • data/: dataset workflows, metadata, and normalization seeds.
  • asr/: ASR baselines and experiment notes.
  • tts/: TTS baselines and quality tracking.
  • benchmarks/: benchmark schema, result format, and metric guidance.
  • experiments/: reproducible run cards and experiment records.
  • docs/: policies, roadmap, release process, and operating guides.
  • resources/: verified external Pashto datasets, models, tools, benchmarks, and papers.

Search Pashto Resources

Intent Pages

Project References

SEO Operations

Contributing

You can help by improving documentation, validating normalization rows, sharing verified resources, or contributing data, model, and evaluation workflows.

For contributor workflow and standards, start at:

Search Terms

This project is relevant to searches like:

  • Pashto datasets
  • Pashto ASR model
  • Pashto TTS resources
  • Pashto NLP benchmark
  • Pashto language technology
  • Pukhto language resources

License

This project is released under Apache 2.0. See LICENSE.