---
license: apache-2.0
language:
- ps
tags:
- pashto
- asr
- tts
- nlp
---

# 🌍 Pukhto/Pashto Open Language Project

Community-led open-source project to make Pashto a first-class language in AI speech and language tooling.

## 🔗 Project Links

- GitHub: `https://github.com/Musawer1214/Pukhto_Pashto`
- Hugging Face: `https://huggingface.co/Musawer14/Pukhto_Pashto`

## 🎯 Core Goal

- Build open datasets, benchmarks, and models for Pashto ASR, TTS, and NLP.
- Keep work reproducible, transparent, and contribution-friendly.
- Focus on public good and broad accessibility.

## 🚀 Start Here

- 📘 Purpose: `PROJECT_PURPOSE.md`
- 🤝 Contributing: `CONTRIBUTING.md`
- 🗺️ Roadmap: `ROADMAP.md`
- 🏛️ Governance: `GOVERNANCE.md`
- 💬 Community coordination: `community/COMMUNICATION.md`

## 🧩 Initial Workstreams

- `data/`: Pashto data collection, cleaning, and metadata
- `asr/`: speech-to-text baselines and experiments
- `tts/`: text-to-speech baselines and experiments
- `benchmarks/`: fixed test sets and evaluation scripts
- `apps/desktop/`: app integration references