musaw commited on
Commit ·
5bb5a63
1
Parent(s): 7b1db93
Add GitHub Pages About site under docs
Browse files- README.md +1 -0
- docs/_config.yml +4 -0
- docs/index.md +44 -0
README.md
CHANGED
|
@@ -16,6 +16,7 @@ Community-led open-source project to make Pashto a first-class language in AI sp
|
|
| 16 |
## 🔗 Project Links
|
| 17 |
- GitHub: [Pukhto_Pashto](https://github.com/Musawer1214/Pukhto_Pashto)
|
| 18 |
- Hugging Face: [Musawer14/Pukhto_Pashto](https://huggingface.co/Musawer14/Pukhto_Pashto)
|
|
|
|
| 19 |
|
| 20 |
## 🎯 Core Goal
|
| 21 |
- Build open datasets, benchmarks, and models for Pashto ASR, TTS, and NLP.
|
|
|
|
| 16 |
## 🔗 Project Links
|
| 17 |
- GitHub: [Pukhto_Pashto](https://github.com/Musawer1214/Pukhto_Pashto)
|
| 18 |
- Hugging Face: [Musawer14/Pukhto_Pashto](https://huggingface.co/Musawer14/Pukhto_Pashto)
|
| 19 |
+
- GitHub Pages (About): [Pukhto_Pashto Site](https://musawer1214.github.io/Pukhto_Pashto/)
|
| 20 |
|
| 21 |
## 🎯 Core Goal
|
| 22 |
- Build open datasets, benchmarks, and models for Pashto ASR, TTS, and NLP.
|
docs/_config.yml
ADDED
|
@@ -0,0 +1,4 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
title: Pukhto Pashto
|
| 2 |
+
description: Community-led open Pashto AI language project
|
| 3 |
+
theme: minima
|
| 4 |
+
markdown: kramdown
|
docs/index.md
ADDED
|
@@ -0,0 +1,44 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
layout: default
|
| 3 |
+
title: About Pukhto Pashto
|
| 4 |
+
---
|
| 5 |
+
|
| 6 |
+
# About This Repository
|
| 7 |
+
|
| 8 |
+
`Pukhto_Pashto` is a community-led open project focused on making Pashto a first-class language in speech and language AI.
|
| 9 |
+
|
| 10 |
+
## Mission
|
| 11 |
+
|
| 12 |
+
- Build open Pashto datasets for ASR, TTS, and NLP.
|
| 13 |
+
- Publish reproducible baseline models and evaluation workflows.
|
| 14 |
+
- Keep progress transparent, contributor-friendly, and public-benefit oriented.
|
| 15 |
+
|
| 16 |
+
## What Is In This Repository
|
| 17 |
+
|
| 18 |
+
- `data/`: dataset workflows, metadata, and normalization seeds.
|
| 19 |
+
- `asr/`: ASR baselines and experiment notes.
|
| 20 |
+
- `tts/`: TTS baselines and quality tracking.
|
| 21 |
+
- `benchmarks/`: benchmark schema, result format, and metric guidance.
|
| 22 |
+
- `experiments/`: reproducible run cards and experiment records.
|
| 23 |
+
- `docs/`: policies, roadmap, release process, and operating guides.
|
| 24 |
+
- `resources/`: verified external Pashto datasets, models, tools, and benchmarks.
|
| 25 |
+
|
| 26 |
+
## Project References
|
| 27 |
+
|
| 28 |
+
- Repository: [Musawer1214/Pukhto_Pashto](https://github.com/Musawer1214/Pukhto_Pashto)
|
| 29 |
+
- Hugging Face: [Musawer14/Pukhto_Pashto](https://huggingface.co/Musawer14/Pukhto_Pashto)
|
| 30 |
+
- Purpose: [PROJECT_PURPOSE.md](../PROJECT_PURPOSE.md)
|
| 31 |
+
- Roadmap: [ROADMAP.md](../ROADMAP.md)
|
| 32 |
+
- Contributing: [CONTRIBUTING.md](../CONTRIBUTING.md)
|
| 33 |
+
|
| 34 |
+
## Contributing
|
| 35 |
+
|
| 36 |
+
You can help by improving documentation, validating normalization rows, sharing verified resources, or contributing data and evaluation workflows.
|
| 37 |
+
|
| 38 |
+
For contributor workflow and standards, start at:
|
| 39 |
+
- [docs/README.md](README.md)
|
| 40 |
+
- [community/COMMUNICATION.md](../community/COMMUNICATION.md)
|
| 41 |
+
|
| 42 |
+
## License
|
| 43 |
+
|
| 44 |
+
This project is released under Apache 2.0. See [LICENSE](../LICENSE).
|