File size: 1,261 Bytes
f13fd7c f725a8a f13fd7c d2f0b77 f13fd7c d2f0b77 f13fd7c d2f0b77 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 | # Scripts
Automation scripts for quality checks, resource catalog validation, and search index generation.
## Available scripts
- `validate_normalization.py`: validate normalization seed TSV format and rules.
- `check_links.py`: ensure markdown links are clickable (optional online reachability check).
- `validate_resource_catalog.py`: validate `resources/catalog/resources.json`.
- `generate_resource_views.py`: generate `resources/*/README.md`, `resources/README.md`, and `docs/search/resources.json` from the catalog.
- `sync_resources.py`: collect new candidate Pashto resources from public endpoints into `resources/catalog/pending_candidates.json`.
## Usage
Validate normalization seed file:
```bash
python scripts/validate_normalization.py data/processed/normalization_seed_v0.1.tsv
```
Validate resource catalog:
```bash
python scripts/validate_resource_catalog.py
```
Generate markdown and search index from catalog:
```bash
python scripts/generate_resource_views.py
```
Sync candidate resources for maintainer review:
```bash
python scripts/sync_resources.py --limit 20
```
Check markdown links format:
```bash
python scripts/check_links.py
```
Check markdown links and verify URLs online:
```bash
python scripts/check_links.py --online
```
|