Pashto Language Resources Hub (Pukhto/Pashto)
Open-source Pashto language technology hub for datasets, models, benchmarks, ASR, TTS, NLP, and machine translation.
This repository curates verified Pashto resources and keeps validation and publishing workflows reproducible.
Quick Links
- Search page: Pashto Resource Search
- Project site: Pashto Language Resources Hub
- Documentation hub: docs/README.md
- GitHub: Musawer1214/pashto-language-resources
- Hugging Face mirror: Musawer14/pashto-language-resources
Repository Structure
- resources/: verified external resources with structured categories.
- data/: normalization seeds, metadata, and data workflows.
- asr/: ASR notes, baselines, and references.
- tts/: TTS notes, baselines, and references.
- benchmarks/: schemas, result templates, and evaluation guidance.
- experiments/: reproducible run-card templates.
- docs/: SEO, release, platform, and contribution documentation.
Resource Workflow
- Discovery job (.github/workflows/resource_sync.yml) updates candidate feed.
- Automation promotes valid non-duplicate candidates into resources/catalog/resources.json.
- Regeneration and validation update derived views and search index.
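The promotion step in the workflow above (valid, non-duplicate candidates moving into the catalog) might look roughly like the sketch below. It is not the repository's actual automation: the function names, the `title` and `url` fields, and the validity rule are all assumptions made for illustration, and the real schema of resources/catalog/resources.json may differ.

```python
from typing import Iterable


def normalize_url(url: str) -> str:
    """Strip whitespace and trailing slashes so near-identical URLs compare equal."""
    return url.strip().rstrip("/")


def promote_candidates(catalog: list[dict], candidates: Iterable[dict]) -> list[dict]:
    """Return a new catalog list with valid, non-duplicate candidates appended.

    Hypothetical helper: entries are assumed to be dicts with "title" and "url" keys.
    """
    seen = {normalize_url(entry["url"]) for entry in catalog}
    promoted = list(catalog)  # copy, so the input catalog is left unchanged
    for candidate in candidates:
        title = candidate.get("title", "").strip()
        url = candidate.get("url", "").strip()
        # Assumed validity rule: a non-empty title and an http(s) URL.
        if not title or not url.startswith(("http://", "https://")):
            continue
        key = normalize_url(url)
        if key in seen:
            continue  # duplicate of an existing entry or an earlier candidate
        seen.add(key)
        promoted.append(candidate)
    return promoted
```

Deduplicating on a normalized URL rather than on the title is a design choice in this sketch: two resources can share a name, but the same link listed twice is almost certainly a duplicate.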
Core commands:
python scripts/validate_resource_catalog.py
python scripts/generate_resource_views.py
python scripts/check_links.py
python -m pytest -q
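To run the commands above in one pass and stop at the first failure, a small wrapper along these lines could be used. This is a sketch only: `CORE_COMMANDS` and `run_checks` are hypothetical names, no such wrapper is known to exist in the repository, and the order simply follows the list above.

```python
import subprocess
import sys
from typing import Callable, Sequence

# Commands copied from the list above; running them in this order is an assumption.
CORE_COMMANDS: list[list[str]] = [
    [sys.executable, "scripts/validate_resource_catalog.py"],
    [sys.executable, "scripts/generate_resource_views.py"],
    [sys.executable, "scripts/check_links.py"],
    [sys.executable, "-m", "pytest", "-q"],
]


def run_checks(
    commands: Sequence[Sequence[str]],
    runner: Callable[[Sequence[str]], int] = subprocess.call,
) -> int:
    """Run each command in order; return the first non-zero exit code, or 0."""
    for command in commands:
        code = runner(command)
        if code != 0:
            print(f"failed: {' '.join(command)} (exit {code})")
            return code
    return 0
```

Calling `run_checks(CORE_COMMANDS)` from the repository root would execute all four steps; the `runner` parameter exists only so the function can be exercised without launching real processes.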
SEO and Discoverability
- SEO playbook: docs/discoverability_seo.md
- GitHub topics checklist: docs/github_topics_checklist.md
- Backlink strategy: docs/backlink_strategy.md
- Platform sync policy: docs/platform_sync_policy.md
- Search UI source: docs/search/index.html
- Citation metadata: CITATION.cff
Releases
- Release notes index: docs/releases/README.md
- Latest release notes: v1.1.1
- Changelog: CHANGELOG.md
Contributing
- Contribution guide: CONTRIBUTING.md
- Community communication: community/COMMUNICATION.md
- Resource guidelines: docs/dataset_guidelines.md