title: Arabic Transcriber Pro
emoji: 🗣️
colorFrom: green
colorTo: red
sdk: streamlit
sdk_version: 1.48.0
app_file: app.py
pinned: true
Arabic Transcriber Pro
Convert Arabic speech to text with precision, powered by NVIDIA NeMo and Streamlit.
Live Demo: https://huggingface.co/spaces/NightPrince/Arabic-ASR
Portfolio: https://nightprincey.github.io/Portfolio/
Screenshot: Gloomy-elegant UI with real-time transcription and audio visualization
Overview
Arabic Transcriber Pro is a sleek, AI-powered web application that converts spoken Arabic audio into accurate, readable text using NVIDIA's state-of-the-art NeMo ASR model. Designed with a modern, gloomy-elegant aesthetic, this tool delivers fast, reliable transcription for podcasts, interviews, lectures, and more, all within a user-friendly Streamlit interface hosted on Hugging Face Spaces.
Built by Yahya Alnwsany (AI Engineer, NLP Specialist, and Hugging Face Ambassador), this project reflects a deep commitment to advancing Arabic NLP and making AI accessible for real-world applications.
Live Demo: https://huggingface.co/spaces/NightPrince/Arabic-ASR
Developer Portfolio: https://nightprincey.github.io/Portfolio/
Features
- High-Accuracy Arabic ASR using `nvidia/stt_ar_fastconformer_hybrid_large_pcd_v1.0`
- Multi-Format Support: WAV, MP3, OGG, FLAC, M4A
- Auto Audio Conversion: Resamples to 16 kHz mono WAV for optimal model input
- Fast Processing with real-time progress feedback
- Downloadable Transcripts in `.txt` format
- Web-Based UI with Streamlit, no installation needed
- Elegant Dark Theme with RTL-ready Arabic text rendering
- Audio Metadata Display: Duration, sample rate, channels
- Cached Model Loading for improved performance
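The audio metadata named in the list above (duration, sample rate, channels) can be read from a WAV file with Python's standard `wave` module. This is a minimal sketch, not the app's actual code, which this README does not show. The helper names `wav_metadata` and `make_silent_wav` are illustrative assumptions, and the silent clip exists only so the example runs without an audio file.

```python
import io
import wave


def wav_metadata(source) -> dict:
    """Return duration (seconds), sample rate (Hz), and channel count of a WAV.

    `source` may be a file path or a binary file-like object.
    `wav_metadata` is a hypothetical helper, not a function from the app.
    """
    with wave.open(source, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "duration_s": frames / rate,
            "sample_rate_hz": rate,
            "channels": wav.getnchannels(),
        }


def make_silent_wav(seconds: float, rate: int = 16000, channels: int = 1) -> io.BytesIO:
    """Build an in-memory 16-bit PCM WAV of silence, used only for the demo."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * int(seconds * rate) * channels)
    buffer.seek(0)  # rewind so the buffer can be read back
    return buffer


if __name__ == "__main__":
    print(wav_metadata(make_silent_wav(1.5)))
```

Because the app converts every upload to WAV before transcription, a reader like this would only ever need to handle the converted file.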
UI Design Highlights
- Color Palette: Deep navy (`#0b132b`, `#1c2541`) with teal (`#5bc0be`) and coral (`#e55934`) accents
- Typography: Clean, modern sans-serif with RTL support
- Interactive Elements: Smooth progress bars, hover effects, and responsive layout
- Responsive Cards & Gradient Headers for a professional feel
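As a rough illustration of how the palette and RTL support above might be wired into a Streamlit app, the sketch below builds a CSS string in Python. The app's real stylesheet is not shown in this README, so the selectors (`.stApp`, `.transcript`), the palette key names, and the function name `build_theme_css` are all assumptions made for the example. Only the four hex colors come from the list above.

```python
# Hex values come from the palette list above; the key names are illustrative.
PALETTE = {
    "background": "#0b132b",  # deep navy
    "surface": "#1c2541",     # lighter navy
    "accent": "#5bc0be",      # teal
    "highlight": "#e55934",   # coral
}


def build_theme_css(palette: dict = PALETTE) -> str:
    """Return a <style> block for a dark theme with an RTL transcript box.

    The `.transcript` class is hypothetical; the app's real selectors
    are not shown in this README.
    """
    return f"""
<style>
.stApp {{
    background: linear-gradient(180deg, {palette['background']}, {palette['surface']});
}}
.transcript {{
    direction: rtl;      /* lay out Arabic text right to left */
    text-align: right;
    border: 1px solid {palette['accent']};
    border-right: 4px solid {palette['highlight']};
}}
</style>
"""
```

In a Streamlit app, a string like this would typically be injected once near the top of the script with `st.markdown(build_theme_css(), unsafe_allow_html=True)`.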
Tech Stack
| Component | Technology |
|---|---|
| Frontend | Streamlit |
| ASR Engine | NVIDIA NeMo |
| Audio Processing | pydub, soundfile |
| Styling | Custom CSS (Dark Theme, RTL Support) |
| Hosting | Hugging Face Spaces |
| Deployment | Docker / Streamlit / Git |
Try It Live
Visit the live app on Hugging Face:
https://huggingface.co/spaces/NightPrince/Arabic-ASR
No setup required: just upload an Arabic audio file and get instant transcription.
Project Structure
Arabic-transcriber-pro/
│
├── app.py              # Main Streamlit application
├── requirements.txt    # Python dependencies
└── README.md           # This file
Supported Audio Formats
| Format | Extension | Notes |
|---|---|---|
| WAV | `.wav` | Native support |
| MP3 | `.mp3` | Requires ffmpeg |
| OGG | `.ogg` | Vorbis/Opus |
| FLAC | `.flac` | Lossless |
| M4A | `.m4a` | AAC audio |

All files are automatically converted to 16 kHz mono WAV before transcription.
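The conversion step can be sketched as follows. The app's own conversion code is not shown in this README; the tech stack lists pydub, which relies on ffmpeg for compressed formats. Rather than guess at the app's code, this example builds an equivalent ffmpeg command line directly. The function name `build_ffmpeg_command`, the `SUPPORTED_EXTENSIONS` set, and the file paths are illustrative assumptions.

```python
from pathlib import Path

# Extensions taken from the table above.
SUPPORTED_EXTENSIONS = {".wav", ".mp3", ".ogg", ".flac", ".m4a"}
TARGET_SAMPLE_RATE = 16000  # Hz
TARGET_CHANNELS = 1         # mono


def build_ffmpeg_command(src: str, dst: str) -> list[str]:
    """Build an ffmpeg argument list that writes `src` as 16 kHz mono audio to `dst`.

    Hypothetical helper: it only assembles the command and does not run it.
    Raises ValueError if the input extension is not in the supported set.
    """
    suffix = Path(src).suffix.lower()
    if suffix not in SUPPORTED_EXTENSIONS:
        raise ValueError(f"Unsupported audio format: {suffix or '(none)'}")
    return [
        "ffmpeg",
        "-y",                             # overwrite dst if it already exists
        "-i", src,                        # input file
        "-ac", str(TARGET_CHANNELS),      # downmix to mono
        "-ar", str(TARGET_SAMPLE_RATE),   # resample to 16 kHz
        dst,                              # output format inferred from extension
    ]


if __name__ == "__main__":
    print(build_ffmpeg_command("interview.mp3", "interview_16k.wav"))
```

On a machine with ffmpeg installed, the resulting list can be passed to `subprocess.run(cmd, check=True)`. Checking the extension first gives users a clear error message instead of an ffmpeg failure.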