---
title: Quantum_Speech_Recognizer
emoji: 🏆
colorFrom: red
colorTo: pink
sdk: gradio
sdk_version: 5.23.3
app_file: app.py
pinned: true
license: apache-2.0
short_description: Speech To Text
---

# Quantum Speech Recognizer

This is a simple speech recognition application using the Quantum_STT model from Hugging Face. Upload an audio file to get its transcription.

## How to Use

1. Upload an audio file in one of the formats listed under Supported Formats.
2. The application will transcribe the audio and display the text.
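
The two steps above could be wired up in Gradio roughly as follows. This is a minimal sketch and not necessarily how this Space's `app.py` is written: the names `transcribe` and `build_demo` are hypothetical, and `recognizer` is assumed to be any callable that takes a file path and returns a dict with a `"text"` key, as a transformers speech recognition pipeline does.

```python
def transcribe(audio_path, recognizer):
    """Return the transcription for an uploaded file, or a hint if none was given."""
    # Gradio passes None when the user submits without uploading a file.
    if audio_path is None:
        return "Please upload an audio file."
    result = recognizer(audio_path)
    return result["text"].strip()


def build_demo(recognizer):
    """Build a Gradio interface around `transcribe` (hypothetical helper)."""
    import gradio as gr  # imported lazily so `transcribe` is usable without Gradio

    def _on_submit(audio_path):
        return transcribe(audio_path, recognizer)

    return gr.Interface(
        fn=_on_submit,
        inputs=gr.Audio(sources=["upload"], type="filepath", label="Audio file"),
        outputs=gr.Textbox(label="Transcription"),
        title="Quantum Speech Recognizer",
    )
```

With a recognizer in hand, `build_demo(recognizer).launch()` would start the app locally.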

## Supported Formats

- .caf
- .au
- .opus
- .amr
- .alac
- .aiff
- .wma
- .m4a
- .ogg
- .aac
- .flac
- .wav
- .mp3
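
A quick way to check a file against this list before uploading is to compare its extension. The sketch below is illustrative only; `SUPPORTED_EXTENSIONS` and `is_supported` are hypothetical names, and the check looks only at the file name, not at the actual audio encoding.

```python
from pathlib import Path

# Extensions copied from the list above.
SUPPORTED_EXTENSIONS = frozenset({
    ".caf", ".au", ".opus", ".amr", ".alac", ".aiff", ".wma",
    ".m4a", ".ogg", ".aac", ".flac", ".wav", ".mp3",
})


def is_supported(filename):
    """Return True if the file name ends in one of the supported extensions."""
    # Lower-case the suffix so "TALK.MP3" and "talk.mp3" are treated alike.
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS
```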

## Dependencies

- transformers
- gradio
- pydub
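
As a rough illustration of how these dependencies might fit together, the sketch below uses pydub to convert an upload to WAV and transformers to load a speech recognition pipeline. Several things here are assumptions rather than facts about this Space: the helper names are hypothetical, the 16 kHz mono target is a common choice for speech models but is not confirmed for Quantum_STT, and the model repository ID is left as a parameter because it is not stated here. pydub also needs ffmpeg installed to decode most of the listed formats.

```python
from pathlib import Path


def wav_path_for(src, out_dir):
    """Return the output path: same file stem as `src`, with a .wav extension."""
    return Path(out_dir) / (Path(src).stem + ".wav")


def convert_to_wav(src, out_dir, sample_rate=16000):
    """Decode `src` with pydub and write a mono WAV file (assumed 16 kHz)."""
    from pydub import AudioSegment  # lazy import; requires ffmpeg at runtime

    audio = AudioSegment.from_file(src)
    audio = audio.set_frame_rate(sample_rate).set_channels(1)
    dst = wav_path_for(src, out_dir)
    audio.export(str(dst), format="wav")
    return dst


def load_recognizer(model_id):
    """Load an ASR pipeline; `model_id` is the Hub repository of the model."""
    from transformers import pipeline  # lazy import; downloads weights on first use

    return pipeline("automatic-speech-recognition", model=model_id)
```

Calling the returned pipeline with the converted file's path returns a dict whose `"text"` entry holds the transcription.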

## Note

This application is designed to run on Hugging Face ZeroGPU hardware. ZeroGPU allocates a GPU on demand for each request, so it is not a CPU-only tier.


Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference