File size: 921 Bytes
82196f0
676da5b
 
 
 
82196f0
 
 
676da5b
82196f0
676da5b
82196f0
 
e227e6f
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
82196f0
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
---
title: Quantum_Speach_Recognizer
emoji: πŸ†
colorFrom: red
colorTo: pink
sdk: gradio
sdk_version: 5.23.3
app_file: app.py
pinned: true
license: apache-2.0
short_description: Speach To Text
---

# Quantum Speech Recognizer

This is a simple speech recognition application using the Quantum_STT model from Hugging Face. Upload an audio file to get its transcription.

## How to Use

1. Upload an audio file in one of the supported formats: .caf, .au, .opus, .amr, .alac, .aiff, .wma, .m4a, .ogg, .aac, .flac, .wav, .mp3.
2. The application will transcribe the audio and display the text.

## Supported Formats

- .caf
- .au
- .opus
- .amr
- .alac
- .aiff
- .wma
- .m4a
- .ogg
- .aac
- .flac
- .wav
- .mp3

## Dependencies

- transformers
- gradio
- pydub

## Note

This application is designed to run on CPU (ZeroGPU).


Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference