How can I align timesteps to text for Parakeet-tdt-0.6b-v2 output using KenLM?
2
#48 opened 5 days ago
by
Nguyen667201

Poor WER when trying to fine-tune Parakeet v2 TDT to other dataset than English
#47 opened 13 days ago
by
pronoobie

OutOfMemoryError: CUDA out of memory. on RTX A5000
1
#46 opened 15 days ago
by
akskuchi

Recipes to Finetune to new Language example Hindi (Finally figured out)
β€οΈ
2
4
#45 opened 15 days ago
by
pronoobie

how to load the model from local directory?
1
#44 opened 22 days ago
by
ALu7

Use this to fill web forms no typing (STT server)
β€οΈ
π₯
2
#43 opened 23 days ago
by
pronoobie

Real-Time Mic Transcription on free 2vCPU - using this model, check it out
β€οΈ
π
5
5
#41 opened 24 days ago
by
WJ88

How should I start word-level timestampοΌ
1
#40 opened 25 days ago
by
ppoudd
Speaker Diarization ??
π
1
#39 opened 26 days ago
by
vasanth5596
Word boosting / context biasing
4
#34 opened 28 days ago
by
hoavu1234
Why does using the same fastconformer_hybrid_tdt_ctc_bpe.yaml config to fine-tune pre-train model result in a "mismatch" error?
1
#33 opened 29 days ago
by
Nguyen667201

How can I get timestamps when using KenLM with the model?
1
#32 opened 29 days ago
by
Nguyen667201

parakeet as a local MCP server
β€οΈ
1
#31 opened 29 days ago
by
alexmnahas
Besides GPU, can any other edge accellerators run it. EX: (Hailo AI Hat for RPI)
4
#30 opened 29 days ago
by
Flyingcrabs
Is the model capable of splitting different speakers?
π
1
1
#29 opened 30 days ago
by
BigDeeper
How can I get the correct y_sequence format as I expect?
#27 opened about 1 month ago
by
Nguyen667201

Model initialization
1
#22 opened about 1 month ago
by
Homin
Bug report
π
1
2
#20 opened about 1 month ago
by
JustinRocks
Is CUDA supported when running on Jetson Orin?
4
#19 opened about 1 month ago
by
kikaitachi

Seeking a Clear Guide for Fine-Tuning NVIDIA NeMo Models on New English Audio Domains
1
#18 opened about 1 month ago
by
jacktol

Only English is supported?
4
#17 opened about 1 month ago
by
wangleineo
Does this model identifies speaker?
π
1
8
#16 opened about 1 month ago
by
SouravAhmed

How can I transcribe an audio file thatβs longer than an hour when I have only 12β―GB of VRAM?
6
#15 opened about 1 month ago
by
will1130
New Language Training
π
π₯
7
6
#11 opened about 1 month ago
by
ali-amiri
What is the data format for training?
5
#10 opened about 1 month ago
by
Nguyen667201

ONNX conversion
14
#9 opened about 1 month ago
by
Berrisius

Ignores repeated words
1
#8 opened about 1 month ago
by
HeadlessBandit

Finetuning With Custom Data Tutorial
2
#7 opened about 1 month ago
by
SirCodesAlot
A German Version would be fantastic!
2
#6 opened about 1 month ago
by
Buttermilk03
We clearly need a french version
π
1
1
#4 opened about 2 months ago
by
Sarwg
Streaming?
β
6
12
#3 opened about 2 months ago
by
pscar
Please do It for Japanese
π
1
4
#2 opened about 2 months ago
by
riken12
Local Installation Video and Testing - Step by Step
π
3
1
#1 opened about 2 months ago
by
fahdmirzac
