9-Volt Fan
9voltfan2009
6 followers · 8 following
AI & ML interests
None yet
Recent Activity
Updated a model about 23 hours ago: 9voltfan2009/DorkDiaries-RVC
Reacted to fdaudens's post with 👍 2 days ago:

Forget everything you know about transcription models - NVIDIA's parakeet-tdt-0.6b-v2 changed the game for me!

Just tested it with Steve Jobs' Stanford speech and was speechless (pun intended). The video isn’t sped up.

3 things that floored me:
- Transcription took just 10 seconds for a 15-min file
- Got a CSV with perfect timestamps, punctuation & capitalization
- Stunning accuracy (correctly captured "Reed College" and other specifics)

NVIDIA also released a demo where you can click any transcribed segment to play it instantly.

The improvement is significant: number 1 on the ASR Leaderboard, 6% error rate (best in class) with complete commercial freedom (cc-by-4.0 license).

Time to update those Whisper pipelines! H/t @Steveeeeeeen for the finding!

Model: https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2
Demo: https://huggingface.co/spaces/nvidia/parakeet-tdt-0.6b-v2
ASR Leaderboard: https://huggingface.co/spaces/hf-audio/open_asr_leaderboard
Reacted to fdaudens's post with 🔥 2 days ago (the same parakeet-tdt-0.6b-v2 post as above).
Organizations
Models (2)
9voltfan2009/DorkDiaries-RVC (updated about 23 hours ago)
9voltfan2009/WarioWareMoveIt-RVC (updated 8 days ago) • 2
Datasets (0): none public yet