Compiled engines for running Whisper with TRT LLM for much faster inference.
baseten
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
models
662

baseten/whisper_trt_large_v3_250729_NVIDIA_H100_80GB_HBM3_0_21_0
Updated

baseten/whisper_trt_large_v3_250729_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_21_0
Updated

baseten/whisper_trt_large_v3_250729_NVIDIA_L4_0_21_0
Updated

baseten/whisper_trt_large_v3_250729_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_1_0_0rc6
Updated

baseten/7b-fp8-dynamic
8B
•
Updated
•
6

baseten/gemma-3-27b-causallm-it
27B
•
Updated
•
36

baseten/DummyGemmaTextModelForEmbedding
Feature Extraction
•
0.3B
•
Updated
•
21

baseten/q-r-e
0.2B
•
Updated
•
17

baseten/Kimi-K2-Instruct-FP4
581B
•
Updated
•
2.42k
•
1

baseten/whisper_trt_large_v3_turbo_250730_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_13_0
Updated