Whisper Engines Compiled engines for running Whisper with TRT LLM for much faster inference. baseten/whisper_trt_large-v3_NVIDIA_A100-SXM4-80GB_i224_o512_bs32_bw5_int4 Updated Jun 21, 2024 baseten/whisper_trt_large-v3_A10G_i224_o512_bs8_bw5 Updated May 21, 2024 baseten/whisper_trt_large-v3_L4_i224_o512_bs8_bw5 Updated May 21, 2024 baseten/whisper_trt_large-v3_H100_i224_o512_bs8_bw5 Updated May 21, 2024
Whisper Engines Compiled engines for running Whisper with TRT LLM for much faster inference. baseten/whisper_trt_large-v3_NVIDIA_A100-SXM4-80GB_i224_o512_bs32_bw5_int4 Updated Jun 21, 2024 baseten/whisper_trt_large-v3_A10G_i224_o512_bs8_bw5 Updated May 21, 2024 baseten/whisper_trt_large-v3_L4_i224_o512_bs8_bw5 Updated May 21, 2024 baseten/whisper_trt_large-v3_H100_i224_o512_bs8_bw5 Updated May 21, 2024