Compiled engines for running Whisper with TRT LLM for much faster inference.
baseten
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
models
672

baseten/whisper_trt_large_v2_251013_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_21_0
Updated

baseten/whisper_trt_large_v3_turbo_251013_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_21_0
Updated

baseten/whisper_trt_large_v2_251013_NVIDIA_H100_80GB_HBM3_0_21_0
Updated

baseten/whisper_trt_large_v3_251013_NVIDIA_L4_0_21_0
Updated

baseten/whisper_trt_large_v2_251013_NVIDIA_L4_0_21_0
Updated

baseten/whisper_trt_large_v3_251013_NVIDIA_H100_80GB_HBM3_0_21_0
Updated

baseten/whisper_trt_large_v3_turbo_251013_NVIDIA_L4_0_21_0
Updated

baseten/whisper_trt_large_v3_turbo_251013_NVIDIA_H100_80GB_HBM3_0_21_0
Updated

baseten/whisper_trt_large_v3_251013_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_21_0
Updated

baseten/Llama-3.2-3B-Instruct-pythonic
Text Generation
•
3B
•
Updated
•
1.93k