Compiled TensorRT-LLM engines for running Whisper with much faster inference.
baseten (company, verified)
Models (729)

baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-A10G-v0.20.0-TP2
Updated
•
1

baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-A10G-v0.20.0-TP1
Updated
•
1

baseten/btest-TinyLlama-1.1B-Chat-v1.0-NVIDIA-A10G-v0.20.0-TP1
Updated
•
13

baseten/whisper_trt_large_v3_fixed_20240624_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_13_0
Updated

baseten/btest-Qwen-0.5B-NVIDIA-H100-80GB-HBM3-v0.20.0-TP1
Updated
•
7

baseten/qwen3-rerank-8b-h100
Updated
•
4

baseten/qwen3-embed-8b-h100
Updated
•
5

baseten/qwen3-embed-0.6b-h100
Updated
•
4

baseten/orpheus-3b-0.1-ft-fp8
3B
•
Updated
•
60

baseten/orpheus-3b-0.1-ft-fp8-fix
3B
•
Updated
•
8