-
-
-
-
-
-
Inference Providers
Active filters:
npu
magicunicorn/whisper-large-v2-amd-npu-int8
Updated
•
12
•
3
GatekeeperZA/Qwen3-1.7B-RKLLM-v1.2.3
Text Generation
•
Updated
•
11
•
1
GatekeeperZA/Phi-3-mini-4k-instruct-RKLLM-v1.2.3
Text Generation
•
Updated
•
6
•
1
dahara1/llama3-8b-amd-npu
dahara1/llama3.1-8b-Instruct-amd-npu
dahara1/ALMA-Ja-V3-amd-npu
dahara1/llama-translate-amd-npu
Translation
•
Updated
•
5
dahara1/llama-translate-gguf
8B
•
Updated
•
270
•
16
FluidInference/qwen3-0.6b-int4-ov-npu
FluidInference/qwen3-8b-int4-ov-npu
Updated
•
7
•
1
FluidInference/qwen3-1.7b-int4-ov-npu
FluidInference/qwen3-4b-int4-ov-npu
Updated
•
11
•
1
magicunicorn/kokoro-npu-quantized
Text-to-Speech
•
Updated
•
3
FluidInference/whisper-tiny-int4-ov-npu
magicunicorn/gemma-3-27b-npu-quantized
Text Generation
•
Updated
•
1
magicunicorn/unicorn-execution-engine-models
Updated
magicunicorn/whisper-large-v3-amd-npu-int8
Updated
•
9
•
3
magicunicorn/whisper-medium-amd-npu-int8
Updated
magicunicorn/whisper-small-amd-npu-int8
Updated
•
4
•
1
magicunicorn/whisper-base-amd-npu-int8
Updated
AhtnaGlen/phi-4-mini-instruct-int4-sym-npu-ov
Text Generation
•
Updated
•
10
rk-transformers/distilbert-base-uncased-finetuned-sst-2-english
rk-transformers/all-MiniLM-L6-v2
Sentence Similarity
•
Updated
•
35
rk-transformers/bert-base-uncased
Updated
rk-transformers/ms-marco-MiniLM-L12-v2
Text Ranking
•
Updated
•
2
rk-transformers/distilbert-base-cased-distilled-squad
Updated
rk-transformers/bert-base-NER
Updated
rk-transformers/bert-base-uncased_SWAG
NexaAI/rf-detr-seg-preview-npu
Object Detection
•
Updated
•
3
rk-transformers/ModernBERT-base