Automatic Speech Recognition
Transformers
Safetensors
phi4mm
text-generation
nlp
code
audio
speech-summarization
speech-translation
visual-question-answering
phi-4-multimodal
phi
phi-4-mini
custom_code
Eval Results
Instructions to use microsoft/Phi-4-multimodal-instruct with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use microsoft/Phi-4-multimodal-instruct with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="microsoft/Phi-4-multimodal-instruct", trust_remote_code=True)# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("microsoft/Phi-4-multimodal-instruct", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Update README.md (#2)
Browse files- Update README.md (faf353bc39506c48383e5b28e743c37a601fc33b)
Co-authored-by: aergsfxds <fasdfgaer@users.noreply.huggingface.co>
README.md
CHANGED
|
@@ -158,7 +158,7 @@ MT bench scores are scaled by 10x to match the score range of MMMLU:
|
|
| 158 |
|
| 159 |

|
| 160 |
|
| 161 |
-
#### Audio
|
| 162 |
|
| 163 |
AIR bench scores are scaled by 10x to match the score range of MMAU:
|
| 164 |
|
|
|
|
| 158 |
|
| 159 |

|
| 160 |
|
| 161 |
+
#### Audio Understanding
|
| 162 |
|
| 163 |
AIR bench scores are scaled by 10x to match the score range of MMAU:
|
| 164 |
|