How to use razhan/whisper-base-glk with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="razhan/whisper-base-glk")

# Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("razhan/whisper-base-glk")
model = AutoModelForSpeechSeq2Seq.from_pretrained("razhan/whisper-base-glk")

This model is a fine-tuned version of openai/whisper-base on the razhan/DOLMA-speech Gilaki dataset. It achieves the following results on the evaluation set:
More information needed
The following hyperparameters were used during training:
Training results:
| Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
|---|---|---|---|---|---|
| No log | 1.0 | 6 | 3.5224 | 1.1311 | 0.5560 |
| 2.4889 | 2.0 | 12 | 3.4807 | 1.0566 | 0.5018 |
| 2.4889 | 3.0 | 18 | 3.2108 | 1.0561 | 0.4986 |
| 2.3707 | 4.0 | 24 | 2.9445 | 1.0583 | 0.5155 |
| 2.0528 | 5.0 | 30 | 2.6806 | 1.0472 | 0.5468 |
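The Wer column above stays above 1.0 at every epoch, which can look like an error. Assuming the values are reported as fractions rather than percentages, and that the usual edit-distance definitions of WER and CER apply (the card does not say which metric implementation was used), a value above 1.0 simply means the hypothesis needed more word-level edits than the reference has words. The following is a minimal pure-Python sketch of those definitions; the function names and the toy English sentences are illustrative only and are not taken from the DOLMA-speech data.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences.

    Counts the minimum number of substitutions, insertions and
    deletions needed to turn `ref` into `hyp`.
    """
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference token
                curr[j - 1] + 1,     # insertion of a hypothesis token
                prev[j - 1] + cost,  # substitution or exact match
            ))
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edits divided by reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edits divided by reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)


# Toy example (not from the dataset): a hypothesis much longer than the
# reference yields more edits than there are reference words.
reference = "the cat sat"
hypothesis = "a cat sat on the mat today"
print(f"WER = {wer(reference, hypothesis):.3f}")  # 1 substitution + 4 insertions over 3 words
print(f"CER = {cer(reference, hypothesis):.3f}")
```

Because both metrics divide by the reference length only, insertions are unbounded, so a model that produces long or repetitive output can score above 1.0 on WER while its CER stays lower, as in the table.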
Base model
openai/whisper-base