---
library_name: transformers
tags: []
---
# whisper-large-v3-encoder
## How to use
```python
from transformers.models.whisper.modeling_whisper import WhisperEncoder
from transformers import AutoFeatureExtractor
import torch
import librosa

# Load the encoder-only checkpoint in half precision and move it to the GPU.
encoder = WhisperEncoder.from_pretrained(
    'huseinzol05/whisper-large-v3-encoder',
    torch_dtype=torch.float16,
).cuda()
feature_extractor = AutoFeatureExtractor.from_pretrained('openai/whisper-large-v3')

# Resample the audio to 16 kHz, then convert it to log-mel input features.
y, sr = librosa.load('audio.mp3', sr=16000)
inputs = feature_extractor(y, return_tensors='pt', sampling_rate=feature_extractor.sampling_rate)
inputs['input_features'] = inputs['input_features'].to(torch.float16).cuda()

# Inference only, so skip gradient tracking.
with torch.no_grad():
    output = encoder(**inputs)  # output.last_hidden_state holds the encoder states
```
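
As a rough sanity check on the encoder output, the expected shape of `last_hidden_state` can be worked out by hand. The sketch below is not part of this repository: the helper `expected_encoder_shape` is hypothetical, and the constants are assumptions taken from the usual Whisper defaults (16 kHz audio padded to a 30-second window, a hop length of 160 samples, a stride-2 convolution in the encoder front end, and a hidden size of 1280 for the large models). If the checkpoint's config differs, the numbers will differ too.

```python
from typing import Tuple

# Assumed Whisper defaults; none of these values are stated in this card.
SAMPLING_RATE = 16_000  # samples per second
CHUNK_SECONDS = 30      # the feature extractor pads or trims to this window
HOP_LENGTH = 160        # samples between consecutive mel frames
CONV_STRIDE = 2         # temporal downsampling in the encoder front end
HIDDEN_SIZE = 1280      # assumed hidden size for whisper-large-v3


def expected_encoder_shape(batch_size: int = 1) -> Tuple[int, int, int]:
    """Return the (batch, frames, hidden) shape expected from the encoder."""
    mel_frames = SAMPLING_RATE * CHUNK_SECONDS // HOP_LENGTH
    encoder_frames = mel_frames // CONV_STRIDE
    return (batch_size, encoder_frames, HIDDEN_SIZE)


print(expected_encoder_shape())  # (1, 1500, 1280)
```

Comparing this tuple with `tuple(output.last_hidden_state.shape)` from an actual run is a quick way to confirm that the audio was padded and featurized as expected.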