File size: 712 Bytes
c54cef7
 
 
 
 
3f0ead8
c54cef7
3f0ead8
c54cef7
3f0ead8
 
 
 
 
c54cef7
3f0ead8
 
 
c54cef7
3f0ead8
 
c54cef7
3f0ead8
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
---
library_name: transformers
tags: []
---

# whisper-large-v3-encoder

## How to

```python
from transformers.models.whisper.modeling_whisper import WhisperEncoder
from transformers import AutoFeatureExtractor
import torch
import librosa

encoder = WhisperEncoder.from_pretrained(
    'huseinzol05/whisper-large-v3-encoder', 
    torch_dtype = torch.float16).cuda()

feature_extractor = AutoFeatureExtractor.from_pretrained('openai/whisper-large-v3')
y, sr = librosa.load('audio.mp3', sr = 16000)

input_ids = feature_extractor(y, return_tensors = 'pt', sampling_rate = feature_extractor.sampling_rate)
input_ids['input_features'] = input_ids['input_features'].to(torch.float16).cuda()
encoder(**input_ids)
```