---
title: Tamil Audio to Text
emoji: ▶︎ •၊၊||၊|။||||။‌‌‌‌‌၊|• 0:10  📄

colorFrom: blue
colorTo: yellow
sdk: gradio
app_file: app.py
pinned: false
license: apache-2.0
---
# Whisper Small Tamil - Hugging Face Demo

This repository hosts a demo for the **Whisper Small Tamil** model, a version of OpenAI's Whisper-Small fine-tuned for Tamil automatic speech recognition (ASR).

## 🚀 Demo
Try the model directly on [🤗 Hugging Face Spaces](https://huggingface.co/spaces/deepakkumar07/whisper-small-tamil).

## 📝 Model Details
- **Base Model:** OpenAI Whisper-Small
- **Fine-tuned for:** Tamil ASR
- **Dataset Used:** Common Voice Tamil & other curated datasets
- **Supports:** Tamil speech-to-text transcription

## 🔧 How to Use
You can use this model in Python with the `transformers` library:

```python
from transformers import pipeline

# Load model from Hugging Face Hub
asr_pipeline = pipeline("automatic-speech-recognition", model="deepakkumar07/whisper-small-tamil")

# Transcribe an audio file
result = asr_pipeline("path/to/audio.wav")
print(result["text"])
```
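For recordings longer than Whisper's 30-second input window, the pipeline can split the audio into chunks. The following is a minimal sketch, assuming a recent `transformers` release in which the ASR pipeline accepts `chunk_length_s` and Whisper accepts `language`/`task` hints through `generate_kwargs`. The helper names `build_pipeline_kwargs` and `transcribe_file` are illustrative and are not part of this repository.

```python
def build_pipeline_kwargs(model_id="deepakkumar07/whisper-small-tamil",
                          chunk_length_s=30):
    """Return keyword arguments for transformers.pipeline (hypothetical helper)."""
    return {
        "task": "automatic-speech-recognition",
        "model": model_id,
        # Split long audio into windows of this many seconds.
        "chunk_length_s": chunk_length_s,
    }


def transcribe_file(path, language="tamil"):
    """Transcribe one audio file, hinting the language to the decoder."""
    # Imported lazily so build_pipeline_kwargs stays dependency-free.
    from transformers import pipeline

    asr = pipeline(**build_pipeline_kwargs())
    # Assumption: the checkpoint's generation config supports language hints.
    result = asr(path, generate_kwargs={"language": language, "task": "transcribe"})
    return result["text"]


# Example call (downloads the model on first use):
# print(transcribe_file("path/to/audio.wav"))
```

If the language hint raises an error with your installed version, dropping `generate_kwargs` falls back to the behaviour of the basic example above.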

## 📊 Performance
This model is fine-tuned for Tamil speech, but transcription errors are still possible, especially with noisy audio or mixed-language (for example, Tamil-English) input. Contributions and improvements are welcome!
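One common way to quantify transcription errors on your own recordings is word error rate (WER): the word-level edit distance between a reference transcript and the model output, divided by the number of reference words. The sketch below is a plain-Python illustration, not the evaluation used for this model; the function name `word_error_rate` is hypothetical, and it splits on whitespace without any text normalization.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # substitution or exact match
            )
        prev = curr
    return prev[-1] / len(ref)


# Compare a hand-written reference with a model transcript.
score = word_error_rate("வணக்கம் உலகம்", "வணக்கம்")
print(f"WER: {score:.2f}")
```

Lower is better; a value of 0 means the transcript matches the reference word for word.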

## 📌 Training Details
- Fine-tuned using the **Hugging Face Transformers** and **datasets** libraries.
- Trained on GPUs to speed up fine-tuning.
- Supports **streaming inference** for real-time transcription.
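Near-real-time transcription is often approximated by feeding the model short, overlapping windows of audio as they arrive. The sketch below shows only that windowing step in plain Python. It is an assumption about how streaming could be set up, not a description of this repository's implementation, and `iter_chunks` with its parameters is a hypothetical helper.

```python
from typing import Iterator, List, Sequence


def iter_chunks(samples: Sequence[float], sample_rate: int = 16000,
                chunk_s: float = 5.0, overlap_s: float = 1.0) -> Iterator[List[float]]:
    """Yield overlapping windows of audio samples."""
    chunk = int(chunk_s * sample_rate)
    step = chunk - int(overlap_s * sample_rate)
    if chunk <= 0 or step <= 0:
        raise ValueError("chunk_s must be positive and larger than overlap_s")

    start = 0
    while start < len(samples):
        yield list(samples[start:start + chunk])
        # Stop once the current window has reached the end of the signal.
        if start + chunk >= len(samples):
            break
        start += step


# Tiny demonstration with a 1 Hz "sample rate" so the windows are easy to read.
for window in iter_chunks(list(range(10)), sample_rate=1, chunk_s=4, overlap_s=1):
    print(window)
```

Whisper models expect 16 kHz audio, which is why that is the default sample rate here. Each window could then be passed to the ASR pipeline, and the overlap gives the decoder some context across window boundaries at the cost of having to merge duplicated words.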

## 💡 Applications
- Tamil voice-to-text conversion
- Subtitling and transcription services
- Voice-controlled Tamil applications

## 🤝 Contributing
If you find any issues or want to improve the model, feel free to open a PR or reach out!

## 📜 License
This demo is released under the Apache 2.0 license, as declared in the metadata at the top of this file. Please refer to OpenAI's original Whisper license for the base model's terms.

---

For more details, check out the [Hugging Face model page](https://huggingface.co/deepakkumar07/whisper-small-tamil). 🚀