---
title: Whisper Speech Transcription
emoji: 🎙️
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 5.41.1
app_file: app.py
pinned: false
license: mit
short_description: Speech-to-text with a fine-tuned Whisper model
---

# Whisper Speech Transcription

AI-powered speech-to-text with timestamps, using a fine-tuned Whisper model.

## Features
- Upload audio files (up to 3 minutes)
- Record voice directly
- Get timestamped transcriptions
- Download results in JSON and SRT formats
- Optimized for English speech
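
To illustrate the JSON and SRT downloads, here is a minimal sketch of how timestamped segments might be serialized. The segment shape (`start`, `end`, `text`) and the helper names `format_srt_timestamp` and `segments_to_srt` are assumptions made for this example, not necessarily what `app.py` uses.

```python
import json


def format_srt_timestamp(seconds: float) -> str:
    """Convert seconds to the SRT timestamp form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_srt_timestamp(seg["start"])
        end = format_srt_timestamp(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    return "\n".join(blocks)


# Hypothetical segments; real ones would come from the transcription step.
segments = [
    {"start": 0.0, "end": 2.5, "text": "Hello and welcome."},
    {"start": 2.5, "end": 5.0, "text": "This is a test."},
]

srt_text = segments_to_srt(segments)
json_text = json.dumps(segments, indent=2)
```

Writing `srt_text` and `json_text` to files would give the two download formats listed above.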

## Usage
1. Choose the upload or record option
2. Process your audio (max 3 minutes)
3. View transcription with timestamps
4. Download results in multiple formats
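
If you want to confirm that a recording fits the 3-minute limit before uploading, a check along these lines may help. It is a sketch that assumes an uncompressed WAV file and uses only the Python standard library; the function names are hypothetical and the app itself may validate duration differently.

```python
import io
import wave

MAX_DURATION_SECONDS = 180  # 3 minutes, the limit stated above


def wav_duration_seconds(source) -> float:
    """Return the duration of a WAV file given a path or file-like object."""
    with wave.open(source, "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()


def is_within_limit(source, max_seconds: float = MAX_DURATION_SECONDS) -> bool:
    """True if the WAV audio is no longer than max_seconds."""
    return wav_duration_seconds(source) <= max_seconds


def make_silent_wav(seconds: float, sample_rate: int = 8000) -> io.BytesIO:
    """Build an in-memory mono 16-bit WAV of silence, for demonstration only."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav_file:
        wav_file.setnchannels(1)
        wav_file.setsampwidth(2)
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(b"\x00\x00" * int(seconds * sample_rate))
    buffer.seek(0)
    return buffer


short_clip_ok = is_within_limit(make_silent_wav(2))
```

For a file on disk, pass its path instead, for example `is_within_limit("recording.wav")`. Other formats such as MP3 would need a different reader.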


Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference