Speech-to-Text (Whisper)
Supported Models
Model | Parameters | VRAM | Best For
Distilled Models (Faster)
Model | Parameters | VRAM | Notes
Getting Started
Supported Audio Formats
Format | Extension | Notes
API Usage
Transcribe Audio
With Language Hint
Python SDK
Response Format
Streaming Response
Performance
Model | Typical RTF (GPU) | Typical RTF (CPU)
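The RTF columns report a real-time factor. The sketch below shows one way to compute it, assuming the common definition: processing time divided by audio duration, so a value below 1.0 means transcription runs faster than real time. The names `real_time_factor` and `measure_rtf` are hypothetical helpers written for illustration and are not part of any API described on this page. The `transcribe` argument is a stand-in for whatever call actually performs the transcription.

```python
import time
from typing import Callable


def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Return RTF, assumed here to be processing time / audio duration."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    if processing_seconds < 0:
        raise ValueError("processing_seconds must not be negative")
    return processing_seconds / audio_seconds


def measure_rtf(transcribe: Callable[[], object], audio_seconds: float) -> float:
    """Time a single call to a (hypothetical) transcription callable.

    `transcribe` takes no arguments; wrap the real call in a lambda or
    functools.partial so the audio input is already bound.
    """
    start = time.perf_counter()
    transcribe()
    elapsed = time.perf_counter() - start
    return real_time_factor(elapsed, audio_seconds)
```

For example, `real_time_factor(30.0, 60.0)` describes a 60-second clip transcribed in 30 seconds. A single timed run can be noisy, so averaging several runs after a warm-up call is usually a fairer basis for comparing models.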