Speech-to-Text
Transcribes an audio file to text. Accepts a multipart form upload containing the audio file and optional parameters. Compatible with the OpenAI audio transcription API.
POST
Speech-to-Text
Authorizations
Bearer token issued by your workspace.
Body
multipart/form-data
The audio file to transcribe. Supported formats: wav, mp3, flac, ogg, m4a, webm. Maximum size: 15 MB.
The model to use for transcription.
The format of the response. json returns {"text": "..."}. text returns plain text. verbose_json includes language, timing, and word-level data.
Available options: json, text, verbose_json
Sampling temperature between 0 and 1. Lower values are more deterministic.
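The body parameters above can be assembled into a request roughly as follows. This is a minimal sketch, not an official client. It rests on several assumptions that this page does not confirm: the endpoint URL is a placeholder, the form field names (file, model, response_format, temperature) are taken from the OpenAI transcription API that this endpoint is said to be compatible with, "15 MB" is read as 15 * 1024 * 1024 bytes, and the function name and model name are hypothetical. It uses only the Python standard library and builds the request without sending it.

```python
import uuid
import urllib.request
from pathlib import Path

# Placeholder URL: the real endpoint path is not stated on this page.
ENDPOINT = "https://api.example.com/v1/audio/transcriptions"
SUPPORTED_FORMATS = {"wav", "mp3", "flac", "ogg", "m4a", "webm"}
RESPONSE_FORMATS = {"json", "text", "verbose_json"}
MAX_BYTES = 15 * 1024 * 1024  # assumption: "15 MB" means 15 MiB


def build_transcription_request(filename, audio, token, model,
                                response_format="json", temperature=0.0):
    """Validate the inputs and return an unsent multipart POST request."""
    ext = Path(filename).suffix.lstrip(".").lower()
    if ext not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported audio format: {ext!r}")
    if len(audio) > MAX_BYTES:
        raise ValueError("audio file exceeds the 15 MB limit")
    if response_format not in RESPONSE_FORMATS:
        raise ValueError(f"unknown response format: {response_format!r}")
    if not 0 <= temperature <= 1:
        raise ValueError("temperature must be between 0 and 1")

    boundary = uuid.uuid4().hex
    # Field names are assumed from the OpenAI-compatible API.
    fields = {
        "model": model,
        "response_format": response_format,
        "temperature": str(temperature),
    }
    parts = []
    for name, value in fields.items():
        parts.append(
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f"{value}\r\n".encode()
        )
    # The audio bytes go in their own part, with a filename.
    parts.append(
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n".encode()
        + audio + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode())

    headers = {
        "Authorization": f"Bearer {token}",
        "Content-Type": f"multipart/form-data; boundary={boundary}",
    }
    return urllib.request.Request(
        ENDPOINT, data=b"".join(parts), headers=headers, method="POST"
    )
```

To send the request, pass the returned object to urllib.request.urlopen and read the response body. With the json format, the body would then be parsed with json.loads and the transcript read from its "text" key. With the text format, the body is the transcript itself.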