Endpoints
Create Transcription
Transcribes audio from a given audio file. Also supports neural speaker diarization.
Supported Formats and Limitations
- Supported formats:
mp3
,wav
,mp3
,m4a
,flac
,ogg
,opus
,mp4
,mov
,avi
,mkv
. Furthermore, sending files by their URL is supported by the API. - Maximum file size:
1GB
. If you need more, write to Support. - Minimum audio length: 0.3 seconds.
- Maximum audio length: 10 hours.
- Rate limit: 10 requests per second.
To save bandwidth, it is recommended to convert video files to audio formats, for example using ffmpeg:
ffmpeg -i input.mp4 -vn -c:a aac -b:a 192k output.m4a
In this example, the video file input.mp4
is converted to output.m4a
with a bitrate of 192 kbps.
Examples
Click the picker on the right, which is either verbose_json_example
, json_example
or diarization_example
to view the example responses.
Authorizations
Use your API key as a Bearer token in the Authorization header. Example: Authorization: Bearer nx-yourkey
Body
multipart/form-data
Response
200
application/json
Successful transcription or diarization response. The format depends on the 'response_format' parameter.
The response is of type object
.