Transcribes audio from a given audio file. Also supports neural speaker diarization.
mp3
, wav
, mp3
, m4a
, flac
, ogg
, opus
, mp4
, mov
, avi
, mkv
. Furthermore, sending files by their URL is supported by the API.1GB
. If you need more, write to Support.To save bandwidth, it is recommended to convert video files to audio formats, for example using ffmpeg:
ffmpeg -i input.mp4 -vn -c:a aac -b:a 192k output.m4a
In this example, the video file input.mp4
is converted to output.m4a
with a bitrate of 192 kbps.
Click the picker on the right, which is either verbose_json_example
, json_example
or diarization_example
to view the example responses.
Use your API key as a Bearer token in the Authorization header. Example: Authorization: Bearer nx-yourkey
Successful transcription or diarization response. The format depends on the 'response_format' parameter.
The response is of type object
.