Skip to main content

Transcribe an audio or video file.

POST 

/api/v2/Audio/transcribe

Transcribes an audio or video file into text. Supported file formats: mp3, mp4, mpeg, mpga, m4a, wav, webm. Maximum file size: 25 MB.

Request

Query Parameters

    culture string

    Use this param to override the culture of the response. Options: - en (default) - es

Body

    ModelId uuid

    The AI Model Id to use for transcription. If not provided, the default audio model will be used.

    File binaryrequired

    The audio or video file to transcribe.

    Prompt string

    Optional prompt to guide the transcription. Useful for providing context or specifying terminology.

    UserIdentifier string

    Possible values: <= 256 characters

    Optional user identifier for tracking and cost management purposes.

    Channel string

    Possible values: <= 100 characters

    The channel used in the execution

Responses

Returns the transcription result.

Schema

    instanceId uuid
    transcript stringnullable

    metadata

    object

    language stringnullable
    duration date-spannullable

    tokenUsage

    object

    completionTokens int32
    promptTokens int32
    totalTokens int32

    cost

    object

    completion double
    prompt double
    total double
    currency stringnullable
Loading...