lucataco/voxtral-mini-3b:f5a2a8bc

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field, its default value will be used.

| Field | Type | Default value | Description |
|---|---|---|---|
| audio | string | | Audio file to process |
| mode | string (enum) | transcription | Processing mode. Options: transcription, understanding |
| prompt | string | Please describe what you hear in this audio. | Text prompt for audio understanding mode (ignored for transcription) |
| language | string (enum) | en | Language code for transcription. Options: en, es, fr, de, it, nl, pt, hi |
| max_new_tokens | integer | 1024 | Maximum number of tokens to generate. Min: 1, Max: 32768 |
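As an illustration of how these fields fit together, here is a minimal Python sketch that builds an input payload and checks it against the constraints listed above. The helper name `build_input` and the example URL are hypothetical and not part of the model's API. The sketch also assumes that `audio` is supplied as a URL string, which the schema does not state explicitly.

```python
from typing import Any, Dict

# Allowed values and defaults copied from the input schema above.
MODES = ("transcription", "understanding")
LANGUAGES = ("en", "es", "fr", "de", "it", "nl", "pt", "hi")
DEFAULT_PROMPT = "Please describe what you hear in this audio."


def build_input(
    audio: str,
    mode: str = "transcription",
    prompt: str = DEFAULT_PROMPT,
    language: str = "en",
    max_new_tokens: int = 1024,
) -> Dict[str, Any]:
    """Hypothetical helper: validate values and return the input payload."""
    if mode not in MODES:
        raise ValueError(f"mode must be one of {MODES}, got {mode!r}")
    if language not in LANGUAGES:
        raise ValueError(f"language must be one of {LANGUAGES}, got {language!r}")
    if not 1 <= max_new_tokens <= 32768:
        raise ValueError("max_new_tokens must be between 1 and 32768")
    return {
        "audio": audio,  # assumed here to be a URL string
        "mode": mode,
        "prompt": prompt,  # ignored by the model in transcription mode
        "language": language,
        "max_new_tokens": max_new_tokens,
    }


# Placeholder URL; replace with a real audio file location.
payload = build_input(
    "https://example.com/clip.wav",
    mode="understanding",
    prompt="Summarize what the speaker says.",
)
```

The resulting dictionary can then be passed as the input to whichever API client you use. Validating locally is a design choice that surfaces out-of-range values before a request is sent.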

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'title': 'Output', 'type': 'string'}
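Since the schema declares the output as a plain string, client code may want to confirm that before using the result. The following is a small sketch under that assumption. The function name `check_output` is hypothetical, and the sample text is made up for illustration.

```python
# Output schema as shown above.
OUTPUT_SCHEMA = {"title": "Output", "type": "string"}


def check_output(value: object) -> str:
    """Hypothetical helper: return the value if it matches the string schema."""
    if OUTPUT_SCHEMA["type"] == "string" and not isinstance(value, str):
        raise TypeError(f"expected a string output, got {type(value).__name__}")
    return value


# Illustrative value only, not real model output.
text = check_output("Hello and welcome to the show.")
```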