You're looking at a specific version of this model. Jump to the model overview.

zsxkib /audio-flamingo-3:419bdd5e

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
audio
string
Audio file to analyze. Supports speech, music, and sound effects. Maximum duration: 10 minutes.
prompt
string
Please describe this audio in detail.
Question or instruction about the audio
system_prompt
string
System instructions to customize the model's behavior, output format, or analysis style. Leave empty for default behavior.
enable_thinking
boolean
False
Enable detailed chain-of-thought reasoning for complex analysis. False for faster responses, True for deeper insights.
temperature
number
0

Max: 1

Controls response creativity and randomness. Use 0.0 for deterministic (default), 0.1-0.3 for factual analysis, 0.7-0.9 for creative interpretation.
max_length
integer
0

Max: 2048

Maximum length of the response in tokens. Use 0 for model default, or specify 50-2048 for custom length.
start_time
number
Start time in seconds for audio segment analysis (optional). Useful for long audio files.
end_time
number
End time in seconds for audio segment analysis (optional). Must be greater than start_time.

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'title': 'Output', 'type': 'string'}