zsxkib/audio-flamingo-3:2856d42f | Run with an API on Replicate

You're looking at a specific version of this model. Jump to the model overview.

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field	Type	Default value	Description
audio	string		Audio file to analyze. Supports speech, music, and sound effects. Maximum duration: 10 minutes.
prompt	string	Please describe this audio in detail.	Question or instruction about the audio
system_prompt	string		System instructions to customize the model's behavior, output format, or analysis style. Leave empty for default behavior.
enable_thinking	boolean	False	Enable detailed chain-of-thought reasoning for complex analysis. False for faster responses, True for deeper insights.
temperature	number	0.3 Max: 1	Controls response creativity and randomness. Lower values (0.1-0.3) for factual analysis, higher values (0.7-0.9) for creative interpretation.
max_length	integer	1024 Min: 50 Max: 2048	Maximum length of the response in tokens. Shorter for concise answers, longer for detailed analysis.
start_time	number		Start time in seconds for audio segment analysis (optional). Useful for long audio files.
end_time	number		End time in seconds for audio segment analysis (optional). Must be greater than start_time.

The shape of the response you’ll get when you run this model with an API.

Schema

{'title': 'Output', 'type': 'string'}