zsxkib/kimi-audio-7b-instruct:7500b323 | Run with an API on Replicate

You're looking at a specific version of this model. Jump to the model overview.

zsxkib /kimi-audio-7b-instruct:7500b323

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field	Type	Default value	Description
audio	string		Input audio file for processing. Can be used for speech-to-text (ASR) or audio-to-audio generation.
prompt	string		Optional text prompt to guide the model. For ASR, use prompts like 'Please convert this audio to text' or '请将音频内容转换为文字' (Chinese).
output_type	string (enum)	both Options: audio, text, both	Type of output to generate: 'audio' for audio only, 'text' for transcription only, or 'both' for both audio and text responses.
return_json	boolean	True	Return text results in JSON format instead of text file
audio_temperature	number	0.8	Temperature for audio generation. Higher values (0.8-1.0) increase creativity but may reduce coherence.
audio_top_k	integer	10	Top-k for audio generation. Limits the token selection to the k most likely tokens.
text_temperature	number	0	Temperature for text generation. Lower values (0.0-0.5) increase factual accuracy.
text_top_k	integer	5	Top-k for text generation. Limits the token selection to the k most likely tokens.
audio_repetition_penalty	number	1	Repetition penalty for audio. Values > 1.0 discourage repetition in audio generation.
audio_repetition_window_size	integer	64	Window size for audio repetition penalty calculation.
text_repetition_penalty	number	1	Repetition penalty for text. Values > 1.0 discourage repetition in text generation.
text_repetition_window_size	integer	16	Window size for text repetition penalty calculation.

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema

{'properties': {'json_str': {'title': 'Json Str', 'type': 'string'},
                'media_path': {'format': 'uri',
                               'title': 'Media Path',
                               'type': 'string'}},
 'title': 'Output',
 'type': 'object'}