You're looking at a specific version of this model. Jump to the model overview.

lucataco /qwen2.5-omni-7b:0ca8160f

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
prompt
string
Text prompt for the model
image
string
Optional image input
audio
string
Optional audio input
video
string
Optional video input
system_prompt
string
You are Qwen, a virtual human developed by the Qwen Team, Alibaba Group, capable of perceiving auditory and visual inputs, as well as generating text and speech.
System prompt for the model
use_audio_in_video
boolean
True
Whether to use audio in video
voice_type
string (enum)
Chelsie

Options:

Chelsie, Ethan

Voice type for audio output
generate_audio
boolean
True
Whether to generate audio output

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'properties': {'text': {'title': 'Text', 'type': 'string'},
                'voice': {'format': 'uri', 'title': 'Voice', 'type': 'string'}},
 'required': ['text'],
 'title': 'Output',
 'type': 'object'}