meta/sam-2-video:2d721987

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field, its default value is used.

| Field | Type | Default value | Description |
| --- | --- | --- | --- |
| input_video | string | | Input video file path |
| click_coordinates | string | | Click coordinates as '[x,y],[x,y],...'. Determines number of clicks. |
| click_labels | string | 1 | Click types (1=foreground, 0=background) as '1,1,0,1'. Auto-extends if shorter than coordinates. |
| click_frames | string | 0 | Frame indices for clicks as '0,0,150,0'. Auto-extends if shorter than coordinates. |
| click_object_ids | string | | Object labels for clicks as 'person,dog,cat'. Auto-generates if missing or incomplete. |
| mask_type | string (enum) | binary | Options: binary, highlighted, greenscreen. Mask type: binary (B&W), highlighted (colored overlay), or greenscreen. |
| output_video | boolean | False | True for video output, False for image sequence |
| video_fps | integer | 30 | Min: 1. Max: 60. Video output frame rate (ignored for image sequence). |
| output_format | string (enum) | webp | Options: webp, jpg, png. Image format for sequence (ignored for video). |
| output_quality | integer | 80 | Max: 100. JPG/WebP compression quality (0-100, ignored for PNG and video). |
| output_frame_interval | integer | 1 | Output every Nth frame. 1=all frames, 2=every other, etc. |
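
Because the click fields are comma-separated strings that must line up position by position, it can help to build them from structured records rather than by hand. The following is a minimal sketch, not part of the model's API: `build_input` and the per-click keys (`x`, `y`, `label`, `frame`, `object_id`) are hypothetical names chosen for illustration. It writes every click field at full length, so it does not rely on the auto-extend or auto-generate behavior described above.

```python
from typing import Sequence

# Enum values copied from the input schema.
MASK_TYPES = {"binary", "highlighted", "greenscreen"}
OUTPUT_FORMATS = {"webp", "jpg", "png"}


def build_input(
    input_video: str,
    clicks: Sequence[dict],
    mask_type: str = "binary",
    output_video: bool = False,
    video_fps: int = 30,
    output_format: str = "webp",
    output_quality: int = 80,
    output_frame_interval: int = 1,
) -> dict:
    """Assemble an input payload from structured click records.

    Each click is a hypothetical dict with the keys
    x, y, label (1=foreground, 0=background), frame, and object_id.
    """
    if not clicks:
        raise ValueError("at least one click is required")
    if mask_type not in MASK_TYPES:
        raise ValueError(f"mask_type must be one of {sorted(MASK_TYPES)}")
    if output_format not in OUTPUT_FORMATS:
        raise ValueError(f"output_format must be one of {sorted(OUTPUT_FORMATS)}")
    if not 1 <= video_fps <= 60:
        raise ValueError("video_fps must be between 1 and 60")
    if not 0 <= output_quality <= 100:
        raise ValueError("output_quality must be between 0 and 100")

    return {
        "input_video": input_video,
        # One entry per click in every field, all in the same order.
        "click_coordinates": ",".join(f"[{c['x']},{c['y']}]" for c in clicks),
        "click_labels": ",".join(str(c["label"]) for c in clicks),
        "click_frames": ",".join(str(c["frame"]) for c in clicks),
        "click_object_ids": ",".join(str(c["object_id"]) for c in clicks),
        "mask_type": mask_type,
        "output_video": output_video,
        "video_fps": video_fps,
        "output_format": output_format,
        "output_quality": output_quality,
        "output_frame_interval": output_frame_interval,
    }


# Example: a foreground click on frame 0 and a background click on frame 150.
example_payload = build_input(
    "video.mp4",
    [
        {"x": 100, "y": 200, "label": 1, "frame": 0, "object_id": "person"},
        {"x": 300, "y": 50, "label": 0, "frame": 150, "object_id": "dog"},
    ],
)
```

The resulting dictionary uses exactly the field names from the table, so it could be passed as the input to whichever API client you use. Validating ranges and enum values locally is a design choice that surfaces mistakes before a request is sent.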

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'items': {'format': 'uri', 'type': 'string'},
 'title': 'Output',
 'type': 'array',
 'x-cog-array-type': 'iterator'}
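
The schema describes the output as an array of URI strings delivered as an iterator. As a rough sketch of how a caller might consume it, the helper below pairs each URI with a local file path in output order. `plan_downloads`, the `masks` directory, and the `frame_00000` naming pattern are assumptions made for illustration, and the actual download step is left to the caller.

```python
import os
from urllib.parse import urlparse


def plan_downloads(output_uris, out_dir="masks"):
    """Pair each output URI with a local file path, preserving output order.

    Accepts any iterable of URI strings, including a one-pass iterator.
    """
    plan = []
    for index, uri in enumerate(output_uris):
        # Reuse the extension from the URI path; fall back to .bin if absent.
        ext = os.path.splitext(urlparse(uri).path)[1] or ".bin"
        plan.append((uri, os.path.join(out_dir, f"frame_{index:05d}{ext}")))
    return plan
```

The index in each file name is the item's position in the output, which may differ from the source frame number when output_frame_interval is greater than 1.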