You're looking at a specific version of this model.

meta/sam-2-video:ae84885e

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field, its default value will be used.

| Field | Type | Default value | Description |
| --- | --- | --- | --- |
| input_video | string | | Path to the input video file |
| click_coordinates | string | | List of click coordinates in the format '[x,y],[x,y],...' |
| click_labels | string | | List of click types (1 for foreground, 0 for background), e.g., '1,1,0,1' |
| click_frames | string | | List of frame indices for each click, e.g., '0,0,150,0' |
| click_object_ids | string | | List of object IDs for each click, e.g., '1,1,1,2' |
| output_frame_interval | integer | 1 | Interval for output frame visualization (1 = every frame, 2 = every other frame, etc.) |
| mask_type | string (enum) | binary | Options: binary, highlighted. Choose the type of mask to return (binary or highlighted frames) |
| output_format | string (enum) | webp | Options: webp, jpg, png. The image file format of the generated output images |
| output_quality | integer | 80 | Max: 100. The image compression quality (for lossy formats like JPEG and WebP). 100 = best quality, 0 = lowest quality. |
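
The click fields are plain strings rather than arrays, so it can help to build them from structured data. The sketch below is one way to do that, not the model's own tooling: the `build_input` helper and its `(x, y, label, frame, object_id)` tuple layout are hypothetical, and it assumes each click contributes exactly one entry to each of the four click fields and that 0 is the lowest accepted `output_quality`.

```python
from typing import Dict, Sequence, Tuple, Union

# Hypothetical click layout for this sketch: (x, y, label, frame, object_id).
Click = Tuple[int, int, int, int, int]

MASK_TYPES = ("binary", "highlighted")
OUTPUT_FORMATS = ("webp", "jpg", "png")


def build_input(
    input_video: str,
    clicks: Sequence[Click],
    output_frame_interval: int = 1,
    mask_type: str = "binary",
    output_format: str = "webp",
    output_quality: int = 80,
) -> Dict[str, Union[str, int]]:
    """Serialize clicks and options into the fields of the input schema."""
    if not clicks:
        raise ValueError("at least one click is required")
    for x, y, label, frame, object_id in clicks:
        if label not in (0, 1):
            raise ValueError(f"label must be 0 or 1, got {label}")
        if frame < 0:
            raise ValueError(f"frame index must be >= 0, got {frame}")
    if mask_type not in MASK_TYPES:
        raise ValueError(f"mask_type must be one of {MASK_TYPES}")
    if output_format not in OUTPUT_FORMATS:
        raise ValueError(f"output_format must be one of {OUTPUT_FORMATS}")
    if not 0 <= output_quality <= 100:
        raise ValueError("output_quality must be between 0 and 100")
    if output_frame_interval < 1:
        raise ValueError("output_frame_interval must be >= 1")

    return {
        "input_video": input_video,
        # '[x,y],[x,y],...'
        "click_coordinates": ",".join(f"[{c[0]},{c[1]}]" for c in clicks),
        # '1,1,0,1' style comma-separated lists, one entry per click
        "click_labels": ",".join(str(c[2]) for c in clicks),
        "click_frames": ",".join(str(c[3]) for c in clicks),
        "click_object_ids": ",".join(str(c[4]) for c in clicks),
        "output_frame_interval": output_frame_interval,
        "mask_type": mask_type,
        "output_format": output_format,
        "output_quality": output_quality,
    }


clicks = [
    (100, 200, 1, 0, 1),
    (150, 220, 1, 0, 1),
    (300, 50, 0, 150, 1),
    (400, 300, 1, 0, 2),
]
payload = build_input("video.mp4", clicks)
print(payload["click_coordinates"])
print(payload["click_labels"])
```

Keeping each click as a single tuple means the four string fields cannot drift out of step with one another. The resulting dictionary has the shape of the input object you would send when running the model.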

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'items': {'format': 'uri', 'type': 'string'},
 'title': 'Output',
 'type': 'array',
 'x-cog-array-type': 'iterator'}
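
The schema above describes an array of URI strings that is produced incrementally. As a rough sketch of consuming it, the hypothetical `plan_downloads` helper below checks each item and derives a local file name from it. The example URLs are placeholders, and nothing here is guaranteed about how the real output files are named.

```python
from pathlib import PurePosixPath
from typing import Iterable, Iterator, Tuple
from urllib.parse import urlsplit


def plan_downloads(uris: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (uri, local_filename) pairs for each item of the output array."""
    for index, uri in enumerate(uris):
        if not isinstance(uri, str):
            raise TypeError(f"output item {index} is not a string")
        parts = urlsplit(uri)
        if not parts.scheme:
            raise ValueError(f"output item {index} is not a URI: {uri!r}")
        # Use the last path segment as the file name; fall back to a
        # numbered name when the URI path has no usable segment.
        name = PurePosixPath(parts.path).name or f"output_{index:05d}"
        yield uri, name


# Placeholder URIs for illustration only.
example_output = [
    "https://example.com/out/frame_0000.webp",
    "https://example.com/out/frame_0001.webp",
]
for uri, filename in plan_downloads(example_output):
    print(filename, "<-", uri)
```

Because the helper is a generator, it can process items as they arrive instead of waiting for the full array, which suits an iterator-style output.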