bytedance
/
sa2va-4b-video
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Prediction
bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bdID7fkwtc6rfhrmc0cn6ezbw62948StatusSucceededSourceWebHardwareL40STotal durationCreatedInput
- video
- instruction
- what color is the flower?
- frame_interval
- 4
{ "video": "https://replicate.delivery/pbxt/MXbMlgO6lBD93p0OZrWjcQzRrGa3tuns7q7Si64C15pNs4yT/flower-6.mp4", "instruction": "what color is the flower?", "frame_interval": 4 }
Install Replicate’s Node.js client library:npm install replicate
Import and set up the client:import Replicate from "replicate"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", { input: { video: "https://replicate.delivery/pbxt/MXbMlgO6lBD93p0OZrWjcQzRrGa3tuns7q7Si64C15pNs4yT/flower-6.mp4", instruction: "what color is the flower?", frame_interval: 4 } } ); console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Import the client:import replicate
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", input={ "video": "https://replicate.delivery/pbxt/MXbMlgO6lBD93p0OZrWjcQzRrGa3tuns7q7Si64C15pNs4yT/flower-6.mp4", "instruction": "what color is the flower?", "frame_interval": 4 } ) print(output)
To learn more, take a look at the guide on getting started with Python.
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", "input": { "video": "https://replicate.delivery/pbxt/MXbMlgO6lBD93p0OZrWjcQzRrGa3tuns7q7Si64C15pNs4yT/flower-6.mp4", "instruction": "what color is the flower?", "frame_interval": 4 } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
The flower is pink.<|end|>{ "completed_at": "2025-02-23T16:58:27.766710Z", "created_at": "2025-02-23T16:58:20.156000Z", "data_removed": false, "error": null, "id": "7fkwtc6rfhrmc0cn6ezbw62948", "input": { "video": "https://replicate.delivery/pbxt/MXbMlgO6lBD93p0OZrWjcQzRrGa3tuns7q7Si64C15pNs4yT/flower-6.mp4", "instruction": "what color is the flower?", "frame_interval": 4 }, "logs": null, "metrics": { "predict_time": 7.586836302, "total_time": 7.61071 }, "output": { "response": "The flower is pink.<|end|>", "masked_video": null }, "started_at": "2025-02-23T16:58:20.179874Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/7fkwtc6rfhrmc0cn6ezbw62948", "cancel": "https://api.replicate.com/v1/predictions/7fkwtc6rfhrmc0cn6ezbw62948/cancel" }, "version": "12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd" }
Generated inPrediction
bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bdID0mrgkhhsz9rm80cn6f2awk4z28StatusSucceededSourceWebHardwareL40STotal durationCreatedInput
- video
- instruction
- Segment the otter
- frame_interval
- 4
{ "video": "https://replicate.delivery/pbxt/MXbFBBi6sAgN7w0hoVJgStenhfJA4Qar6iFZXRHE9pVicycg/sora-otter-5.mp4", "instruction": "Segment the otter", "frame_interval": 4 }
Install Replicate’s Node.js client library:npm install replicate
Import and set up the client:import Replicate from "replicate"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", { input: { video: "https://replicate.delivery/pbxt/MXbFBBi6sAgN7w0hoVJgStenhfJA4Qar6iFZXRHE9pVicycg/sora-otter-5.mp4", instruction: "Segment the otter", frame_interval: 4 } } ); console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Import the client:import replicate
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", input={ "video": "https://replicate.delivery/pbxt/MXbFBBi6sAgN7w0hoVJgStenhfJA4Qar6iFZXRHE9pVicycg/sora-otter-5.mp4", "instruction": "Segment the otter", "frame_interval": 4 } ) print(output)
To learn more, take a look at the guide on getting started with Python.
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", "input": { "video": "https://replicate.delivery/pbxt/MXbFBBi6sAgN7w0hoVJgStenhfJA4Qar6iFZXRHE9pVicycg/sora-otter-5.mp4", "instruction": "Segment the otter", "frame_interval": 4 } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
response
Sure, [SEG].<|end|>masked_video
{ "completed_at": "2025-02-23T17:04:31.882309Z", "created_at": "2025-02-23T17:04:12.794000Z", "data_removed": false, "error": null, "id": "0mrgkhhsz9rm80cn6f2awk4z28", "input": { "video": "https://replicate.delivery/pbxt/MXbFBBi6sAgN7w0hoVJgStenhfJA4Qar6iFZXRHE9pVicycg/sora-otter-5.mp4", "instruction": "Segment the otter", "frame_interval": 4 }, "logs": "propagate in video: 0%| | 0/45 [00:00<?, ?it/s]\npropagate in video: 18%|█▊ | 8/45 [00:00<00:00, 73.96it/s]\npropagate in video: 36%|███▌ | 16/45 [00:00<00:00, 37.22it/s]\npropagate in video: 47%|████▋ | 21/45 [00:00<00:00, 32.76it/s]\npropagate in video: 56%|█████▌ | 25/45 [00:00<00:00, 30.91it/s]\npropagate in video: 64%|██████▍ | 29/45 [00:00<00:00, 29.74it/s]\npropagate in video: 73%|███████▎ | 33/45 [00:01<00:00, 28.97it/s]\npropagate in video: 80%|████████ | 36/45 [00:01<00:00, 28.56it/s]\npropagate in video: 87%|████████▋ | 39/45 [00:01<00:00, 28.26it/s]\npropagate in video: 93%|█████████▎| 42/45 [00:01<00:00, 28.02it/s]\npropagate in video: 100%|██████████| 45/45 [00:01<00:00, 27.85it/s]\npropagate in video: 100%|██████████| 45/45 [00:01<00:00, 30.70it/s]\nffmpeg version 4.4.2-0ubuntu0.22.04.1 Copyright (c) 2000-2021 the FFmpeg developers\nbuilt with gcc 11 (Ubuntu 11.2.0-19ubuntu1)\nconfiguration: --prefix=/usr --extra-version=0ubuntu0.22.04.1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzimg --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-librsvg --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared\nlibavutil 56. 70.100 / 56. 70.100\nlibavcodec 58.134.100 / 58.134.100\nlibavformat 58. 76.100 / 58. 76.100\nlibavdevice 58. 13.100 / 58. 13.100\nlibavfilter 7.110.100 / 7.110.100\nlibswscale 5. 9.100 / 5. 9.100\nlibswresample 3. 9.100 / 3. 9.100\nlibpostproc 55. 9.100 / 55. 9.100\nInput #0, avi, from '/tmp/output/temp_masked.avi':\nMetadata:\nsoftware : Lavf59.27.100\nDuration: 00:00:06.01, start: 0.000000, bitrate: 1700 kb/s\nStream #0:0: Video: mjpeg (Baseline) (MJPG / 0x47504A4D), yuvj420p(pc, bt470bg/unknown/unknown), 1940x1096, 1725 kb/s, 7.48 fps, 7.48 tbr, 7.48 tbn, 7.48 tbc\nStream mapping:\nStream #0:0 -> #0:0 (mjpeg (native) -> h264 (libx264))\nPress [q] to stop, [?] for help\n[swscaler @ 0x570e87d69d00] deprecated pixel format used, make sure you did set range correctly\n[libx264 @ 0x570e87d31f80] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 AVX512\n[libx264 @ 0x570e87d31f80] profile High, level 4.2, 4:2:0, 8-bit\n[libx264 @ 0x570e87d31f80] 264 - core 163 r3060 5db6aa6 - H.264/MPEG-4 AVC codec - Copyleft 2003-2021 - http://www.videolan.org/x264.html - options: cabac=1 ref=2 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=6 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=15 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=1 keyint=250 keyint_min=7 scenecut=40 intra_refresh=0 rc_lookahead=30 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00\nOutput #0, mp4, to '/tmp/output/masked_video.mp4':\nMetadata:\nsoftware : Lavf59.27.100\nencoder : Lavf58.76.100\nStream #0:0: Video: h264 (avc1 / 0x31637661), yuv420p(tv, bt470bg/unknown/unknown, progressive), 1940x1096, q=2-31, 7.48 fps, 14964 tbn\nMetadata:\nencoder : Lavc58.134.100 libx264\nSide data:\ncpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A\nframe= 1 fps=0.0 q=0.0 size= 0kB time=00:00:00.00 bitrate=N/A speed=N/A\nframe= 45 fps=0.0 q=-1.0 Lsize= 178kB time=00:00:05.61 bitrate= 260.5kbits/s speed=10.9x\nvideo:177kB audio:0kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 0.743007%\n[libx264 @ 0x570e87d31f80] frame I:1 Avg QP: 6.99 size: 5411\n[libx264 @ 0x570e87d31f80] frame P:17 Avg QP:16.48 size: 4561\n[libx264 @ 0x570e87d31f80] frame B:27 Avg QP:18.29 size: 3622\n[libx264 @ 0x570e87d31f80] consecutive B-frames: 13.3% 8.9% 33.3% 44.4%\n[libx264 @ 0x570e87d31f80] mb I I16..4: 87.2% 10.8% 2.0%\n[libx264 @ 0x570e87d31f80] mb P I16..4: 0.4% 2.0% 1.3% P16..4: 1.7% 0.5% 0.2% 0.0% 0.0% skip:93.9%\n[libx264 @ 0x570e87d31f80] mb B I16..4: 0.4% 0.0% 0.2% B16..8: 2.6% 1.3% 0.4% direct: 0.1% skip:95.1% L0:50.2% L1:46.8% BI: 3.0%\n[libx264 @ 0x570e87d31f80] 8x8 transform intra:25.1% inter:1.7%\n[libx264 @ 0x570e87d31f80] coded y,uvDC,uvAC intra: 8.1% 0.0% 0.0% inter: 0.5% 0.0% 0.0%\n[libx264 @ 0x570e87d31f80] i16 v,h,dc,p: 91% 7% 3% 0%\n[libx264 @ 0x570e87d31f80] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 35% 13% 51% 0% 0% 0% 0% 0% 0%\n[libx264 @ 0x570e87d31f80] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 20% 22% 41% 3% 2% 2% 4% 2% 4%\n[libx264 @ 0x570e87d31f80] i8c dc,h,v,p: 100% 0% 0% 0%\n[libx264 @ 0x570e87d31f80] Weighted P-Frames: Y:0.0% UV:0.0%\n[libx264 @ 0x570e87d31f80] ref P L0: 48.1% 51.9%\n[libx264 @ 0x570e87d31f80] ref B L0: 87.9% 12.1%\n[libx264 @ 0x570e87d31f80] ref B L1: 97.4% 2.6%\n[libx264 @ 0x570e87d31f80] kb/s:240.40", "metrics": { "predict_time": 19.079653052, "total_time": 19.088309 }, "output": { "response": "Sure, [SEG].<|end|>", "masked_video": "https://replicate.delivery/xezq/fOm8DB1CneoV3UYOICBnRpG3kMp3pYVC8feKcBapkEW9xBJRB/masked_video.mp4" }, "started_at": "2025-02-23T17:04:12.802656Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/0mrgkhhsz9rm80cn6f2awk4z28", "cancel": "https://api.replicate.com/v1/predictions/0mrgkhhsz9rm80cn6f2awk4z28/cancel" }, "version": "12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd" }
Generated inpropagate in video: 0%| | 0/45 [00:00<?, ?it/s] propagate in video: 18%|█▊ | 8/45 [00:00<00:00, 73.96it/s] propagate in video: 36%|███▌ | 16/45 [00:00<00:00, 37.22it/s] propagate in video: 47%|████▋ | 21/45 [00:00<00:00, 32.76it/s] propagate in video: 56%|█████▌ | 25/45 [00:00<00:00, 30.91it/s] propagate in video: 64%|██████▍ | 29/45 [00:00<00:00, 29.74it/s] propagate in video: 73%|███████▎ | 33/45 [00:01<00:00, 28.97it/s] propagate in video: 80%|████████ | 36/45 [00:01<00:00, 28.56it/s] propagate in video: 87%|████████▋ | 39/45 [00:01<00:00, 28.26it/s] propagate in video: 93%|█████████▎| 42/45 [00:01<00:00, 28.02it/s] propagate in video: 100%|██████████| 45/45 [00:01<00:00, 27.85it/s] propagate in video: 100%|██████████| 45/45 [00:01<00:00, 30.70it/s] ffmpeg version 4.4.2-0ubuntu0.22.04.1 Copyright (c) 2000-2021 the FFmpeg developers built with gcc 11 (Ubuntu 11.2.0-19ubuntu1) configuration: --prefix=/usr --extra-version=0ubuntu0.22.04.1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzimg --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-librsvg --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared libavutil 56. 70.100 / 56. 70.100 libavcodec 58.134.100 / 58.134.100 libavformat 58. 76.100 / 58. 76.100 libavdevice 58. 13.100 / 58. 13.100 libavfilter 7.110.100 / 7.110.100 libswscale 5. 9.100 / 5. 9.100 libswresample 3. 9.100 / 3. 9.100 libpostproc 55. 9.100 / 55. 9.100 Input #0, avi, from '/tmp/output/temp_masked.avi': Metadata: software : Lavf59.27.100 Duration: 00:00:06.01, start: 0.000000, bitrate: 1700 kb/s Stream #0:0: Video: mjpeg (Baseline) (MJPG / 0x47504A4D), yuvj420p(pc, bt470bg/unknown/unknown), 1940x1096, 1725 kb/s, 7.48 fps, 7.48 tbr, 7.48 tbn, 7.48 tbc Stream mapping: Stream #0:0 -> #0:0 (mjpeg (native) -> h264 (libx264)) Press [q] to stop, [?] for help [swscaler @ 0x570e87d69d00] deprecated pixel format used, make sure you did set range correctly [libx264 @ 0x570e87d31f80] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 AVX512 [libx264 @ 0x570e87d31f80] profile High, level 4.2, 4:2:0, 8-bit [libx264 @ 0x570e87d31f80] 264 - core 163 r3060 5db6aa6 - H.264/MPEG-4 AVC codec - Copyleft 2003-2021 - http://www.videolan.org/x264.html - options: cabac=1 ref=2 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=6 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=15 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=1 keyint=250 keyint_min=7 scenecut=40 intra_refresh=0 rc_lookahead=30 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00 Output #0, mp4, to '/tmp/output/masked_video.mp4': Metadata: software : Lavf59.27.100 encoder : Lavf58.76.100 Stream #0:0: Video: h264 (avc1 / 0x31637661), yuv420p(tv, bt470bg/unknown/unknown, progressive), 1940x1096, q=2-31, 7.48 fps, 14964 tbn Metadata: encoder : Lavc58.134.100 libx264 Side data: cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A frame= 1 fps=0.0 q=0.0 size= 0kB time=00:00:00.00 bitrate=N/A speed=N/A frame= 45 fps=0.0 q=-1.0 Lsize= 178kB time=00:00:05.61 bitrate= 260.5kbits/s speed=10.9x video:177kB audio:0kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 0.743007% [libx264 @ 0x570e87d31f80] frame I:1 Avg QP: 6.99 size: 5411 [libx264 @ 0x570e87d31f80] frame P:17 Avg QP:16.48 size: 4561 [libx264 @ 0x570e87d31f80] frame B:27 Avg QP:18.29 size: 3622 [libx264 @ 0x570e87d31f80] consecutive B-frames: 13.3% 8.9% 33.3% 44.4% [libx264 @ 0x570e87d31f80] mb I I16..4: 87.2% 10.8% 2.0% [libx264 @ 0x570e87d31f80] mb P I16..4: 0.4% 2.0% 1.3% P16..4: 1.7% 0.5% 0.2% 0.0% 0.0% skip:93.9% [libx264 @ 0x570e87d31f80] mb B I16..4: 0.4% 0.0% 0.2% B16..8: 2.6% 1.3% 0.4% direct: 0.1% skip:95.1% L0:50.2% L1:46.8% BI: 3.0% [libx264 @ 0x570e87d31f80] 8x8 transform intra:25.1% inter:1.7% [libx264 @ 0x570e87d31f80] coded y,uvDC,uvAC intra: 8.1% 0.0% 0.0% inter: 0.5% 0.0% 0.0% [libx264 @ 0x570e87d31f80] i16 v,h,dc,p: 91% 7% 3% 0% [libx264 @ 0x570e87d31f80] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 35% 13% 51% 0% 0% 0% 0% 0% 0% [libx264 @ 0x570e87d31f80] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 20% 22% 41% 3% 2% 2% 4% 2% 4% [libx264 @ 0x570e87d31f80] i8c dc,h,v,p: 100% 0% 0% 0% [libx264 @ 0x570e87d31f80] Weighted P-Frames: Y:0.0% UV:0.0% [libx264 @ 0x570e87d31f80] ref P L0: 48.1% 51.9% [libx264 @ 0x570e87d31f80] ref B L0: 87.9% 12.1% [libx264 @ 0x570e87d31f80] ref B L1: 97.4% 2.6% [libx264 @ 0x570e87d31f80] kb/s:240.40
Prediction
bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bdIDmen8bt47chrme0cn6f1tjyw7z4StatusSucceededSourceWebHardwareL40STotal durationCreatedInput
- video
- instruction
- Segment the person wearing sunglasses
- frame_interval
- 4
{ "video": "https://replicate.delivery/pbxt/MXbKM7pps3c43Ypbd0zwxYftWeoEFzxzDzy9pLkcrsrhzypr/sora-woman.mp4", "instruction": "Segment the person wearing sunglasses", "frame_interval": 4 }
Install Replicate’s Node.js client library:npm install replicate
Import and set up the client:import Replicate from "replicate"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", { input: { video: "https://replicate.delivery/pbxt/MXbKM7pps3c43Ypbd0zwxYftWeoEFzxzDzy9pLkcrsrhzypr/sora-woman.mp4", instruction: "Segment the person wearing sunglasses", frame_interval: 4 } } ); console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Import the client:import replicate
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", input={ "video": "https://replicate.delivery/pbxt/MXbKM7pps3c43Ypbd0zwxYftWeoEFzxzDzy9pLkcrsrhzypr/sora-woman.mp4", "instruction": "Segment the person wearing sunglasses", "frame_interval": 4 } ) print(output)
To learn more, take a look at the guide on getting started with Python.
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", "input": { "video": "https://replicate.delivery/pbxt/MXbKM7pps3c43Ypbd0zwxYftWeoEFzxzDzy9pLkcrsrhzypr/sora-woman.mp4", "instruction": "Segment the person wearing sunglasses", "frame_interval": 4 } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
response
Sure, the segmentation result is [SEG].<|end|>masked_video
{ "completed_at": "2025-02-23T17:03:57.273898Z", "created_at": "2025-02-23T17:03:27.076000Z", "data_removed": false, "error": null, "id": "men8bt47chrme0cn6f1tjyw7z4", "input": { "video": "https://replicate.delivery/pbxt/MXbKM7pps3c43Ypbd0zwxYftWeoEFzxzDzy9pLkcrsrhzypr/sora-woman.mp4", "instruction": "Segment the person wearing sunglasses", "frame_interval": 4 }, "logs": "propagate in video: 0%| | 0/38 [00:00<?, ?it/s]\npropagate in video: 21%|██ | 8/38 [00:00<00:00, 66.78it/s]\npropagate in video: 39%|███▉ | 15/38 [00:00<00:00, 37.46it/s]\npropagate in video: 53%|█████▎ | 20/38 [00:00<00:00, 32.99it/s]\npropagate in video: 63%|██████▎ | 24/38 [00:00<00:00, 31.10it/s]\npropagate in video: 74%|███████▎ | 28/38 [00:00<00:00, 29.84it/s]\npropagate in video: 84%|████████▍ | 32/38 [00:00<00:00, 29.06it/s]\npropagate in video: 92%|█████████▏| 35/38 [00:01<00:00, 28.59it/s]\npropagate in video: 100%|██████████| 38/38 [00:01<00:00, 28.25it/s]\npropagate in video: 100%|██████████| 38/38 [00:01<00:00, 31.26it/s]\nffmpeg version 4.4.2-0ubuntu0.22.04.1 Copyright (c) 2000-2021 the FFmpeg developers\nbuilt with gcc 11 (Ubuntu 11.2.0-19ubuntu1)\nconfiguration: --prefix=/usr --extra-version=0ubuntu0.22.04.1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzimg --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-librsvg --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared\nlibavutil 56. 70.100 / 56. 70.100\nlibavcodec 58.134.100 / 58.134.100\nlibavformat 58. 76.100 / 58. 76.100\nlibavdevice 58. 13.100 / 58. 13.100\nlibavfilter 7.110.100 / 7.110.100\nlibswscale 5. 9.100 / 5. 9.100\nlibswresample 3. 9.100 / 3. 9.100\nlibpostproc 55. 9.100 / 55. 9.100\nInput #0, avi, from '/tmp/output/temp_masked.avi':\nMetadata:\nsoftware : Lavf59.27.100\nDuration: 00:00:05.07, start: 0.000000, bitrate: 718 kb/s\nStream #0:0: Video: mjpeg (Baseline) (MJPG / 0x47504A4D), yuvj420p(pc, bt470bg/unknown/unknown), 1280x720, 726 kb/s, 7.50 fps, 7.50 tbr, 7.50 tbn, 7.50 tbc\nStream mapping:\nStream #0:0 -> #0:0 (mjpeg (native) -> h264 (libx264))\nPress [q] to stop, [?] for help\n[swscaler @ 0x5d8e51f77600] deprecated pixel format used, make sure you did set range correctly\n[libx264 @ 0x5d8e51f45c40] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 AVX512\n[libx264 @ 0x5d8e51f45c40] profile High, level 3.1, 4:2:0, 8-bit\n[libx264 @ 0x5d8e51f45c40] 264 - core 163 r3060 5db6aa6 - H.264/MPEG-4 AVC codec - Copyleft 2003-2021 - http://www.videolan.org/x264.html - options: cabac=1 ref=2 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=6 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=15 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=1 keyint=250 keyint_min=7 scenecut=40 intra_refresh=0 rc_lookahead=30 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00\nOutput #0, mp4, to '/tmp/output/masked_video.mp4':\nMetadata:\nsoftware : Lavf59.27.100\nencoder : Lavf58.76.100\nStream #0:0: Video: h264 (avc1 / 0x31637661), yuv420p(tv, bt470bg/unknown/unknown, progressive), 1280x720, q=2-31, 7.50 fps, 15360 tbn\nMetadata:\nencoder : Lavc58.134.100 libx264\nSide data:\ncpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A\nframe= 1 fps=0.0 q=0.0 size= 0kB time=00:00:00.00 bitrate=N/A speed=N/A\nframe= 38 fps=0.0 q=-1.0 Lsize= 47kB time=00:00:04.66 bitrate= 83.1kbits/s speed=22.5x\nvideo:46kB audio:0kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 2.798286%\n[libx264 @ 0x5d8e51f45c40] frame I:1 Avg QP: 6.28 size: 2084\n[libx264 @ 0x5d8e51f45c40] frame P:10 Avg QP:21.95 size: 1508\n[libx264 @ 0x5d8e51f45c40] frame B:27 Avg QP:23.53 size: 1085\n[libx264 @ 0x5d8e51f45c40] consecutive B-frames: 5.3% 0.0% 0.0% 94.7%\n[libx264 @ 0x5d8e51f45c40] mb I I16..4: 70.9% 27.1% 2.0%\n[libx264 @ 0x5d8e51f45c40] mb P I16..4: 0.4% 0.6% 0.4% P16..4: 1.0% 1.1% 0.3% 0.0% 0.0% skip:96.3%\n[libx264 @ 0x5d8e51f45c40] mb B I16..4: 0.3% 0.0% 0.0% B16..8: 1.6% 1.4% 0.3% direct: 0.0% skip:96.3% L0:50.4% L1:45.8% BI: 3.8%\n[libx264 @ 0x5d8e51f45c40] 8x8 transform intra:26.9% inter:1.5%\n[libx264 @ 0x5d8e51f45c40] coded y,uvDC,uvAC intra: 3.2% 0.0% 0.0% inter: 0.4% 0.0% 0.0%\n[libx264 @ 0x5d8e51f45c40] i16 v,h,dc,p: 95% 2% 3% 0%\n[libx264 @ 0x5d8e51f45c40] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 44% 5% 51% 0% 0% 0% 0% 0% 0%\n[libx264 @ 0x5d8e51f45c40] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 39% 8% 40% 3% 1% 3% 1% 4% 1%\n[libx264 @ 0x5d8e51f45c40] i8c dc,h,v,p: 100% 0% 0% 0%\n[libx264 @ 0x5d8e51f45c40] Weighted P-Frames: Y:0.0% UV:0.0%\n[libx264 @ 0x5d8e51f45c40] ref P L0: 63.9% 36.1%\n[libx264 @ 0x5d8e51f45c40] ref B L0: 81.5% 18.5%\n[libx264 @ 0x5d8e51f45c40] ref B L1: 90.9% 9.1%\n[libx264 @ 0x5d8e51f45c40] kb/s:73.34", "metrics": { "predict_time": 9.630203571, "total_time": 30.197898 }, "output": { "response": "Sure, the segmentation result is [SEG].<|end|>", "masked_video": "https://replicate.delivery/xezq/dyieaQGN5CRbViFozTRN3E2XHx4gbd4kWUfaSKbKF6r9bQSUA/masked_video.mp4" }, "started_at": "2025-02-23T17:03:47.643694Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/men8bt47chrme0cn6f1tjyw7z4", "cancel": "https://api.replicate.com/v1/predictions/men8bt47chrme0cn6f1tjyw7z4/cancel" }, "version": "12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd" }
Generated inpropagate in video: 0%| | 0/38 [00:00<?, ?it/s] propagate in video: 21%|██ | 8/38 [00:00<00:00, 66.78it/s] propagate in video: 39%|███▉ | 15/38 [00:00<00:00, 37.46it/s] propagate in video: 53%|█████▎ | 20/38 [00:00<00:00, 32.99it/s] propagate in video: 63%|██████▎ | 24/38 [00:00<00:00, 31.10it/s] propagate in video: 74%|███████▎ | 28/38 [00:00<00:00, 29.84it/s] propagate in video: 84%|████████▍ | 32/38 [00:00<00:00, 29.06it/s] propagate in video: 92%|█████████▏| 35/38 [00:01<00:00, 28.59it/s] propagate in video: 100%|██████████| 38/38 [00:01<00:00, 28.25it/s] propagate in video: 100%|██████████| 38/38 [00:01<00:00, 31.26it/s] ffmpeg version 4.4.2-0ubuntu0.22.04.1 Copyright (c) 2000-2021 the FFmpeg developers built with gcc 11 (Ubuntu 11.2.0-19ubuntu1) configuration: --prefix=/usr --extra-version=0ubuntu0.22.04.1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzimg --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-librsvg --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared libavutil 56. 70.100 / 56. 70.100 libavcodec 58.134.100 / 58.134.100 libavformat 58. 76.100 / 58. 76.100 libavdevice 58. 13.100 / 58. 13.100 libavfilter 7.110.100 / 7.110.100 libswscale 5. 9.100 / 5. 9.100 libswresample 3. 9.100 / 3. 9.100 libpostproc 55. 9.100 / 55. 9.100 Input #0, avi, from '/tmp/output/temp_masked.avi': Metadata: software : Lavf59.27.100 Duration: 00:00:05.07, start: 0.000000, bitrate: 718 kb/s Stream #0:0: Video: mjpeg (Baseline) (MJPG / 0x47504A4D), yuvj420p(pc, bt470bg/unknown/unknown), 1280x720, 726 kb/s, 7.50 fps, 7.50 tbr, 7.50 tbn, 7.50 tbc Stream mapping: Stream #0:0 -> #0:0 (mjpeg (native) -> h264 (libx264)) Press [q] to stop, [?] for help [swscaler @ 0x5d8e51f77600] deprecated pixel format used, make sure you did set range correctly [libx264 @ 0x5d8e51f45c40] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 AVX512 [libx264 @ 0x5d8e51f45c40] profile High, level 3.1, 4:2:0, 8-bit [libx264 @ 0x5d8e51f45c40] 264 - core 163 r3060 5db6aa6 - H.264/MPEG-4 AVC codec - Copyleft 2003-2021 - http://www.videolan.org/x264.html - options: cabac=1 ref=2 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=6 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=15 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=1 keyint=250 keyint_min=7 scenecut=40 intra_refresh=0 rc_lookahead=30 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00 Output #0, mp4, to '/tmp/output/masked_video.mp4': Metadata: software : Lavf59.27.100 encoder : Lavf58.76.100 Stream #0:0: Video: h264 (avc1 / 0x31637661), yuv420p(tv, bt470bg/unknown/unknown, progressive), 1280x720, q=2-31, 7.50 fps, 15360 tbn Metadata: encoder : Lavc58.134.100 libx264 Side data: cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A frame= 1 fps=0.0 q=0.0 size= 0kB time=00:00:00.00 bitrate=N/A speed=N/A frame= 38 fps=0.0 q=-1.0 Lsize= 47kB time=00:00:04.66 bitrate= 83.1kbits/s speed=22.5x video:46kB audio:0kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 2.798286% [libx264 @ 0x5d8e51f45c40] frame I:1 Avg QP: 6.28 size: 2084 [libx264 @ 0x5d8e51f45c40] frame P:10 Avg QP:21.95 size: 1508 [libx264 @ 0x5d8e51f45c40] frame B:27 Avg QP:23.53 size: 1085 [libx264 @ 0x5d8e51f45c40] consecutive B-frames: 5.3% 0.0% 0.0% 94.7% [libx264 @ 0x5d8e51f45c40] mb I I16..4: 70.9% 27.1% 2.0% [libx264 @ 0x5d8e51f45c40] mb P I16..4: 0.4% 0.6% 0.4% P16..4: 1.0% 1.1% 0.3% 0.0% 0.0% skip:96.3% [libx264 @ 0x5d8e51f45c40] mb B I16..4: 0.3% 0.0% 0.0% B16..8: 1.6% 1.4% 0.3% direct: 0.0% skip:96.3% L0:50.4% L1:45.8% BI: 3.8% [libx264 @ 0x5d8e51f45c40] 8x8 transform intra:26.9% inter:1.5% [libx264 @ 0x5d8e51f45c40] coded y,uvDC,uvAC intra: 3.2% 0.0% 0.0% inter: 0.4% 0.0% 0.0% [libx264 @ 0x5d8e51f45c40] i16 v,h,dc,p: 95% 2% 3% 0% [libx264 @ 0x5d8e51f45c40] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 44% 5% 51% 0% 0% 0% 0% 0% 0% [libx264 @ 0x5d8e51f45c40] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 39% 8% 40% 3% 1% 3% 1% 4% 1% [libx264 @ 0x5d8e51f45c40] i8c dc,h,v,p: 100% 0% 0% 0% [libx264 @ 0x5d8e51f45c40] Weighted P-Frames: Y:0.0% UV:0.0% [libx264 @ 0x5d8e51f45c40] ref P L0: 63.9% 36.1% [libx264 @ 0x5d8e51f45c40] ref B L0: 81.5% 18.5% [libx264 @ 0x5d8e51f45c40] ref B L1: 90.9% 9.1% [libx264 @ 0x5d8e51f45c40] kb/s:73.34
Prediction
bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bdID72xbcr2e79rmc0cn5dr93sa8h0StatusSucceededSourceWebHardwareL40STotal durationCreatedInput
- video
- instruction
- Describe the video
- frame_interval
- 6
{ "video": "https://replicate.delivery/pbxt/MXbIfZbIkPB7ZvfQ58hlMruwDpKJln28EfCJSCUPMaZ1oC8h/sora-otter-5.mp4", "instruction": "Describe the video", "frame_interval": 6 }
Install Replicate’s Node.js client library:npm install replicate
Import and set up the client:import Replicate from "replicate"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", { input: { video: "https://replicate.delivery/pbxt/MXbIfZbIkPB7ZvfQ58hlMruwDpKJln28EfCJSCUPMaZ1oC8h/sora-otter-5.mp4", instruction: "Describe the video", frame_interval: 6 } } ); console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Import the client:import replicate
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", input={ "video": "https://replicate.delivery/pbxt/MXbIfZbIkPB7ZvfQ58hlMruwDpKJln28EfCJSCUPMaZ1oC8h/sora-otter-5.mp4", "instruction": "Describe the video", "frame_interval": 6 } ) print(output)
To learn more, take a look at the guide on getting started with Python.
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", "input": { "video": "https://replicate.delivery/pbxt/MXbIfZbIkPB7ZvfQ58hlMruwDpKJln28EfCJSCUPMaZ1oC8h/sora-otter-5.mp4", "instruction": "Describe the video", "frame_interval": 6 } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
The video features a seal swimming in the ocean, surrounded by a vibrant and colorful underwater environment. The seal is seen swimming towards a series of floating white balls, which appear to be connected by strings. The seal gracefully moves through the water, reaching for the balls, and the scene is filled with a sense of wonder and excitement. The underwater world is depicted with a variety of colors and textures, creating a visually stunning and captivating experience for the viewer.<|end|>{ "completed_at": "2025-02-22T02:16:07.358248Z", "created_at": "2025-02-22T02:15:35.226000Z", "data_removed": false, "error": null, "id": "72xbcr2e79rmc0cn5dr93sa8h0", "input": { "video": "https://replicate.delivery/pbxt/MXbIfZbIkPB7ZvfQ58hlMruwDpKJln28EfCJSCUPMaZ1oC8h/sora-otter-5.mp4", "instruction": "Describe the video", "frame_interval": 6 }, "logs": null, "metrics": { "predict_time": 9.065061488, "total_time": 32.132248 }, "output": { "response": "The video features a seal swimming in the ocean, surrounded by a vibrant and colorful underwater environment. The seal is seen swimming towards a series of floating white balls, which appear to be connected by strings. The seal gracefully moves through the water, reaching for the balls, and the scene is filled with a sense of wonder and excitement. The underwater world is depicted with a variety of colors and textures, creating a visually stunning and captivating experience for the viewer.<|end|>", "masked_video": null }, "started_at": "2025-02-22T02:15:58.293187Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/72xbcr2e79rmc0cn5dr93sa8h0", "cancel": "https://api.replicate.com/v1/predictions/72xbcr2e79rmc0cn5dr93sa8h0/cancel" }, "version": "12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd" }
Generated inPrediction
bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bdIDm5b32ebbxxrmc0cn5drv74jp3gStatusSucceededSourceWebHardwareL40STotal durationCreatedInput
- video
- instruction
- Can you describe the video?
- frame_interval
- 6
{ "video": "https://replicate.delivery/pbxt/MXbJodN6qlN42U6BouZw4MqR26M85qAJlQaCrENG42djBKsC/sora-woman.mp4", "instruction": "Can you describe the video?", "frame_interval": 6 }
Install Replicate’s Node.js client library:npm install replicate
Import and set up the client:import Replicate from "replicate"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", { input: { video: "https://replicate.delivery/pbxt/MXbJodN6qlN42U6BouZw4MqR26M85qAJlQaCrENG42djBKsC/sora-woman.mp4", instruction: "Can you describe the video?", frame_interval: 6 } } ); console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Import the client:import replicate
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", input={ "video": "https://replicate.delivery/pbxt/MXbJodN6qlN42U6BouZw4MqR26M85qAJlQaCrENG42djBKsC/sora-woman.mp4", "instruction": "Can you describe the video?", "frame_interval": 6 } ) print(output)
To learn more, take a look at the guide on getting started with Python.
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", "input": { "video": "https://replicate.delivery/pbxt/MXbJodN6qlN42U6BouZw4MqR26M85qAJlQaCrENG42djBKsC/sora-woman.mp4", "instruction": "Can you describe the video?", "frame_interval": 6 } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
Sure, the video shows a woman walking down a busy city street at night. She is wearing a black leather jacket and a red dress, and she is carrying a black handbag. The street is illuminated by bright lights, and there are many people walking around her. The woman is walking towards the camera, and she appears to be in a hurry. The video captures the bustling atmosphere of the city at night, with the woman as the main focus.<|end|>{ "completed_at": "2025-02-22T02:16:52.651752Z", "created_at": "2025-02-22T02:16:48.367000Z", "data_removed": false, "error": null, "id": "m5b32ebbxxrmc0cn5drv74jp3g", "input": { "video": "https://replicate.delivery/pbxt/MXbJodN6qlN42U6BouZw4MqR26M85qAJlQaCrENG42djBKsC/sora-woman.mp4", "instruction": "Can you describe the video?", "frame_interval": 6 }, "logs": null, "metrics": { "predict_time": 4.276755474, "total_time": 4.284752 }, "output": { "response": "Sure, the video shows a woman walking down a busy city street at night. She is wearing a black leather jacket and a red dress, and she is carrying a black handbag. The street is illuminated by bright lights, and there are many people walking around her. The woman is walking towards the camera, and she appears to be in a hurry. The video captures the bustling atmosphere of the city at night, with the woman as the main focus.<|end|>", "masked_video": null }, "started_at": "2025-02-22T02:16:48.374996Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/m5b32ebbxxrmc0cn5drv74jp3g", "cancel": "https://api.replicate.com/v1/predictions/m5b32ebbxxrmc0cn5drv74jp3g/cancel" }, "version": "12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd" }
Generated inPrediction
bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bdID6sghyryar1rme0cn6eyrvnk29rStatusSucceededSourceWebHardwareL40STotal durationCreatedInput
- video
- instruction
- Segment the flower
- frame_interval
- 4
{ "video": "https://replicate.delivery/pbxt/MXbMlgO6lBD93p0OZrWjcQzRrGa3tuns7q7Si64C15pNs4yT/flower-6.mp4", "instruction": "Segment the flower", "frame_interval": 4 }
Install Replicate’s Node.js client library:npm install replicate
Import and set up the client:import Replicate from "replicate"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", { input: { video: "https://replicate.delivery/pbxt/MXbMlgO6lBD93p0OZrWjcQzRrGa3tuns7q7Si64C15pNs4yT/flower-6.mp4", instruction: "Segment the flower", frame_interval: 4 } } ); console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Import the client:import replicate
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", input={ "video": "https://replicate.delivery/pbxt/MXbMlgO6lBD93p0OZrWjcQzRrGa3tuns7q7Si64C15pNs4yT/flower-6.mp4", "instruction": "Segment the flower", "frame_interval": 4 } ) print(output)
To learn more, take a look at the guide on getting started with Python.
Run bytedance/sa2va-4b-video using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "bytedance/sa2va-4b-video:12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd", "input": { "video": "https://replicate.delivery/pbxt/MXbMlgO6lBD93p0OZrWjcQzRrGa3tuns7q7Si64C15pNs4yT/flower-6.mp4", "instruction": "Segment the flower", "frame_interval": 4 } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
response
Sure, [SEG].<|end|>masked_video
{ "completed_at": "2025-02-23T16:57:28.188399Z", "created_at": "2025-02-23T16:57:11.104000Z", "data_removed": false, "error": null, "id": "6sghyryar1rme0cn6eyrvnk29r", "input": { "video": "https://replicate.delivery/pbxt/MXbMlgO6lBD93p0OZrWjcQzRrGa3tuns7q7Si64C15pNs4yT/flower-6.mp4", "instruction": "Segment the flower", "frame_interval": 4 }, "logs": "propagate in video: 0%| | 0/45 [00:00<?, ?it/s]\npropagate in video: 18%|█▊ | 8/45 [00:00<00:00, 73.20it/s]\npropagate in video: 36%|███▌ | 16/45 [00:00<00:00, 37.01it/s]\npropagate in video: 47%|████▋ | 21/45 [00:00<00:00, 32.92it/s]\npropagate in video: 56%|█████▌ | 25/45 [00:00<00:00, 30.94it/s]\npropagate in video: 64%|██████▍ | 29/45 [00:00<00:00, 29.67it/s]\npropagate in video: 73%|███████▎ | 33/45 [00:01<00:00, 28.79it/s]\npropagate in video: 80%|████████ | 36/45 [00:01<00:00, 28.28it/s]\npropagate in video: 87%|████████▋ | 39/45 [00:01<00:00, 27.86it/s]\npropagate in video: 93%|█████████▎| 42/45 [00:01<00:00, 27.66it/s]\npropagate in video: 100%|██████████| 45/45 [00:01<00:00, 27.39it/s]\npropagate in video: 100%|██████████| 45/45 [00:01<00:00, 30.44it/s]\nffmpeg version 4.4.2-0ubuntu0.22.04.1 Copyright (c) 2000-2021 the FFmpeg developers\nbuilt with gcc 11 (Ubuntu 11.2.0-19ubuntu1)\nconfiguration: --prefix=/usr --extra-version=0ubuntu0.22.04.1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzimg --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-librsvg --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared\nlibavutil 56. 70.100 / 56. 70.100\nlibavcodec 58.134.100 / 58.134.100\nlibavformat 58. 76.100 / 58. 76.100\nlibavdevice 58. 13.100 / 58. 13.100\nlibavfilter 7.110.100 / 7.110.100\nlibswscale 5. 9.100 / 5. 9.100\nlibswresample 3. 9.100 / 3. 9.100\nlibpostproc 55. 9.100 / 55. 9.100\nInput #0, avi, from '/tmp/output/temp_masked.avi':\nMetadata:\nsoftware : Lavf59.27.100\nDuration: 00:00:06.04, start: 0.000000, bitrate: 2362 kb/s\nStream #0:0: Video: mjpeg (Baseline) (MJPG / 0x47504A4D), yuvj420p(pc, bt470bg/unknown/unknown), 1920x1080, 2403 kb/s, 7.45 fps, 7.45 tbr, 7.45 tbn, 7.45 tbc\nStream mapping:\nStream #0:0 -> #0:0 (mjpeg (native) -> h264 (libx264))\nPress [q] to stop, [?] for help\n[swscaler @ 0x5dcaa0724e00] deprecated pixel format used, make sure you did set range correctly\n[libx264 @ 0x5dcaa06eaf80] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 AVX512\n[libx264 @ 0x5dcaa06eaf80] profile High, level 4.0, 4:2:0, 8-bit\n[libx264 @ 0x5dcaa06eaf80] 264 - core 163 r3060 5db6aa6 - H.264/MPEG-4 AVC codec - Copyleft 2003-2021 - http://www.videolan.org/x264.html - options: cabac=1 ref=2 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=6 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=15 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=1 keyint=250 keyint_min=7 scenecut=40 intra_refresh=0 rc_lookahead=30 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00\nOutput #0, mp4, to '/tmp/output/masked_video.mp4':\nMetadata:\nsoftware : Lavf59.27.100\nencoder : Lavf58.76.100\nStream #0:0: Video: h264 (avc1 / 0x31637661), yuv420p(tv, bt470bg/unknown/unknown, progressive), 1920x1080, q=2-31, 7.45 fps, 14906 tbn\nMetadata:\nencoder : Lavc58.134.100 libx264\nSide data:\ncpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A\nframe= 1 fps=0.0 q=0.0 size= 0kB time=00:00:00.00 bitrate=N/A speed=N/A\nframe= 45 fps=0.0 q=-1.0 Lsize= 248kB time=00:00:05.63 bitrate= 360.2kbits/s speed=9.76x\nvideo:246kB audio:0kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 0.556042%\n[libx264 @ 0x5dcaa06eaf80] frame I:1 Avg QP:18.58 size: 9682\n[libx264 @ 0x5dcaa06eaf80] frame P:12 Avg QP:25.95 size: 6920\n[libx264 @ 0x5dcaa06eaf80] frame B:32 Avg QP:28.12 size: 4966\n[libx264 @ 0x5dcaa06eaf80] consecutive B-frames: 2.2% 8.9% 0.0% 88.9%\n[libx264 @ 0x5dcaa06eaf80] mb I I16..4: 20.8% 76.1% 3.1%\n[libx264 @ 0x5dcaa06eaf80] mb P I16..4: 0.5% 2.7% 1.2% P16..4: 3.7% 1.4% 0.6% 0.0% 0.0% skip:89.9%\n[libx264 @ 0x5dcaa06eaf80] mb B I16..4: 0.4% 0.0% 0.1% B16..8: 3.6% 2.5% 0.6% direct: 0.2% skip:92.5% L0:51.3% L1:44.9% BI: 3.8%\n[libx264 @ 0x5dcaa06eaf80] 8x8 transform intra:63.5% inter:2.9%\n[libx264 @ 0x5dcaa06eaf80] coded y,uvDC,uvAC intra: 6.3% 0.0% 0.0% inter: 0.9% 0.0% 0.0%\n[libx264 @ 0x5dcaa06eaf80] i16 v,h,dc,p: 78% 13% 9% 0%\n[libx264 @ 0x5dcaa06eaf80] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 44% 3% 53% 0% 0% 0% 0% 0% 0%\n[libx264 @ 0x5dcaa06eaf80] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 25% 17% 42% 3% 2% 3% 2% 3% 3%\n[libx264 @ 0x5dcaa06eaf80] i8c dc,h,v,p: 100% 0% 0% 0%\n[libx264 @ 0x5dcaa06eaf80] Weighted P-Frames: Y:0.0% UV:0.0%\n[libx264 @ 0x5dcaa06eaf80] ref P L0: 70.6% 29.4%\n[libx264 @ 0x5dcaa06eaf80] ref B L0: 85.7% 14.3%\n[libx264 @ 0x5dcaa06eaf80] ref B L1: 92.3% 7.7%\n[libx264 @ 0x5dcaa06eaf80] kb/s:333.40", "metrics": { "predict_time": 17.075671752, "total_time": 17.084399 }, "output": { "response": "Sure, [SEG].<|end|>", "masked_video": "https://replicate.delivery/xezq/ZqEKd3Gu29bzDRseblqs0gdwVLOShebPgSbplhFKglA4VQSUA/masked_video.mp4" }, "started_at": "2025-02-23T16:57:11.112727Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/6sghyryar1rme0cn6eyrvnk29r", "cancel": "https://api.replicate.com/v1/predictions/6sghyryar1rme0cn6eyrvnk29r/cancel" }, "version": "12093a2cbbd53848e1033678c0f9bc89ff9c0a390a2c416a885cb3715b9926bd" }
Generated inpropagate in video: 0%| | 0/45 [00:00<?, ?it/s] propagate in video: 18%|█▊ | 8/45 [00:00<00:00, 73.20it/s] propagate in video: 36%|███▌ | 16/45 [00:00<00:00, 37.01it/s] propagate in video: 47%|████▋ | 21/45 [00:00<00:00, 32.92it/s] propagate in video: 56%|█████▌ | 25/45 [00:00<00:00, 30.94it/s] propagate in video: 64%|██████▍ | 29/45 [00:00<00:00, 29.67it/s] propagate in video: 73%|███████▎ | 33/45 [00:01<00:00, 28.79it/s] propagate in video: 80%|████████ | 36/45 [00:01<00:00, 28.28it/s] propagate in video: 87%|████████▋ | 39/45 [00:01<00:00, 27.86it/s] propagate in video: 93%|█████████▎| 42/45 [00:01<00:00, 27.66it/s] propagate in video: 100%|██████████| 45/45 [00:01<00:00, 27.39it/s] propagate in video: 100%|██████████| 45/45 [00:01<00:00, 30.44it/s] ffmpeg version 4.4.2-0ubuntu0.22.04.1 Copyright (c) 2000-2021 the FFmpeg developers built with gcc 11 (Ubuntu 11.2.0-19ubuntu1) configuration: --prefix=/usr --extra-version=0ubuntu0.22.04.1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzimg --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-librsvg --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared libavutil 56. 70.100 / 56. 70.100 libavcodec 58.134.100 / 58.134.100 libavformat 58. 76.100 / 58. 76.100 libavdevice 58. 13.100 / 58. 13.100 libavfilter 7.110.100 / 7.110.100 libswscale 5. 9.100 / 5. 9.100 libswresample 3. 9.100 / 3. 9.100 libpostproc 55. 9.100 / 55. 9.100 Input #0, avi, from '/tmp/output/temp_masked.avi': Metadata: software : Lavf59.27.100 Duration: 00:00:06.04, start: 0.000000, bitrate: 2362 kb/s Stream #0:0: Video: mjpeg (Baseline) (MJPG / 0x47504A4D), yuvj420p(pc, bt470bg/unknown/unknown), 1920x1080, 2403 kb/s, 7.45 fps, 7.45 tbr, 7.45 tbn, 7.45 tbc Stream mapping: Stream #0:0 -> #0:0 (mjpeg (native) -> h264 (libx264)) Press [q] to stop, [?] for help [swscaler @ 0x5dcaa0724e00] deprecated pixel format used, make sure you did set range correctly [libx264 @ 0x5dcaa06eaf80] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 AVX512 [libx264 @ 0x5dcaa06eaf80] profile High, level 4.0, 4:2:0, 8-bit [libx264 @ 0x5dcaa06eaf80] 264 - core 163 r3060 5db6aa6 - H.264/MPEG-4 AVC codec - Copyleft 2003-2021 - http://www.videolan.org/x264.html - options: cabac=1 ref=2 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=6 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=15 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=1 keyint=250 keyint_min=7 scenecut=40 intra_refresh=0 rc_lookahead=30 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00 Output #0, mp4, to '/tmp/output/masked_video.mp4': Metadata: software : Lavf59.27.100 encoder : Lavf58.76.100 Stream #0:0: Video: h264 (avc1 / 0x31637661), yuv420p(tv, bt470bg/unknown/unknown, progressive), 1920x1080, q=2-31, 7.45 fps, 14906 tbn Metadata: encoder : Lavc58.134.100 libx264 Side data: cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A frame= 1 fps=0.0 q=0.0 size= 0kB time=00:00:00.00 bitrate=N/A speed=N/A frame= 45 fps=0.0 q=-1.0 Lsize= 248kB time=00:00:05.63 bitrate= 360.2kbits/s speed=9.76x video:246kB audio:0kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 0.556042% [libx264 @ 0x5dcaa06eaf80] frame I:1 Avg QP:18.58 size: 9682 [libx264 @ 0x5dcaa06eaf80] frame P:12 Avg QP:25.95 size: 6920 [libx264 @ 0x5dcaa06eaf80] frame B:32 Avg QP:28.12 size: 4966 [libx264 @ 0x5dcaa06eaf80] consecutive B-frames: 2.2% 8.9% 0.0% 88.9% [libx264 @ 0x5dcaa06eaf80] mb I I16..4: 20.8% 76.1% 3.1% [libx264 @ 0x5dcaa06eaf80] mb P I16..4: 0.5% 2.7% 1.2% P16..4: 3.7% 1.4% 0.6% 0.0% 0.0% skip:89.9% [libx264 @ 0x5dcaa06eaf80] mb B I16..4: 0.4% 0.0% 0.1% B16..8: 3.6% 2.5% 0.6% direct: 0.2% skip:92.5% L0:51.3% L1:44.9% BI: 3.8% [libx264 @ 0x5dcaa06eaf80] 8x8 transform intra:63.5% inter:2.9% [libx264 @ 0x5dcaa06eaf80] coded y,uvDC,uvAC intra: 6.3% 0.0% 0.0% inter: 0.9% 0.0% 0.0% [libx264 @ 0x5dcaa06eaf80] i16 v,h,dc,p: 78% 13% 9% 0% [libx264 @ 0x5dcaa06eaf80] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 44% 3% 53% 0% 0% 0% 0% 0% 0% [libx264 @ 0x5dcaa06eaf80] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 25% 17% 42% 3% 2% 3% 2% 3% 3% [libx264 @ 0x5dcaa06eaf80] i8c dc,h,v,p: 100% 0% 0% 0% [libx264 @ 0x5dcaa06eaf80] Weighted P-Frames: Y:0.0% UV:0.0% [libx264 @ 0x5dcaa06eaf80] ref P L0: 70.6% 29.4% [libx264 @ 0x5dcaa06eaf80] ref B L0: 85.7% 14.3% [libx264 @ 0x5dcaa06eaf80] ref B L1: 92.3% 7.7% [libx264 @ 0x5dcaa06eaf80] kb/s:333.40
Want to make some of these yourself?
Run this model