lucataco / bulk-video-caption

Video Preprocessing tool for captioning multiple videos using GPT, Claude or Gemini

  • Public
  • 121 runs
  • GitHub
  • License
Iterate in playground

Run time and cost

This model runs on CPU hardware. We don't yet have enough runs of this model to provide performance information.

Readme

Batch video caption

A cog model for batch video captioning using various AI from OpenAI, Anthropic, and Google

Features

  • Process multiple images from a ZIP archive
  • supports mov, mp4
  • Customizable caption prefixes and suffixes
  • Support for multiple AI models:
    • OpenAI: GPT-4 and variants
    • Anthropic: Claude-3.5, Claude-3 variants
    • Google: Gemini-1.5 variants
  • Flexible system prompts
  • Error handling and retry mechanism
  • Output as a ZIP file containing captions that match image filenames as well as an optional CSV summary