This model runs on CPU hardware. We don't yet have enough runs of this model to provide performance information.

Batch image captioning

A cog model for batch image captioning using various AI from OpenAI, Anthropic, and Google’s Generative AI,

Features

Process multiple images from a ZIP archive
supports png, jpg, jpeg, webp
Optional image resizing for more cost-effective captioning
Customizable caption prefixes and suffixes
Support for multiple AI models:
OpenAI: GPT-4 and variants
Anthropic: Claude-3.5, Claude-3 variants
Google: Gemini-1.5 variants
Flexible system and message prompts
Error handling and retry mechanism
Output as a ZIP file containing captions that match image filenames as well as a CSV summary