adirik/bunny-phi-2-siglip

Lightweight multimodal model for visual question answering, reasoning and captioning

Public
7.8K runs

Want to make some of these yourself?

Run this model