adirik / bunny-phi-2-siglip

Lightweight multimodal model for visual question answering, reasoning and captioning (Updated 1 year, 3 months ago)

  • Public
  • 7.8K runs