lucataco/kosmos-2

Grounding Multimodal Large Language Models to the World

Public
1.9K runs

Want to make some of these yourself?

Run this model