zsxkib/uform-gen

🖼️ Super fast 1.5B Image Captioning/VQA Multimodal LLM (Image-to-Text) 🖋️

Public
2.3K runs

Want to make some of these yourself?

Run this model