lucataco/paligemma-3b-pt-224

PaliGemma 3B, an open vision-language model (VLM) by Google, pre-trained with 224×224 input images and 128-token input/output text sequences

Public
1.4K runs
