lucataco/paligemma-3b-pt-224

PaliGemma 3B, an open VLM by Google, pre-trained with 224*224 input images and 128 token input/output text sequences

Public
1.4K runs
  1. Author
    @lucataco

    c519755c

    Latest