lucataco / paligemma-3b-pt-224

PaliGemma 3B, an open VLM by Google, pre-trained with 224*224 input images and 128 token input/output text sequences

  • Public
  • 476 runs
  • GitHub
  • Paper
  • License
  1. Author
    @lucataco

    c519755c

    Latest