cjwbw/pix2struct

Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding

Public
6.1K runs

Want to make some of these yourself?

Run this model