microsoft / omniparser-v2

OmniParser is a screen parsing tool to convert general GUI screen to structured elements.

  • Public
  • 31.6K runs
  • GitHub
  • Weights
  • Paper
  • License

Want to make some of these yourself?

Run this model