daanelson/imagebind

A model for text, audio, and image embeddings in one space

Public
9.4M runs

Want to make some of these yourself?

Run this model