joehoover / mplug-owl

An instruction-tuned multimodal large language model that generates text based on user-provided prompts and images

No versions have been pushed to this model yet.