Explore

I want to…

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Latest models

DeepFloyd IF model, a state-of-the-art text-to-image synthesis model that generates high-quality, photorealistic images based on your text prompts.

Updated 15.8K runs

Updated 91 runs

datasets: Flickr8k

Updated 11.1K runs

The DeepFloyd IF model has been initially released as a non-commercial research-only model. Please make sure you read and abide to the license before using it.

Updated 2M runs

A language model by Google for tasks like classification, summarization, and more

Updated 144.1K runs

🔊 Text-Prompted Generative Audio Model

Updated 273.7K runs

Whisper transcription plus speaker diarization

Updated 22.2K runs

Tryout SegmentAnything Model (SAM) by Meta.

Updated 67.4K runs

image captioning

Updated 1.2K runs

Generate image prompts for Midjourney. Prefix inputs with "Image: "

Updated 52.9K runs

Tryout SegmentAnything Model (SAM) by Meta.

Updated 2.7K runs

🔊 Text-Prompted Generative Audio Model Topics Resources

Updated 1.1K runs

MiniGPT-4 w/ Vicuna-7B (Image Question/Captioning Use)

Updated 9.9K runs

MiniGPT-4 w/ Vicuna-13B (Image Question/Captioning Use)

Updated 52K runs

Custom model trained with dreambooth

Updated 226 runs

Train your own custom Stable Diffusion model using a small set of images

Updated 295.3K runs

Lora & openjourney V4

Updated 18.8K runs

7 billion parameter version of Stability AI's language model

Updated 112.5K runs

Extract structured data from receipt images using Donut 🍩 (Document Understanding Transformer)

Updated 2.1K runs

Updated 159 runs

A stable diffusion model trained on pictures from my buddy Toshiro (truly the best boy there is)

Updated 108 runs

Updated 420 runs

Deforum Stable Diffusion

Updated 71.7K runs

Nightly release of ControlNet 1.1

Updated 8.2K runs

openai/whisper with exposed settings for word_timestamps

Updated 557.3K runs

Updated 150 runs

SD 1.5 trained with +124k MJv4 images by PromptHero

Updated 237.7K runs

Helps you with work

Updated 1.2K runs

Adding semantic labels for segment anything

Updated 22.3K runs

ControlNet with SD 2.1

Updated 15.2K runs

llama-7b trained on the Memory Alpha Star Trek Wiki

Updated 125 runs

Sketch2Image

Updated 205 runs

观照AI

Updated 5.1K runs

Get image quality scores

Updated 1.4K runs

The Picsart Text2Video-Zero model leverages the power of existing text-to-image synthesis methods (e.g., Stable Diffusion), making them suitable for the video domain

Updated 2.2K runs

gpt-j-6b trained on the Memory Alpha Star Trek Wiki

Updated 431 runs

The Picsart Text2Video-Zero model leverages the power of existing text-to-image synthesis methods (e.g., Stable Diffusion), making them suitable for the video domain.

Updated 13.1K runs

Updated 309 runs

flan-t5-xl trained on the Memory Alpha Star Trek Wiki

Updated 131 runs

Stable Diffusion Meets Karlo: a combination of the Karlo CLIP image embedding prior, and Stable Diffusion v2.1-768.

Updated 986 runs

Sketch2Image

Updated 302 runs

Text-to-Image Diffusion Models are Zero-Shot Video Generators

Updated 41.1K runs

A large language model by EleutherAI

Updated 9.2K runs

A language model for tasks like classification, summarization, and more.

Updated 1.4K runs

SegmentAnything Model (SAM) automatic mask generator

Updated 3.7K runs

Transform your image editing experience with our AI generative model-based image inpainting solution by EpochsAI

Updated 6.4K runs

Updated 24.1K runs

Updated 638 runs

Updated 132 runs

Stable Diffusion model - (openjourney-v4)

Updated 571 runs