Explore

I want to…

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures, and multi-views

Latest models

Diffusion Models as Text Painters

Updated 1.7K runs

Prompt-free Diffusion

Updated 736 runs

Generate a new image from an input image with AbsoluteReality v1.0

Updated 248.2K runs

Generate a new image given any input text with AbsoluteReality v1.0

Updated 263.2K runs

Generate a new image from an input image with DreamShaper V6

Updated 130.3K runs

Generate a new image given any input text with DreamShaper V6

Updated 421.8K runs

Generate a new image from an input image with Babes 2.0

Updated 1.4M runs

Generate a new image from an input image with RPG V4

Updated 2.2K runs

Generate a new image from an input image with URPM v1.3

Updated 2.2K runs

Generate a new image from an input image with Deliberate v2

Updated 9.1K runs

Generate a new image from an input image with Edge Of Realism - EOR v2.0

Updated 527.9K runs

Generate a new image from an input image with Realistic Vision V2.0

Updated 54.4K runs

Generate a new image given any input text with Babes 2.0

Updated 25.6K runs

Generate a new image given any input text with RPG V4

Updated 58.2K runs

Generate a new image given any input text with URPM v1.3

Updated 53.7K runs

Generate a new image given any input text with Deliberate v2

Updated 609.8K runs

Generate a new image given any input text with Edge Of Realism - EOR v2.0

Updated 129K runs

Generate a new image given any input text with Realistic Vision V2.0

Updated 522.6K runs

This is a language model that can be used to obtain document embeddings suitable for downstream tasks like semantic search and clustering.

Updated 2.1M runs

Real-World Super-Resolution Models for Animation Videos

Updated 10K runs

This model detects clothing using a custom state-of-the-art clothing segmentation algorithm.

Updated 3.2K runs

Training-free Controllable Text-to-Video Generation

Updated 2.1K runs

This model is actually: prompthero / openjourney-v4

Updated 279 runs

Regression of musical arousal and valence values

Updated 5.3K runs

Classification of music approachability and engagement

Updated 6.9K runs

An EfficientNet for classifying music into 400 styles from the Discogs taxonomy

Updated 115.1K runs

My own personal attempt at Stable Diffusion

Updated 41 runs

Updated 31.3K runs

Updated 38 runs

A multi-input ControlNet model. Pass in control images and set the weights.

Updated 248 runs

Generate subtitles (.srt and .vtt) from audio files using OpenAI's Whisper models.

Updated 5K runs

Generating Conditional 3D Implicit Functions

Updated 14.5K runs

Updated 27.5K runs

Image captioning via vision-language models with instruction tuning

Updated 539.1K runs

Generate Pokémon from a text description

Updated 7.9M runs

A model for text, audio, and image embeddings in one space

Updated 3.4M runs

music label

Updated 176 runs

image tagger

Updated 37.2M runs

ControlNet annotators - the initial image that is fed into a stable diffusion pipeline with ControlNet

Updated 346 runs

Detects tents in satellite images

Updated 31 runs

album cover generator

Updated 898 runs

Image Inpainting

Updated 3.9K runs

Updated 250 runs

T5 model fine-tuned on a GPT-3.5-generated paraphrase corpus of 6.3 million unique sentences.

Updated 4.3K runs

This is a proof of concept for DreamBooth that uses tar weights

Updated 30 runs

Consistent-view characters with ControlNet and Stable Diffusion, fine-tuned on Ready Player Me characters and based on OpenJourneyV4

Updated 937 runs

Updated 477 runs

Updated 27 runs

3B parameter base version of Stability AI's language model

Updated 416 runs

Object Detector Using YOLO

Updated 598 runs