Explore

I want to…

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Latest models

Animating prompts with stable diffusion

Updated 253.8K runs

Activating More Pixels in Image Super-Resolution Transformer

Updated 25K runs

End-to-End Document Image Enhancement Transformer

Updated 3.7K runs

(development branch) Inpainting for Stable Diffusion

Updated 13.4K runs

Entity linking to wikipedia pages using facebook's GENRE

Updated 41 runs

[CASUAL] A Neural Language Style Transfer framework to transfer styles on natural language text

Updated 27 runs

[PASSIVE VOICE] A Neural Language Style Transfer framework to transfer styles on natural language text

Updated 72 runs

A Neural Language Style Transfer framework to transfer styles on natural language text

Updated 110 runs

Get an approximate text prompt, with style, matching an image. (Optimized for stable-diffusion (clip ViT-L/14))

Updated 2.6M runs

Sentence embedding using mpnet

Updated 45.8K runs

Embedding generation using microsoft's xtremedistil-l6-h384-uncased model

Updated 16 runs

Updated 2.6K runs

Updated 53 runs

Text summarization using brio-xsum-cased model

Updated 488 runs

Pose-Invariant Hairstyle Transfer

Updated 9.1K runs

Colab version of AI Dungeon

Updated 366 runs

Generate music based on the Monkey Island theme using the FIGARO model

Updated 305 runs

Generate 768px images from text using CompVis `retrieval-augmented-diffusion`

Updated 38.4K runs

Monkey Island database for Retrieval-augmented Diffusion model

Updated 622 runs

Inpainting using Denoising Diffusion Probabilistic Models

Updated 3.9K runs

Unsupervised Night Image Enhancement

Updated 41.2K runs

text-to-image with latent diffusion

Updated 4.1K runs

Panoptic Scene Graph Generation

Updated 1.3K runs

DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation

Updated 5.2K runs

text-to-image generation

Updated 1.7K runs

VQ-Diffusion for Text-to-Image Synthesis

Updated 20.7K runs

Create a 3D photo from single in-the-wild 2D images

Updated 5.7K runs

Updated 5.7K runs

Visualize the encoded latents of an image

Updated 72.7K runs

Generate a painting using text.

Updated 133.6K runs

Generate a logo using text.

Updated 348K runs

CompVis `latent-diffusion text2im` finetuned for inpainting.

Updated 8K runs

Composable Diffusion

Updated 845 runs

Decoding Micromotion in Low-dimensional Latent Spaces from StyleGAN

Updated 8.1K runs

Language-Free Training of a Text-to-Image Generator with CLIP

Updated 955 runs

Colorization using a Generative Color Prior for Natural Images

Updated 483.1K runs

Global Tracking Transformers

Updated 143 runs

Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

Updated 139.4K runs

Image Manipulatinon with Diffusion Autoencoders

Updated 16.8K runs

Fast, minimal port of DALL·E Mini to PyTorch

Updated 503.2K runs

Generate speech from text, clone voices from mp3 files. From James Betker AKA "neonbjb".

Updated 164.4K runs

Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

Updated 175.2K runs

Generate a collection of logos based on your text input. Use longer and more detailed inputs for better results. The first time it takes a few minutes to load the model. Subsequent generations are much faster.

Updated 4.4K runs

Contrastive Coherence Preserving Loss for Versatile Style Transfer

Updated 1.9K runs

CLIP Guided latent k-diffusion

Updated 7.4K runs

Generate images using a variety of techniques - Powered by Discoart

Updated 64.3K runs

Generate images from text using CLIP guided latent diffusion

Updated 8.3K runs

Text-to-video generation

Updated 32.7K runs

Cartoonifies a video

Updated 13.7K runs

Cartoonifies an image

Updated 4.2K runs