Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts, such as specific styles, characters, or objects, from a small set of example images.

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Upscale images

Models that upscale low-quality images into high-quality ones.

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Practical Image Restoration Algorithms for General/Anime Images

Updated 7.1M runs

Use Stable Diffusion and aesthetic CLIP embeddings to guide boring outputs to be more aesthetically pleasing.

Updated 4.3K runs

Activating More Pixels in Image Super-Resolution Transformer

Updated 25.1K runs

End-to-End Document Image Enhancement Transformer

Updated 4.3K runs

(development branch) Inpainting for Stable Diffusion

Updated 13.4K runs

Entity linking to Wikipedia pages using Facebook's GENRE

Updated 43 runs

[CASUAL] A Neural Language Style Transfer framework to transfer styles on natural language text

Updated 28 runs

[PASSIVE VOICE] A Neural Language Style Transfer framework to transfer styles on natural language text

Updated 73 runs

A Neural Language Style Transfer framework to transfer styles on natural language text

Updated 113 runs

Get an approximate text prompt, with style, matching an image. (Optimized for Stable Diffusion (CLIP ViT-L/14).)

Updated 2.6M runs

Sentence embedding using mpnet

Updated 45.8K runs

Embedding generation using Microsoft's xtremedistil-l6-h384-uncased model

Updated 18 runs

Updated 2.6K runs

Updated 54 runs

Text summarization using brio-xsum-cased model

Updated 501 runs

Pose-Invariant Hairstyle Transfer

Updated 9.8K runs

Colab version of AI Dungeon

Updated 366 runs

Generate music based on the Monkey Island theme using the FIGARO model

Updated 305 runs

Generate 768px images from text using CompVis `retrieval-augmented-diffusion`

Updated 38.4K runs

Monkey Island database for Retrieval-augmented Diffusion model

Updated 622 runs

Inpainting using Denoising Diffusion Probabilistic Models

Updated 4K runs

Unsupervised Night Image Enhancement

Updated 42.1K runs

Text-to-image with latent diffusion

Updated 4.1K runs

Panoptic Scene Graph Generation

Updated 1.5K runs

DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation

Updated 5.3K runs

Text-to-image generation

Updated 1.8K runs

VQ-Diffusion for Text-to-Image Synthesis

Updated 20.7K runs

Create a 3D photo from single in-the-wild 2D images

Updated 5.7K runs

Updated 7.1K runs

Visualize the encoded latents of an image

Updated 72.7K runs

Generate a painting using text.

Updated 133.6K runs

Generate a logo using text.

Updated 349.1K runs

CompVis `latent-diffusion text2im` finetuned for inpainting.

Updated 8K runs

Composable Diffusion

Updated 846 runs

Decoding Micromotion in Low-dimensional Latent Spaces from StyleGAN

Updated 8.2K runs

Language-Free Training of a Text-to-Image Generator with CLIP

Updated 960 runs

Colorization using a Generative Color Prior for Natural Images

Updated 579.1K runs

Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

Updated 140.2K runs

Image Manipulation with Diffusion Autoencoders

Updated 17.1K runs

Fast, minimal port of DALL·E Mini to PyTorch

Updated 505.6K runs

Generate speech from text, clone voices from mp3 files. From James Betker AKA "neonbjb".

Updated 169.8K runs

Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

Updated 231.9K runs

Generate a collection of logos based on your text input. Use longer and more detailed inputs for better results. The first run takes a few minutes to load the model; subsequent generations are much faster.

Updated 4.4K runs

Contrastive Coherence Preserving Loss for Versatile Style Transfer

Updated 1.9K runs

CLIP Guided latent k-diffusion

Updated 7.4K runs

Generate images using a variety of techniques. Powered by DiscoArt.

Updated 64.6K runs

Generate images from text using CLIP guided latent diffusion

Updated 8.3K runs