Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Official models

Official models are always on, maintained, and have predictable pricing.


I want to…

Upscale images

Models that upscale low-quality images into high-quality ones.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise.

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate.

Latest models

Masked-attention Mask Transformer for Universal Image Segmentation

658 runs

Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement

358 runs

94 runs

147 runs

Generate Pokémon with Projected GAN

9.7K runs

Text-Guided Image Generation and Manipulation

824 runs

A Single Model for Many Visual Modalities

247 runs

Supervised Weakly from hashtAGs

294 runs

Classify numerical digits.

115 runs

democratizing automatic music transcription

3K runs

Uses Pixray to generate an image from a text prompt

148.4K runs

Training-Free Text-to-Image Generation

2.4K runs

1.7K runs

750 runs

Image generation from Wav2CLIP through VQGAN-CLIP

896 runs

Guide Style Transfer with CLIP loss

1.3K runs

Baseline models demo of the IEEE L3DAS22 Challenge

228 runs

Simple example of a Cog model that produces Markdown output

23 runs

Huan's first cog with Replicate.AI

35 runs

Deep Halftoning with Reversible Binary Pattern

364 runs

Guide a StyleGAN3 trained on pictures of mannequins with CLIP.

850 runs

Guesses your age based on a photo

421 runs

Age prediction using CLIP

528 runs

Selfie to anime

3.1K runs

Generates Pokémon sprites from a prompt

4.9K runs

Disentangled face manipulation using CLIP-based annotations

1.8K runs

Scaling-up Disentanglement for Image Translation

984 runs

Image generation with CLIP + VQGAN / PixelDraw

6.7K runs

Instance-Conditioned GAN

26.7K runs

Generates images with VQGAN and CLIP

6.6K runs

Extracts "bass", "drums", "other" and "vocals" tracks from a mixed audio track

170 runs

Builds and trains a speech emotion recognizer that predicts human emotions, using Python, scikit-learn, and Keras

244 runs

Photo to cartoon translation

4K runs

A Glow-based Waveform Generative Model for Audio Super-Resolution. Intelligently upsamples audio by 2x resolution

1.5K runs

PyTorch implementation of state-of-the-art music tagging models 🎶

1.9K runs

Text-to-image synthesis using contrastive learning

1.2K runs

A Fast and Stable GAN for Small and High Resolution Imagesets

3.3K runs