Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Official models

Official models are always on, maintained, and have predictable pricing.

View all official models

I want to…

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Latest models

DeepSeek-V3-0324 is the leading non-reasoning model, a milestone for open source

Updated 1.8M runs

BROKEN - DO NOT USE!

Updated 189 runs

Best-in-class clothing virtual try on in the wild (non-commercial use only)

Updated 845.7K runs

0.5B

Updated 209 runs

Updated 241 runs

8B TTS

Updated 77 runs

Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis

Updated 4M runs

Recraft V3 SVG (code-named red_panda) is a text-to-image model with the ability to generate high quality SVG images including logotypes, and icons. The model supports a wide list of styles.

Updated 157.6K runs

Affordable and fast vector images

Updated 33.4K runs

Affordable and fast images

Updated 199.4K runs

Updated 400 runs

The Fish Speech V1.1 model.

Updated 190 runs

Generates MagicaVoxel VOX models, using flux dev + hunyuan3d-2. Can generate high detail and low detail models at varying resolutions.

Updated 112 runs

RF-DETR: SOTA Real-Time Object Detection Model

Updated 51 runs

epicrealism-naturalsinfinal-SD1.5-by-epinikion + perfectdeliberate by Desync + More Details by Lykon

Updated 61 runs

Updated 862 runs

Updated 109 runs

Extract the first or last frame from any video file as a high-quality image

Updated 789 runs

CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and audio inputs

Updated 490 runs

flux-1.dev

Updated 20 runs

Updated 501 runs

Fish Speech V1.5-SOTA Open Source TTS

Updated 412 runs

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Updated 2.8K runs

Updated 141.9K runs

Orpheus 3B - high quality, emotive Text to Speech

Updated 17.2K runs

Updated 137.5K runs

CosyVoice2-0.5B-Scalable Streaming Speech Synthesis with Large Language Models

Updated 1K runs

A model Flux.1-dev-Controlnet-Upscaler by www.androcoders.in

Updated 482 runs

Realistic Inpainting with ControlNET (M-LSD + SEG)

Updated 506.8K runs

Hunyuan3D-2mv is finetuned from Hunyuan3D-2 to support multiview controlled shape generation.

Updated 1.5K runs

LatentSync: generate high-quality lip sync animations

Updated 45K runs

detect correct orientation of images

Updated 18 runs

Hyper FLUX 8-step by ByteDance

Updated 14.2M runs

ShieldGemma 2 is a model trained on Gemma 3's 4B IT checkpoint for image safety classification across key categories that takes in images and outputs safety labels per policy.

Updated 202 runs

Fast, efficient image variation model for rapid iteration and experimentation.

Updated 44.5K runs

Open-weight image variation model. Create new versions while preserving key elements of your original.

Updated 236.6K runs

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 1B runs

Run Wan2.1 14b or 1.3b with a lora

Updated 26.3K runs

Wan2.1 14B 480p LoRA inference via Diffusers (Work in progress)

Updated 526 runs

Open-weight depth-aware image generation. Edit images while preserving spatial relationships.

Updated 448K runs

Open-weight edge-guided image generation. Control structure and composition using Canny edge detection.

Updated 112.3K runs

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Updated 10.3K runs

a test run for hello world

Updated 6 runs