Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Official models

Official models are always on, maintained, and have predictable pricing.

View all official models

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Outpaint an image using controlnet union for SDXL.

Updated 7.5K runs

Bilateral Reference for High-Resolution Dichotomous Image Segmentation (CAAI AIR 2024)

Updated 2.6M runs

A fully open-sourced, large flow-based text-to-image generation model

Updated 416 runs

LLM-powered applications are susceptible to prompt attacks, which are prompts intentionally designed to subvert the developer’s intended behavior of the LLM

Updated 28 runs

Updated 31 runs

Personalized Image Filters at Your Fingertips

Updated 100 runs

Real-ESRGAN with optional face correction and adjustable upscale

Updated 68.1M runs

Real-ESRGAN for image upscaling on an A100

Updated 14.1M runs

Synthetic image detection and model identification

Updated 1.4K runs

Fork of https://replicate.com/zsxkib/ic-light that allows any image resolution

Updated 16.5K runs

Upscale pictures and videos using the nunif repo (formerly waifu2x).

Updated 808 runs

PrometheusV1 is presumed to be the first full rank finetune of Playground v2.5

Updated 169 runs

Updated 78 runs

ProteusV0.5 is the latest full release built as a sophisticated enhancement over OpenDalleV1.1

Updated 3.3K runs

Meta's flagship 405 billion parameter language model, fine-tuned for chat completions

Updated 5.8M runs

dolly-v2-12b, just for testing

Updated 17 runs

Ultra high resolution images (up to 4096x4096) based on Stable Cascade

Updated 8.4K runs

Seamlessly create stunning product shots by blending with inspirational references for a fresh, modern look

Updated 456 runs

Detect hate speech or toxic comments in tweets/texts

Updated 112.8K runs

Kolors with style transfer, composition transfer and other IPAdapter techniques

Updated 17.7K runs

Largest completely open sourced flow-based generation model that is capable of text-to-image generation

Updated 7.8K runs

A large-scale text-to-image generation model based on latent diffusion, developed by the Kuaishou Kolors team

Updated 32.8K runs

Visual instruction tuning towards large language and vision models with GPT-4 level capabilities

Updated 27.4M runs

Generate seamless 360 photos using SDXL

Updated 416 runs

Face Restoration

Updated 4.8K runs

A text-to-image model with greatly improved performance in image quality, typography, complex prompt understanding, and resource-efficiency

Updated 1.6M runs

MimicMotion: High-quality human motion video generation with pose-guided control

Updated 2.5K runs

remove background for retailer product images

Updated 67 runs

Make realistic images of real people instantly (w/ ip-adapter-plus-face_sdxl_vit-h)

Updated 4.2K runs

Updated 84 runs

PixArt Sigma 900M is a text-to-image generation model based on the PixArt Sigma architecture

Updated 2.2K runs

Updated 46.5K runs

araby.ai oneshot video faceswap

Updated 23.4K runs

MARS5, a fully open-source (commercially usable) voice-cloning/TTS with break-through prosody and realism.

Updated 672 runs

for backsound

Updated 119 runs

audio to srt

Updated 30 runs

Cog wrapper for Ollama llama3:70b

Updated 6.6K runs

Cog wrapper for Ollama llama3:8b

Updated 20 runs

Input a video. Ask anything about it

Updated 3.5K runs

YOLOv10: Real-Time End-to-End Object Detection

Updated 313 runs

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Updated 500 runs

Take audio from one video and add it to a second video. Good for adding back audio to liveportrait.

Updated 207 runs

Change the fps of a video without changing its length or speed

Updated 111 runs

Portrait animation using a driving video source

Updated 82.2K runs

Efficient Portrait Animation with Stitching and Retargeting Control

Updated 1.2K runs

Kolors is a SOTA base image model for high quality image generation

Updated 1.2K runs

Updated 15 runs