Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Official models

Official models are always on, maintained, and have predictable pricing.

I want to…

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Latest models

Qwen2.5-Coder-32B-Instruct is a state-of-the-art, open-source large language model (LLM) with 32 billion parameters. Part of the Qwen2.5-Coder series, it is designed specifically for coding tasks.

Updated 83 runs

Accelerated variant of Photon prioritizing speed while maintaining quality

Updated 94K runs

High-quality image generation model optimized for creative professional workflows and ultra-high fidelity outputs

Updated 936.6K runs

The Llama 3.2 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction-tuned generative models in 1B and 3B sizes (text in/text out).

Updated 30 runs

A fork of FLUX PuLID that supports multiple IDs. Use it with a depth map and define a bounding box for each face.

Updated 2.5K runs

Generate 4s and 6s videos from a prompt or image

Updated 10.8K runs

Updated 385 runs

FLUX.1-dev: Hyper-SD (8 steps) + InstantX IP-Adapter + PuLID + depth ControlNet

Updated 228 runs

A version of Mochi 1 (a text-to-video model) that supports inference with fine-tuned LoRAs

Updated 100 runs

Let Vision Language Models Reason Step-by-Step

Updated 40 runs

Mochi 1 preview is an open video generation model with high-fidelity motion and strong prompt adherence in preliminary evaluation

Updated 2.8K runs

SVDQuant Optimized Flux.Schnell

Updated 30 runs

SmolVLM-Instruct by HuggingFaceTB

Updated 1.1K runs

AnimateDiff-Lightning: Cross-Model Diffusion Distillation

Updated 46 runs

Updated 526 runs

Jina-CLIP v2: 0.9B multimodal embedding model with 89-language multilingual support, 512x512 image resolution, and Matryoshka representations

Updated 216.1K runs

SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory

Updated 151 runs

Segment Anything with prompts

Updated 1.4M runs

Convert speech in audio to text

Updated 94.5M runs

Anima Pencil XL v5 Model (Text2Img, Img2Img and Inpainting)

Updated 18.8K runs

Music generation with fine-tuned Stable Audio

Updated 8.4K runs

Create music with open source

Updated 69K runs

DiT-based video generation model for generating high-quality videos in real-time

Updated 3.2K runs

Pencil XL v2 Model (Text2Img, Img2Img and Inpainting)

Updated 4.6K runs

Unlimited XL Model (Text2Img, Img2Img and Inpainting)

Updated 30.4K runs

Updated 1.4K runs

A model that uses microsoft/Florence-2-large to create masks for watermarked images

Updated 60 runs

xl

Updated 318 runs

Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in Text-to-Image Generation

Updated 53K runs

Playground v2.0: A diffusion-based text-to-image generation model trained from scratch by the research team at Playground

Updated 60 runs

Kolors: Effective Training of Diffusion Model for Photorealistic Text-to-Image Synthesis

Updated 78 runs

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion

Updated 19 runs

Würstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models

Updated 71 runs

AuraFlow: Fully open-sourced flow-based text-to-image generation model

Updated 121 runs

AuraFlow: Fully open-sourced flow-based text-to-image generation model

Updated 30 runs

AuraFlow: Fully open-sourced flow-based text-to-image generation model

Updated 19 runs

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator

Updated 15 runs

Trajectory Consistency Distillation: Improved Latent Consistency Distillation by Semi-Linear Consistency Function with Trajectory Mapping

Updated 21 runs

SDXL-Lightning: Progressive Adversarial Diffusion Distillation

Updated 38 runs

Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Updated 35 runs

Improved Distribution Matching Distillation for Fast Image Synthesis

Updated 81 runs

Adversarial Diffusion Distillation

Updated 126 runs

Updated 4 runs

Phased Consistency Model

Updated 30 runs

Latent Consistency Models: Synthesizing High-Resolution Images with Few-step Inference

Updated 125 runs

Template project for running fast video effects entirely on the GPU, with fast GPU encoding and decoding

Updated 18 runs

2DN XL Model (Text2Img, Img2Img and Inpainting)

Updated 230 runs

Mistoon Anime XL Model (Text2Img, Img2Img and Inpainting)

Updated 22.4K runs

Realism XL Model (Text2Img, Img2Img and Inpainting)

Updated 278.8K runs