Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Official models

Official models are always on, maintained, and have predictable pricing.

View all official models

I want to…

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Latest models

Realism XL Model (Text2Img, Img2Img and Inpainting)

Updated 279.6K runs

Detects if a picture has anime face.

Updated 28K runs

Babes XL Model (Text2Img, Img2Img and Inpainting)

Updated 7.9K runs

The current model is used for graphics replacement processing

Updated 657.9K runs

Upload an image or video, and Video-LLaVa will give you a text description of what it "sees."

Updated 100 runs

without examination qwen2.5 32b

Updated 796 runs

FLUX.1 [dev] (LoRA) with several optimizations such as FP8 Quantization

Updated 76 runs

Clean Text from Manhwa/Manhua

Updated 9 runs

# Interior Decoration Space Scaling - Second Use Case

Updated 74 runs

a model to get images

Updated 277 runs

Updated 416 runs

This model is used to generate speech

Updated 35 runs

A F5-TTS fine-tuned for Spanish

Updated 547 runs

Updated 11 runs

Updated 24 runs

Dreamlike Diffusion Model for Splurge Art

Updated 2.5K runs

From Sketch to Reality: Transforming Outlines into Lifelike Images

Updated 50.2K runs

baby transformer for blog post

Updated 30 runs

staging testing

Updated 283 runs

Document translation with contextual integrity.

Updated 57 runs

Align text to audio with exact word timings. All characters supported!

Updated 113.4K runs

Projection module trained to add vision capabilties to Llama 3 using SigLIP

Updated 5.6K runs

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Updated 23 runs

Apple's monocular depth estimation foundation model (Depth Pro)

Updated 1.6K runs

OmniGen: Unified Image Generation

Updated 12.5K runs

Create audio clips from text

Updated 9 runs

Explorador FLUX.1-Dev LoRA

Updated 88 runs

Updated 61 runs

Fine-tune StableDiffusion3.5-Large with Hugging Face Diffusers

Updated 643 runs

Updated 24 runs

Run any python code

Updated 6.6K runs

Ostris AI-Toolkit for StableDiffusion3.5-Large LoRA Training

Updated 302 runs

2.5 billion parameter image model with improved MMDiT-X architecture

Updated 50.2K runs

Updated 767 runs

Analyzes music to determine song structure, bpm, downbeats, and demuxes audio

Updated 659 runs

Sayak Paul's cartoonizer, deployed to replicate. Here's the model: https://huggingface.co/instruction-tuning-sd/cartoonizer

Updated 184 runs

Updated 13 runs

flux.1-lite-8B-alpha by Freepik

Updated 330 runs

Updated 53 runs

Stable Diffusion 3.5 Large - LoRA Explorer

Updated 2K runs

One shot portrait maker.

Updated 33.9K runs

Remove Background of video and add yours

Updated 371 runs

Updated 580 runs

Updated 80 runs

Powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text prompt

Updated 135 runs

fancyfeast/joytag

Updated 18.6K runs

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching. Voice cloning

Updated 20.8K runs

Updated 137 runs

ChatTTS is a text-to-speech model designed specifically for dialogue scenarios such as LLM assistant.

Updated 133 runs