Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Official models

Official models are always on, maintained, and have predictable pricing.

I want to…

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Latest models

staging testing

Updated 283 runs

Document translation with contextual integrity.

Updated 57 runs

Align text to audio with exact word timings. All characters supported!

Updated 113.4K runs

Projection module trained to add vision capabilities to Llama 3 using SigLIP

Updated 5.6K runs

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Updated 23 runs

Apple's monocular depth estimation foundation model (Depth Pro)

Updated 1.6K runs

OmniGen: Unified Image Generation

Updated 12.5K runs

Create audio clips from text

Updated 15 runs

Explorador FLUX.1-Dev LoRA

Updated 88 runs

Updated 61 runs

Fine-tune StableDiffusion3.5-Large with Hugging Face Diffusers

Updated 652 runs

Updated 24 runs

Run any python code

Updated 6.6K runs

Ostris AI-Toolkit for StableDiffusion3.5-Large LoRA Training

Updated 316 runs

2.5 billion parameter image model with improved MMDiT-X architecture

Updated 55.6K runs

Updated 791 runs

Analyzes music to determine song structure, bpm, downbeats, and demuxes audio

Updated 664 runs

Sayak Paul's cartoonizer, deployed to Replicate. Here's the model: https://huggingface.co/instruction-tuning-sd/cartoonizer

Updated 185 runs

Updated 14 runs

flux.1-lite-8B-alpha by Freepik

Updated 330 runs

Updated 54 runs

Stable Diffusion 3.5 Large - LoRA Explorer

Updated 2K runs

One shot portrait maker.

Updated 34.6K runs

Remove the background of a video and add your own

Updated 376 runs

Updated 595 runs

Updated 80 runs

Powerful text-to-video model that generates high-quality videos up to 6 seconds long at 15 FPS and 720p resolution from a simple text prompt

Updated 135 runs

fancyfeast/joytag

Updated 18.6K runs

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching. Voice cloning

Updated 20.9K runs

Updated 161 runs

ChatTTS is a text-to-speech model designed specifically for dialogue scenarios such as LLM assistants.

Updated 134 runs

A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, with a focus on fewer inference steps

Updated 606.1K runs

A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, thanks to Query-Key Normalization.

Updated 1.5M runs

A fast image model with state of the art inpainting, prompt comprehension and text rendering.

Updated 2.2M runs

An excellent image model with state of the art inpainting, prompt comprehension and text rendering

Updated 1.5M runs

Lip Read silent videos with AI

Updated 2.3K runs

Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Updated 58 runs

Depth Any Video with Scalable Synthetic Data

Updated 168 runs

Check our flamel.app

Updated 1.9K runs

Tells the weather, given the name of a city

Updated 588 runs

Efficient Visual Generation with Hybrid Autoregressive Transformer

Updated 152 runs

CogVideoX Keyframe Interpolation by Zhengcong Fei

Updated 233 runs

Ollama Nemotron 70b

Updated 8.8K runs

OpenFLUX.1 (Beta v0.1.0) is a fine-tune of the FLUX.1-Schnell model with the distillation trained out of it

Updated 780 runs

8-step distilled LoRA for the FLUX.1-dev model, released by the Alimama-Creative Team

Updated 2K runs

Granite-3.0-8B-Instruct is a lightweight and open-source 8B parameter model designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

Updated 181.4K runs

Granite-3.0-2B-Instruct is a lightweight and open-source 2B parameter model designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

Updated 420.3K runs

FLUX.1 Dev with STOIQO NewReality

Updated 1.9K runs

Finer and Faster Text-to-Image Generation via Relay Diffusion

Updated 47 runs