Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model on a small set of example images to recognize and generate new concepts, such as specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Official models

Official models are always on, maintained, and have predictable pricing.

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

SDXL Inpainting by the HF Diffusers team

Updated 2.5M runs

Zero-Shot Text-Based Audio Editing Using DDPM Inversion

Updated 1.8K runs

Virtual dressing room

Updated 22.5K runs

Generate a model with a garment faster if you have a mask image

Updated 610 runs

Only makes segmentations for further processing

Updated 200 runs

Find out how similar Japanese sentences are

Updated 13 runs

Fast text-to-3D Gaussian generation by bridging 2D and 3D diffusion models

Updated 253 runs

Yuan2.0 is a new-generation LLM developed by IEIT System, with enhanced understanding of semantics, mathematics, reasoning, code, knowledge, and other aspects.

Updated 31 runs

Trajectory Consistency Distillation

Updated 576 runs

Removes silence from your audio

Updated 77 runs

A diffusion-based method to enhance visual consistency for I2V generation

Updated 3.2K runs

Rethinking Inductive Biases for Surface Normal Estimation

Updated 75 runs

AI Music Structure Analyzer + Stem Splitter using Demucs & Mdx-Net with Python-Audio-Separator

Updated 36.2K runs

Experimental & for non-commercial use only

Updated 6.6K runs

High-quality multilingual text-to-speech library

Updated 1.4K runs

DUSt3R: Geometric 3D Vision Made Easy

Updated 433 runs

Sentiment Analysis with Texts

Updated 4.9K runs

A wrapper around bel-tts

Updated 1.4K runs

Turn a face into a sticker

Updated 1.5M runs

Surya is a document OCR toolkit that does:

Updated 6K runs

Generates 3D assets from images

Updated 2.9K runs

SDXL lightning multi-controlnet, img2img & inpainting

Updated 9.4K runs

dreamshaper-xl-lightning is a Stable Diffusion model that has been fine-tuned on SDXL

Updated 121.4K runs

ProteusV0.4: The Style Update

Updated 111.5K runs

Lightweight multimodal model for visual question answering, reasoning and captioning

Updated 7.8K runs

Simple video chroma keying

Updated 51 runs

Multilingual E5-small language embedding model

Updated 52 runs

Multilingual E5-large language embedding model

Updated 66 runs

Multilingual E5-large language embedding model

Updated 539 runs

Tea Segmentation Demo

Updated 29 runs

Function calling LLM that surpasses the state-of-the-art in function calling capabilities

Updated 65 runs

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

Updated 117 runs

AnimateDiff video to video

Updated 654 runs

Segments an audio recording based on who is speaking

Updated 3K runs

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This is the SUPIR-v0F model and does NOT use LLaVA-13b.

Updated 16.5K runs

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This is the SUPIR-v0Q model and does NOT use LLaVA-13b.

Updated 110.9K runs

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This version uses LLaVA-13b for captioning.

Updated 187.8K runs

POC implementation of Depth-anything to produce a 3D SBS video

Updated 199 runs

E5-mistral-7b-instruct language embedding model

Updated 645 runs

Merge two images together with a prompt

Updated 6.3K runs

Honeycomb NLQ Generator

Updated 181 runs

ProteusV0.4: The Style Update - enhances stylistic capabilities, similar to Midjourney's approach, rather than advancing prompt comprehension

Updated 131.6K runs

hello-world from cog example

Updated 34 runs

A collection of anime Stable Diffusion models with VAEs and LoRAs.

Updated 3.7K runs