Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Official models

Official models are always on, maintained, and have predictable pricing.

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

DOVER video quality assessment tool, assigning videos both aesthetic and technical quality scores

Updated 27 runs

Generate product photography backgrounds using Stable Diffusion

Updated 536 runs

DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation. Hologram optimized

Updated 347 runs

Transfer learning models for music classification by genres, moods, and instrumentation

Updated 10.5K runs

🫦 Realistic facial expression manipulation (lip-syncing) using audio or video

Updated 1.1K runs

Super fast clothing (and face) segmentation and masking with erosion and dilation capability, made for https://outfit.fm

Updated 17.2K runs

The best Pony-SDXL models! Current one is based on Pony Realism.

Updated 104.8K runs

Interior Decoration Space Scaling - First Use Case

Updated 66 runs

A tiny model for testing out Cog

Updated 1.1K runs

Create images of a given character in different poses

Updated 1M runs

Real-Time High Quality Lip Synchronization with Latent Space Inpainting

Updated 2.6K runs

Combines 10 MP4 files into 1

Updated 72 runs

An improved outpainting model that supports LoRA URLs. This model uses PatchMatch to improve the mask quality.

Updated 89.1K runs

Llama-3-8B finetuned with ReFT to hyperfocus on New Jersey, the Garden State, the best state, the only state!

Updated 107 runs

🎙️Hololive text-to-speech and voice-to-voice (Japanese🇯🇵 + English🇬🇧)

Updated 866 runs

Upscaler and detailer for a selected area

Updated 4.8K runs

Convert an LLM's code output into image generation

Updated 1.9K runs

epiCRealism v7-Final Destination. Top Realism Model on Civitai

Updated 1.8K runs

blue_pencil-XL meets ANIMAGINE XL 3.0 / ANIMAGINE XL 3.1, the top-ranked model on Civitai

Updated 4K runs

An implementation of ChatTTS as a Cog model.

Updated 3.1K runs

Stylized Audio-Driven Single Image Talking Face Animation

Updated 134.8K runs

Recreate images with Emojis

Updated 203 runs

Fast and High-Quality Text-to-video Generation

Updated 4.6K runs

A PhotoBooth style transfer workflow that utilizes IPadapter Style, Canny, OpenPose, RemoveBackground, HumanSegmentation, Cloth Segmentation for initial input, and concludes with the application of DeepFake techniques.

Updated 191 runs

AI Photorealistic Image Ultra-Resolution, Restoration and Upscale!

Updated 85.1K runs

SDXL LoRA finetuned on spectrograms of Beethoven songs

Updated 19 runs

Transform an empty room into a fabulous interior design

Updated 29.1K runs

Guided Text to Speech Generator

Updated 401 runs

viⓍTTS (vixTTS) is a voice generation model that lets you clone a voice into different languages using just a quick 6-second audio clip

Updated 492 runs

Given an image of a face, it generates full images from a given prompt

Updated 418 runs

Jina Turbo Reranker that is small but performant

Updated 19 runs

SDXL based text-to-image model applying Distribution Matching Distillation, supporting zero-shot identity generation in 2-5s. https://ai-visionboard.com

Updated 5.8M runs

Fast SDXL with higher quality

Updated 803.2K runs

for test

Updated 256 runs

A text-to-image generative AI model that creates beautiful images

Updated 80M runs

A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Updated 335 runs

Use a face to make images. Uses SDXL fine-tuned checkpoints.

Updated 213K runs

This is the phi-3-vision model, billed by run time. Have fun!

Updated 14.5K runs

Convert a story into Stable Diffusion prompt format

Updated 40 runs

OpenAI Whisper model on A100 hardware

Updated 50 runs

Image tagger fine-tuned on WaifuDiffusion w/ (SwinV2, SwinV2, ConvNext, and ViT)

Updated 1K runs

✍️✨Prompts to auto-magically relight your images

Updated 455.9K runs

Replicate implementation of the work by Shanglin Li et al., "ZONE: Zero-Shot Instruction-Guided Local Editing"

Updated 152 runs

🖼️✨Background images + prompts to auto-magically relight your images (+normal maps🗺️)

Updated 12.1K runs