Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects.

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images.

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise.

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture, and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures, and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Updated 190 runs

Updated 213 runs

Updated 191 runs

https://civitai.com/models/833294

Updated 27.6K runs

Rembg implementation with mask output

Updated 45 runs

Updated 195 runs

Janus-Pro is a novel autoregressive framework for multimodal understanding

Updated 10.7K runs

Generate music with YuE-s1-7B (English, chain of thought model)

Updated 851 runs

Test deployment of OuteTTS 500M

Updated 500 runs

Interior Design with RealVisXL V5.0-Lightning and ControlNet to generate photorealistic, high-resolution interior designs.

Updated 625 runs

Ultimate anime-themed finetuned SDXL model and the latest installment of the Animagine XL series

Updated 646 runs

Interior Design with RealVisXL V5.0 and ControlNet (Depth & Union SDXL ProMax) to generate photorealistic, high-resolution interior designs with enhanced depth and structure.

Updated 541 runs

STAR Video Upscaler: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Updated 414 runs

Updated 527 runs

Takes audio (mp3) and a "source-of-truth" audio transcript (string) as input and returns precise timestamps.

Updated 795 runs

Updated 604 runs

A demo model for a guide I'm working on...

Updated 7 runs

DeepSeek-R1 distilled on LLaMA3.3 70B and quantized by ollama

Updated 18 runs

Updated 892 runs

Updated 231 runs

DeepSeek-R1 distilled on LLaMA 8B

Updated 523 runs

Updated 168 runs

Updated 142 runs

Updated 221 runs

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 12.3M runs

The VALL-E models by Amphion.

Updated 574 runs

The Vevo model by Amphion.

Updated 271 runs

Customise your hair with AI. Swap hair with anyone, or copy anyone's hair color.

Updated 414 runs

Upscale videos + images with BSRGAN

Updated 3.1K runs

The MaskGCT model by Amphion.

Updated 234 runs

deepseek-ai/deepseek-r1

A reasoning model trained with reinforcement learning, on par with OpenAI o1

Updated 1.1M runs

The Fish Speech V1.2 SFT model.

Updated 240 runs

The Fish Speech V1.5 model.

Updated 199 runs

The Fish Speech V1.4 model.

Updated 212 runs

Adapted to add multi-LoRA support for schnell as well: https://replicate.com/lucataco/flux-dev-multi-lora

Updated 2.5K runs

The Fish Speech V1.2 model.

Updated 242 runs

Cavalry 1 is a hello world model.

Updated 7 runs

Run any ComfyUI workflow on an A100. Guide: https://github.com/fofr/cog-comfyui

Updated 14.9K runs

The Fish Speech V1.0 model.

Updated 182 runs

Create a dotted waveform video from an audio file

Updated 38 runs

Updated 177 runs

The small version of the Bark model by Suno.

Updated 226 runs

The Bark model by Suno.

Updated 229 runs

Updated 64 runs

Remove image backgrounds with a custom model for better results.

Updated 2K runs

A state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from text descriptions

Updated 89.7K runs

SoTA zero-shot voice cloning and TTS model

Updated 886 runs

Hunyuan-Video LoRA Explorer + Trainer

Updated 38.1K runs

The NaturalSpeech2 model by Amphion.

Updated 216 runs