Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects.

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

This model classifies weather conditions based on images. It uses a Convolutional Neural Network (CNN) trained on various weather phenomena to predict the weather condition of a given image.

Updated 8 runs

Towards OCR-2.0 via a Unified End-to-end Model

Updated 27 runs

DeepSeek's first generation reasoning models with comparable performance to OpenAI-o1

Updated 1.4K runs

Inpaint anything with automatic mask generation

Updated 259 runs

minimax/video-01

Generate 6s videos with prompts or images. Also known as Hailuo. Use a subject reference with the S2V-01 model to make a video featuring a character.

Updated 436.3K runs

FLUX Dev Model (Text2Img and Img2Img)

Updated 5K runs

Robust face restoration algorithm for old photos / AI-generated faces

Updated 41.8M runs

FLUX Schnell Model (Text2Img and Img2Img)

Updated 13.7K runs

The power of FLUX in a model trained on VITON-HD for virtual try-on, covering categories such as upper body, lower body, and full-body dresses

Updated 3.6K runs

Vocal separator, a musician's companion

Updated 1.6K runs

https://civitai.com/models/317902

Updated 179K runs

Change the strength of the prompt to enable editing style and content. Recommendation: keep the seed constant and tune the strength.

Updated 318 runs

This model allows changing the strength of the Redux image prompt, which lets the text prompt have a stronger effect. It is particularly useful for taking content from the provided image and applying style or editing changes from the prompt.

Updated 1.5K runs

Microsoft's tool to convert Office documents, PDFs, images, audio, and more to LLM-ready markdown.

Updated 27.6K runs

Real-ESRGAN super-resolution to upsample low-resolution satellite imagery.

Updated 54 runs

Updated 33 runs

Updated 18 runs

recraft-ai/recraft-creative-upscale

Creative Upscale focuses on enhancing details and refining complex elements in the image. It doesn’t just increase resolution but adds depth by improving textures, fine details, and facial features.

Updated 3K runs

recraft-ai/recraft-crisp-upscale

Designed to make images sharper and cleaner, Crisp Upscale increases overall quality, making visuals suitable for web use or print-ready materials.

Updated 42.2K runs

Updated 42 runs

SVFR: A Unified Framework for Generalized Video Face Restoration

Updated 451 runs

playht/play-dialog

End-to-end AI speech model designed for natural-sounding conversational speech synthesis, with support for context-aware prosody, intonation, and emotional expression.

Updated 17.7K runs

Image generation. Added: inpaint_strength, loras_custom_urls

Updated 296.5K runs

Simple tool to merge together separate video snippets

Updated 252 runs

allenai/OLMo-2-1124-13B-Instruct, text generation model

Updated 100 runs

Refinement module to improve satellite-derived shorelines

Updated 4 runs

2025 fork of closed Coqui XTTS-v2: Multilingual Text To Speech Voice Clone

Updated 372 runs

kwaivgi/kling-v1.6-standard

Generate 5s and 10s videos in 720p resolution

Updated 356.4K runs

A powerful 3D asset generation model

Updated 112.5K runs

Cog implementation of LTX video from its diffusers pipeline

Updated 65 runs

Cog implementation of LTX image to video from its diffusers pipeline

Updated 110 runs

Island Segmentation!

Updated 14 runs

SoTA depth estimation

Updated 580 runs

SDXL Canny controlnet with LoRA support.

Updated 393.5K runs

test

Updated 16 runs

Whisper model that can be used for adding domain-specific words

Updated 31.3K runs

Kokoro is a frontier TTS model for its size of 82 million parameters (text in/audio out).

Updated 327 runs

Updated 50 runs

LoRA Inference for hunyuanvideo-community/HunyuanVideo finetunes

Updated 77 runs

Updated 186 runs

LTX-Video is the first DiT-based video generation model capable of generating high-quality videos in real-time. It produces 24 FPS videos at a 768x512 resolution faster than they can be watched.

Updated 80K runs

Fine-tune HunyuanVideo LoRAs with kohya-ss/musubi-tuner

Updated 85 runs

Updated 2.2K runs

Updated 28 runs

Simple tool to merge a foreground and background image

Updated 1.6K runs

State-of-the-art background removal (Bria v2.0)

Updated 57.3K runs

Convert musubi-tuner LoRA to ComfyUI compatible format

Updated 44 runs

Fine-tune HunyuanVideo via a-r-r-o-w/finetrainers (Work In Progress)

Updated 52 runs

Microsoft's Florence 2 Base

Updated 242 runs