Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects.

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Synthetic image detection and model identification

Updated 1.4K runs

Fork of https://replicate.com/zsxkib/ic-light that allows any image resolution

Updated 15.5K runs

Upscale pictures and videos using the nunif repo (formerly waifu2x).

Updated 764 runs

PrometheusV1 is presumed to be the first full-rank fine-tune of Playground v2.5

Updated 169 runs

Updated 78 runs

ProteusV0.5 is the latest full release built as a sophisticated enhancement over OpenDalleV1.1

Updated 3.3K runs

meta/meta-llama-3.1-405b-instruct

Meta's flagship 405 billion parameter language model, fine-tuned for chat completions

Updated 5.4M runs

dolly-v2-12b, just for testing

Updated 16 runs

Ultra high resolution images (up to 4096x4096) based on Stable Cascade

Updated 8.4K runs

Seamlessly create stunning product shots by blending with inspirational references for a fresh, modern look

Updated 409 runs

Detect hate speech or toxic comments in tweets/texts

Updated 111.8K runs

Kolors with style transfer, composition transfer and other IPAdapter techniques

Updated 17.6K runs

Largest completely open-source flow-based generation model that is capable of text-to-image generation

Updated 7.5K runs

A large-scale text-to-image generation model based on latent diffusion, developed by the Kuaishou Kolors team

Updated 30.4K runs

Visual instruction tuning towards large language and vision models with GPT-4 level capabilities

Updated 25.8M runs

Generate seamless 360 photos using SDXL

Updated 401 runs

Face Restoration

Updated 4.5K runs

stability-ai/stable-diffusion-3

A text-to-image model with greatly improved performance in image quality, typography, complex prompt understanding, and resource-efficiency

Updated 1.6M runs

MimicMotion: High-quality human motion video generation with pose-guided control

Updated 2.4K runs

Remove backgrounds from retailer product images

Updated 60 runs

Make realistic images of real people instantly (w/ ip-adapter-plus-face_sdxl_vit-h)

Updated 4K runs

Updated 80 runs

PixArt Sigma 900M is a text-to-image generation model based on the PixArt Sigma architecture

Updated 2.2K runs

Updated 46.5K runs

araby.ai oneshot video faceswap

Updated 20.2K runs

MARS5, a fully open-source (commercially usable) voice-cloning/TTS model with breakthrough prosody and realism.

Updated 661 runs

for backsound

Updated 111 runs

Audio to SRT

Updated 29 runs

Cog wrapper for Ollama llama3:70b

Updated 6.6K runs

Cog wrapper for Ollama llama3:8b

Updated 14 runs

Input a video. Ask anything about it

Updated 3.5K runs

YOLOv10: Real-Time End-to-End Object Detection

Updated 241 runs

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Updated 426 runs

Take audio from one video and add it to a second video. Good for adding back audio to liveportrait.

Updated 203 runs

Change the fps of a video without changing its length or speed

Updated 102 runs

Portrait animation using a driving video source

Updated 80.2K runs

Efficient Portrait Animation with Stitching and Retargeting Control

Updated 1.2K runs

Kolors is a SOTA base image model for high quality image generation

Updated 1.2K runs

Updated 14 runs

Updated 91 runs

Updated 38 runs

The API automatically detects objects in an input image and returns their positional and mask information.

Updated 4.1K runs

Create music for your content

Updated 461.8K runs

Updated 393 runs

Mama ママ 2.0 Shinsei Galverse Anime-themed text-to-image model

Updated 2.4K runs

InternLM2.5 has open-sourced a 7 billion parameter base model and a chat model tailored for practical scenarios.

Updated 58 runs

Create videos from illustrated input images

Updated 48.7K runs