Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

API for enhanced word-level timestamp accuracy using OpenAI's Whisper model

Updated 1.3K runs

Add a watermark to your videos using the power of Replicate brought to you from your friends at FullJourney.AI

Updated 158.3K runs

Train your own custom RVC model

Updated 196.2K runs

Deliberate V5 Model (Text2Img, Img2Img and Inpainting)

Updated 14.8K runs

Counterfeit XL v2 Model (Text2Img, Img2Img and Inpainting)

Updated 33.7K runs

Edit real or generated images

Updated 1.3K runs

Edit real or generated images

Updated 2.5K runs

Simple model to make addition and answer is send to supabase

Updated 22 runs

highist resolutioin image

Updated 92 runs

Juggernaut XL v7 Model (Text2Img, Img2Img and Inpainting)

Updated 334.4K runs

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Updated 55.9K runs

Generate color codes for prominent colors in the image

Updated 160 runs

Source: chargoddard/loyal-piano-m7 ✦ Quant: TheBloke/loyal-piano-m7-AWQ ✦ Intended to be a roleplay-focused model with some smarts and good long-context recall

Updated 42 runs

Notus-7b-v1 model

Updated 126 runs

A pipeline for superfast video editing! Make cuts to a video by editing its transcript.

Updated 731 runs

Juggernaut Aftermath Model with original TRCVAE (Text2Img, Img2Img and Inpainting)

Updated 2.5K runs

Starling-LM-7B-alpha

Updated 47 runs

PixArt-Alpha 1024px is a transformer-based text-to-image diffusion system trained on text embeddings from T5

Updated 74.4K runs

Deliberate V4 Model (Text2Img, Img2Img and Inpainting)

Updated 1.2K runs

Generates multi-view optical illusions

Updated 1.3K runs

DemoFusion: Democratising High-Resolution Image Generation With No 💰

Updated 9.2K runs

Updated 384 runs

Generates subtitles

Updated 403 runs

Separate instruments and/or vocals from any song.

Updated 966 runs

Source: fblgit/juanako-7b-UNA ✦ Quant: TheBloke/juanako-7B-UNA-AWQ ✦ juanako uses UNA, Uniform Neural Alignment. A training technique that ease alignment between transformer layers yet to be published

Updated 39 runs

Simple binary sentiment analysis with BERT

Updated 338 runs

VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing in the Wild

Updated 3K runs

Tempo BPM estimation with Essentia

Updated 798 runs

Source: berkeley-nest/Starling-LM-7B-alpha ✦ Quant: TheBloke/Starling-LM-7B-alpha-AWQ ✦ An open large language model (LLM) trained by Reinforcement Learning from AI Feedback (RLAIF)

Updated 59.7K runs

SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Updated 655 runs

powerful open-source visual language model

Updated 1.5M runs

InterpAny-Clearer: Clearer anytime frame interpolation & Manipulated interpolation

Updated 11.5K runs

SDXL Model (Text2Img, Img2Img and Inpainting)

Updated 1.9K runs

Generate music in stereo, restricted to chord sequences and tempo

Updated 3.2K runs

Source: monology/openinstruct-mistral-7b ✦ Quant: TheBloke/openinstruct-mistral-7B-AWQ ✦ Commercially-usable 7B model, based on mistralai/Mistral-7B-v0.1 and finetuned on VMware/open-instruct

Updated 295 runs

Coqui XTTS-v2: Multilingual Text To Speech Voice Cloning

Updated 947.6K runs

Source: TokenBender/evolvedSeeker_1_3 ✦ Quant: TheBloke/evolvedSeeker_1_3-AWQ ✦ A fine-tuned version of deepseek-ai/deepseek-coder-1.3b-base on 50k instructions for 3 epochs

Updated 29 runs

A1111 webui api

Updated 3.1K runs

Generate texture for your mesh with text prompts

Updated 1.2K runs

LCM AOM3 Superfast

Updated 232 runs

Updated 1.5K runs

Updated 107 runs

URPM V1.3 Model (Text2Img, Img2Img and Inpainting)

Updated 1.1K runs

Deliberate V3 Model (Text2Img, Img2Img and Inpainting)

Updated 372 runs

Controlnet v1.1 - Tile Version

Updated 4K runs

Updated 8.1K runs

Kandinsky 2.2 Model (Text2Img, Img2Img and Inpainting)

Updated 842 runs

Implementation of the latest Stable Video Diffusion model in Cog for inference on Replicate

Updated 495 runs