Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Convert a set of frames to a video

Updated 1.6K runs

Diffusion-based semantic image editing with mask guidance

Updated 410 runs

Kandinsky 2.1 Diffusion Model

Updated 84.8K runs

SDRV_2.0

Updated 15K runs

Stable Diffusion fine tuned on Midjourney v4 images.

Updated 12M runs

Age prediction using CLIP - Patched version of `https://replicate.com/andreasjansson/clip-age-predictor` that works with the new version of cog!

Updated 196.6K runs

Persian (Farsi) Handwritten Digit Detector

Updated 90 runs

Transform your text into a beautiful two-tone color gradient that represents your emotions.

Updated 425 runs

A "Hello World" model for me to get to grips with `cog` and Replicate

Updated 45 runs

Updated 38 runs

Diffusion Models as Text Painters

Updated 1.8K runs

Prompt-free Diffusion

Updated 747 runs

Generate a new image from an input image with AbsoluteReality v1.0

Updated 325.6K runs

Generate a new image given any input text with AbsoluteReality v1.0

Updated 270.7K runs

Generate a new image from an input image with DreamShaper V6

Updated 164.4K runs

Generate a new image given any input text with DreamShaper V6

Updated 423.1K runs

Generate a new image from an input image with Babes 2.0

Updated 1.4M runs

Generate a new image from an input image with RPG V4

Updated 2.2K runs

Generate a new image from an input image with URPM v1.3

Updated 2.6K runs

Generate a new image from an input image with Deliberate v2

Updated 9.2K runs

Generate a new image from an input image with Edge Of Realism - EOR v2.0

Updated 578.9K runs

Generate a new image from an input image with Realistic Vision V2.0

Updated 55.1K runs

Generate a new image given any input text with Babes 2.0

Updated 26.8K runs

Generate a new image given any input text with RPG V4

Updated 58.5K runs

Generate a new image given any input text with URPM v1.3

Updated 54.8K runs

Generate a new image given any input text with Deliberate v2

Updated 655.4K runs

Generate a new image given any input text with Edge Of Realism - EOR v2.0

Updated 133.5K runs

Generate a new image given any input text with Realistic Vision V2.0

Updated 530.2K runs

This is a language model that can be used to obtain document embeddings suitable for downstream tasks like semantic search and clustering.

Updated 2.2M runs

This model can detect clothing using a custom state of the art clothing segmentation algorithm.

Updated 3.4K runs

This model is actually: prompthero / openjourney-v4

Updated 279 runs

Classification of music approachability and engagement

Updated 38.3K runs

An EfficientNet for music style classification by 400 styles from the Discogs taxonomy

Updated 190.9K runs

My own personal try of Stable Diffusion

Updated 41 runs

Updated 32.5K runs

Updated 41 runs

A multi-input ControlNet model. Pass in control images and set the weights.

Updated 253 runs

Generate subtitles (.srt and .vtt) from audio files using OpenAI's Whisper models.

Updated 5.2K runs

Generating Conditional 3D Implicit Functions

Updated 15.2K runs

Updated 27.7K runs

Generate Pokémon from a text description

Updated 7.9M runs

A model for text, audio, and image embeddings in one space

Updated 5.2M runs

music label

Updated 189 runs

image tagger

Updated 40.8M runs

ControlNet annotators - the initial image that is fed into a stable diffusion pipeline with ControlNet

Updated 352 runs

Detects tents in satellite images

Updated 36 runs

album cover generator

Updated 983 runs

Updated 250 runs