Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Zero shot Sound separation by arbitrary query samples

Updated 48.5K runs

Multi-Axis MLP for Image Processing

Updated 503.8K runs

Detect and simplify the contours of a binary image

Updated 232 runs

Super-resolves an LR video frame (ultra-wide) using a reference video frame (wide-angle)

Updated 14.3K runs

Homage to the Pixel: text prompt to 6 color squares

Updated 9.4K runs

[CVPR 2022] Unsupervised Image-to-Image Translation with Generative Prior

Updated 1.2K runs

bare pixray for API use

Updated 11.6K runs

Design Your Hair by Text and Reference Image

Updated 290.3K runs

A Steerable Model for Bach Chorales Generation

Updated 844 runs

One-shot (any-to-any) Voice Conversion

Updated 6.3K runs

Online demo for "Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation"

Updated 1.9K runs

A model for testing pydantic cog that yields images one word at a time.

Updated 128 runs

Image Style Transfer with Text Condition

Updated 25.9K runs

Synthesize drawings to match a text prompt

Updated 5.6K runs

A model for testing pydantic cog that generates images.

Updated 410 runs

A test model that generates Haiku (and yields output one word a time)

Updated 106 runs

Clip-Guided Diffusion Model for Image Generation

Updated 4.5K runs

GLIDE-text2im w/ humans and experimental style prompts.

Updated 9.2K runs

Updated 565 runs

A fork of pixray/pixray for trying out Cog's new Predictor API

Updated 59 runs

Grad-CAM visualizations for Align before Fuse

Updated 3.7K runs

Updated 75 runs

GLIDE from OpenAI finetuned on roughly 30M more samples. See `laionide-v3` for the latest.

Updated 3.8K runs

Transcribes piano audio and makes it into a cool video

Updated 218 runs

Masked-attention Mask Transformer for Universal Image Segmentation

Updated 658 runs

Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement

Updated 358 runs

Updated 94 runs

Updated 147 runs

Generate Pokemons with Projected GAN

Updated 9.7K runs

Text-Guided Image Generation and Manipulation

Updated 824 runs

A Single Model for Many Visual Modalities

Updated 247 runs

Supervised Weakly from hashtAGs

Updated 294 runs

Classify numerical digits.

Updated 115 runs

democratizing automatic music transcription

Updated 3K runs

Uses pixray to generate an image from text prompt

Updated 148.4K runs

Training-Free Text-to-Image Generation

Updated 2.4K runs

Updated 1.7K runs

Updated 750 runs

Image generation from Wav2CLIP through VQGAN-CLIP

Updated 896 runs

Guide Style Transfer with CLIP loss

Updated 1.3K runs

Baseline models demo of the IEEE L3DAS22 Challenge

Updated 228 runs

Simple example of a Cog model that produces Markdown output

Updated 23 runs

Huan's first cog with Replicate.AI

Updated 35 runs

Deep Halftoning with Reversible Binary Pattern

Updated 364 runs

Guide a StyleGAN3 trained on pictures of mannequins with CLIP.

Updated 850 runs

Guesses your age based on a photo

Updated 421 runs

Age prediction using CLIP

Updated 528 runs

Selfie to anime

Updated 3.1K runs

Generates pokemon sprites from prompt

Updated 4.9K runs

Disentangled face manipulation using CLIP-based annotations

Updated 1.8K runs