Explore

I want to…

Use official models

Official models are always on, maintained, and have predictable pricing.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

SmolVLM-Instruct by HuggingFaceTB

Updated 29 runs

AnimateDiff-Lightning: Cross-Model Diffusion Distillation

Updated 45 runs

Updated 159 runs

Jina-CLIP v2: 0.9B multimodal embedding model with 89-language multilingual support, 512x512 image resolution, and Matryoshka representations

Updated 1.1K runs

Updated 19 runs

SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory

Updated 75 runs

Segment Anything with prompts

Updated 2.7K runs

Convert speech in audio to text

Updated 65.5M runs

Anima Pencil XL v5 Model (Text2Img, Img2Img and Inpainting)

Updated 5.5K runs

music generation with fine tuned stable audio

Updated 4.7K runs

cerate music with open source

Updated 65.2K runs

DiT-based video generation model for generating high-quality videos in real-time

Updated 2.8K runs

Pencil XL v2 Model (Text2Img, Img2Img and Inpainting)

Updated 4K runs

Unlimited XL Model (Text2Img, Img2Img and Inpainting)

Updated 10.9K runs

Updated 1.3K runs

Generate Tiktok-Style Captions powered by Whisper (GPU)

Updated 408 runs

A model using microsoft/Florence-2-large to create mask of watermarked images

Updated 26 runs

xl

Updated 304 runs

Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in Text-to-Image Generation

Updated 53K runs

Playground v2.0: A diffusion-based text-to-image generation model trained from scratch by the research team at Playground

Updated 53 runs

Kolors: Effective Training of Diffusion Model for Photorealistic Text-to-Image Synthesis

Updated 76 runs

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion

Updated 17 runs

Würstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models

Updated 67 runs

AuraFlow: Fully open-sourced flow-based text-to-image generation model

Updated 119 runs

AuraFlow: Fully open-sourced flow-based text-to-image generation model

Updated 20 runs

AuraFlow: Fully open-sourced flow-based text-to-image generation model

Updated 17 runs

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator

Updated 13 runs

Trajectory Consistency Distillation: Improved Latent Consistency Distillation by Semi-Linear Consistency Function with Trajectory Mapping

Updated 20 runs

SDXL-Lightning: Progressive Adversarial Diffusion Distillation

Updated 36 runs

Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Updated 33 runs

Improved Distribution Matching Distillation for Fast Image Synthesis

Updated 43 runs

Adversarial Diffusion Distillation

Updated 42 runs

Updated 3 runs

Phased Consistency Model

Updated 29 runs

Latent Consistency Models: Synthesizing High-Resolution Images with Few-step Inference

Updated 42 runs

Template Project for Running Fast Video Effects all on the GPU with fast GPU encoding and Decoding

Updated 14 runs

black-forest-labs/flux-depth-dev

Open-weight depth-aware image generation. Edit images while preserving spatial relationships.

Updated 64.5K runs

black-forest-labs/flux-canny-dev

Open-weight edge-guided image generation. Control structure and composition using Canny edge detection.

Updated 39.8K runs

black-forest-labs/flux-redux-schnell

Fast, efficient image variation model for rapid iteration and experimentation.

Updated 18.9K runs

black-forest-labs/flux-redux-dev

Open-weight image variation model. Create new versions while preserving key elements of your original.

Updated 51.5K runs

Uses DINO to detect regions and further refines them with SAM. Returns masking data as RLE encoded JSON.

Updated 173 runs

2DN XL Model (Text2Img, Img2Img and Inpainting)

Updated 227 runs

Mistoon Anime XL Model (Text2Img, Img2Img and Inpainting)

Updated 11.5K runs

Realism XL Model (Text2Img, Img2Img and Inpainting)

Updated 105.6K runs

Detects if a picture has anime face.

Updated 28K runs

Babes XL Model (Text2Img, Img2Img and Inpainting)

Updated 4.1K runs

The current model is used for graphics replacement processing

Updated 476.1K runs

Upload an image or video, and Video-LLaVa will give you a text description of what it "sees."

Updated 90 runs

without examination qwen2.5 32b

Updated 96 runs

FLUX.1 [dev] (LoRA) with several optimizations such as FP8 Quantization

Updated 73 runs