Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.


Featured models

black-forest-labs / flux-kontext-dev

Open-weight version of FLUX.1 Kontext

34.9K runs

bytedance / seedream-3

A text-to-image model with support for native high-resolution (2K) image generation

26.3K runs

bytedance / seedance-1-pro

A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution

15.8K runs

bytedance / seedance-1-lite

A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution

22.7K runs

kwaivgi / kling-v2.1

Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)

28.1K runs

google / veo-3

Sound on: Google’s flagship Veo 3 text-to-video model, with audio

98.7K runs

google / imagen-4-ultra

Use this ultra version of Imagen 4 when quality matters more than speed and cost

59.8K runs

black-forest-labs / flux-kontext-pro

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

5M runs

black-forest-labs / flux-kontext-max

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts

1.7M runs

Official models

Official models are always on, maintained, and have predictable pricing.

black-forest-labs / flux-schnell-lora

The fastest image generation model tailored for fine-tuned use

1.9M runs

openai / gpt-4.1-nano

Use LLMs

65.9K runs

openai / gpt-4.1-mini

Use LLMs

770.7K runs

openai / gpt-4o

Use LLMs

105.4K runs

meta / llama-guard-4-12b

66 runs

openai / gpt-4o-mini

Use LLMs

1.6M runs

resemble-ai / chatterbox

Generate speech

9.7K runs

kwaivgi / kling-v2.1-master

Generate videos

6.2K runs

kwaivgi / kling-v2.1

Generate videos

28.1K runs

minimax / video-01

Generate videos

525.4K runs

minimax / video-01-live

Generate videos

126.1K runs

minimax / video-01-director

Generate videos

40.2K runs

resemble-ai / chatterbox-pro

Generate speech

1.5K runs


I want to…

Generate images

Models that generate images from text prompts

Generate videos

Models that create and edit videos

Edit images

Tools for editing images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Generate speech

Convert text to speech

Transcribe speech

Models that convert speech to text

Use LLMs

Models that can understand and generate text

Caption videos

Models that generate text from videos

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Generate music

Models to generate and modify music

Caption images

Models that generate text from images

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Use handy tools

Toolbelt-type models for videos and images.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Extract text from images

Optical character recognition (OCR) and text extraction

Chat with images

Ask language models about images

Sing with voices

Voice-to-voice cloning and musical prosody

Get embeddings

Models that generate embeddings from inputs

Use a face to make images

Make realistic images of people instantly

Remove backgrounds

Models that remove backgrounds from images and videos

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 3 months, 1 week ago 1B runs

openai/whisper

Convert speech in audio to text

Updated 7 months ago 95.5M runs

prunaai/flux.1-dev

This is the fastest Flux Dev endpoint in the world. Contact us for more at pruna.ai

Updated 2 weeks, 3 days ago 3.2M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 2 years, 3 months ago 92.8M runs

beautyyuyanli/multilingual-e5-large

multilingual-e5-large: A multi-language text embedding model

Updated 1 year, 5 months ago 22.3M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 5 months ago 32.4M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 9 months ago 33M runs

fofr/any-comfyui-workflow

Run any ComfyUI workflow. Guide: https://github.com/replicate/cog-comfyui

Updated 4 days, 3 hours ago 4.6M runs

Latest models

cuuupid/glm-4v-9b

GLM-4V is a multimodal model released by Tsinghua University that is competitive with GPT-4o and establishes a new SOTA on several benchmarks, including OCR.

Updated 11 months, 4 weeks ago 88.2K runs

zsxkib/whisper-lazyloading

Convert speech in audio to text w/ `tiny`, `small`, `base`, and `large-v3` models

Updated 11 months, 4 weeks ago 129 runs

chenxwh/diffsynth-exvideo

Extended video synthesis model that generates 128 frames

Updated 11 months, 4 weeks ago 204 runs

skytells-research/focus

Image generation, Inpaint Strength, loras custom_urls and enhancer.

Updated 1 year ago 447 runs

chenxwh/depth-anything-v2

Depth estimation with faster inference speed, fewer parameters, and higher depth accuracy.

Updated 1 year ago 198K runs

suryakantk94/whiteclaw-db

Updated 1 year ago 20 runs

lucataco/hermes-2-pro-llama-3-70b

Hermes 2 Pro is an updated and cleaned version of the OpenHermes 2.5 Dataset, as well as a newly introduced Function Calling and JSON Mode dataset developed in-house

Updated 1 year ago 344 runs

ankur-singh/nexusraven-v2-13b

Best Open-Source Model for Function Calling

Updated 1 year ago 33 runs

pseudoram/rvc-v2

Speech to speech with any RVC v2 trained AI voice

Updated 1 year ago 817K runs

goodtome/hello

hello world

Updated 1 year ago 47 runs

google-deepmind/gemma2-27b-it

Google's Gemma2 27b instruct model

Updated 1 year ago 12.9K runs

zsxkib/aura-sr

AuraSR: GAN-based Super-Resolution for real-world

Updated 1 year ago 2.8K runs

google-deepmind/gemma2-9b-it

Google's Gemma2 9b instruct model

Updated 1 year ago 23.6K runs

omniedgeio/aquaaibase

Model

Updated 1 year ago 412 runs

lucataco/hunyuandit-v1.1

A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Updated 1 year ago 1.1K runs

lkincel/tinytales

Model that generates cartoon-like characters

Updated 1 year ago 770 runs

zeke/sd3-inpainting-with-differential-diffusion

Stable Diffusion 3 with Differential Diffusion inpainting (experimental)

Updated 1 year ago 271 runs

c-barron/owl-sam

Fork of https://replicate.com/schananas/grounded_sam that uses OwlV2 instead of Grounding Dino

Updated 1 year ago 3.7K runs

lucataco/florence-2-large

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Updated 1 year ago 169.6K runs

lucataco/florence-2-base

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Updated 1 year ago 70.1K runs

prakharsaxena24/2d-to-real-style

Updated 1 year ago 495 runs

zsxkib/qwen2-7b-instruct

Qwen 2: A 7 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

Updated 1 year ago 1.8K runs

zsxkib/qwen2-1.5b-instruct

Qwen 2: A 1.5 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

Updated 1 year ago 220 runs

platform-kit/mars5-tts

A novel speech model for insane prosody.

Updated 1 year ago 479 runs

zsxkib/qwen2-0.5b-instruct

Qwen 2: A 0.5 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

Updated 1 year ago 200 runs

ardianfe/musicgen-ft

Good for video teaser background music

Updated 1 year ago 62 runs

philz1337x/clarity-upscaler

High resolution image Upscaler and Enhancer. Use at ClarityAI.co. A free Magnific alternative. Twitter/X: @philz1337x

Updated 1 year ago 16.2M runs

buddhiraz/photomaker_ape_stylized

Updated 1 year ago 515 runs

zsxkib/sd3-controlnet

✨Stable Diffusion 3 w/ ⚡InstantX's Canny, Pose, and Tile ControlNets🖼️

Updated 1 year ago 1.3K runs

fofr/sd3-explorer

A model for experimenting with all the SD3 settings. Non-commercial use only, unless you have a Stability AI Self Hosted License.

Updated 1 year ago 32.2K runs

douwantech/gpt-sovits-train

Updated 1 year ago 181 runs

stackadoc/stable-audio-open-1.0

Stable Audio Open is an open-source model optimized for generating short audio samples, sound effects, and production elements using text prompts.

Updated 1 year ago 20.4K runs

fofr/sd3-with-chaos

Stable Diffusion 3 medium with added variability in outputs. Non-commercial use only, unless you have a Stability AI Self Hosted License.

Updated 1 year ago 20.2K runs

xavriley/sax_transcription

Transcribe saxophone solos directly from audio

Updated 1 year ago 202 runs

douwantech/musev

Updated 1 year ago 355 runs

franz-biz/yolo-world-xl

Real-Time Open-Vocabulary Object Detection using the xl weights

Updated 1 year ago 771.6K runs

charlesmccarthy/musicgen

MusicGen running on an a40 with 60 seconds max duration

Updated 1 year ago 1.2K runs

magpai-app/cog-puppeteer

Updated 1 year ago 177 runs

lucataco/mobius

Mobius, a diffusion model that pushes the boundaries of domain-agnostic debiasing and representation realignment

Updated 1 year ago 625 runs

turian/dover-video-quality-assessment

DOVER video quality assessment tool, assigning videos both aesthetic and technical quality scores

Updated 1 year ago 27 runs

dhanushreddy291/photo-background-generation

Generate Product photography backgrounds using Stable Diffusion

Updated 1 year ago 538 runs

mareksagan/dreamgaussian

DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation. Hologram optimized

Updated 1 year ago 350 runs

mtg/music-classifiers

Transfer learning models for music classification by genres, moods, and instrumentation

Updated 1 year ago 10.6K runs

zsxkib/v-express

🫦 Realistic facial expression manipulation (lip-syncing) using audio or video

Updated 1 year ago 1.1K runs

douwantech/musepose

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation.

Updated 1 year ago 977 runs

ahmdyassr/mask-clothing

Super fast clothing (and face) segmentation and masking with erosion and dilation capability, made for https://outfit.fm

Updated 1 year ago 17.7K runs

charlesmccarthy/pony-sdxl

The best Pony-SDXL models! Current one is based on Pony Realism.

Updated 1 year ago 110.4K runs

buddhiraz/chilloutmix-ni-pruned-fp32-fixx

Updated 1 year ago 186 runs

remodela-ai/scaling-model-v1

Interior Decoration Space Scaling - First Use Case

Updated 1 year ago 66 runs

zeke/hello-world

A tiny model for testing out Cog

Updated 1 year ago 1.1K runs
