Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Get started Learn more

Featured models

bytedance / seedream-3

A text-to-image model with support for native high-resolution (2K) image generation

1.3K runs

bytedance / seedance-1-pro

A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution

3.2K runs

bytedance / seedance-1-lite

A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution

10.8K runs

kwaivgi / kling-v2.1

Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)

11.9K runs

google / veo-3

Sound on: Google’s flagship Veo 3 text to video model, with audio

87.7K runs

google / imagen-4-ultra

Use this ultra version of Imagen 4 when quality matters more than speed and cost

45.7K runs

replicate / fast-flux-trainer

Train subjects or styles faster than ever

12.5K runs

black-forest-labs / flux-kontext-pro

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

4M runs

black-forest-labs / flux-kontext-max

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts

1.5M runs

Official models

Official models are always on, maintained, and have predictable pricing.

black-forest-labs / flux-schnell-lora

The fastest image generation model tailored for fine-tuned use

1.8M runs

openai / gpt-4.1-nano

Use LLMs

21.4K runs

openai / gpt-4.1-mini

Use LLMs

769.2K runs

openai / gpt-4o

Use LLMs

103.1K runs

meta / llama-guard-4-12b

50 runs

openai / gpt-4o-mini

Use LLMs

1.5M runs

resemble-ai / chatterbox

Generate speech

5.9K runs

kwaivgi / kling-v2.1-master

Generate videos

3.5K runs

kwaivgi / kling-v2.1

Generate videos

11.9K runs

minimax / video-01

Generate videos

520.3K runs

minimax / video-01-live

Generate videos

125.1K runs

minimax / video-01-director

Generate videos

38.8K runs

resemble-ai / chatterbox-pro

Generate speech

861 runs

google / veo-3

Generate videos

87.7K runs

View all official models

I want to…

Generate images

Models that generate images from text prompts

Generate videos

Models that create and edit videos

Edit images

Tools for editing images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Generate speech

Convert text to speech

Transcribe speech

Models that convert speech to text

Use LLMs

Models that can understand and generate text

Caption videos

Models that generate text from videos

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Generate music

Models to generate and modify music

Caption images

Models that generate text from images

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Use handy tools

Toolbelt-type models for videos and images.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Extract text from images

Optical character recognition (OCR) and text extraction

Chat with images

Ask language models about images

Sing with voices

Voice-to-voice cloning and musical prosody

Get embeddings

Models that generate embeddings from inputs

Use a face to make images

Make realistic images of people instantly

Remove backgrounds

Models that remove backgrounds from images and videos

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 3 months, 1 week ago 1B runs

prunaai/flux.1-dev

This is the fastest Flux Dev endpoint in the world, contact us for more at pruna.ai

Updated 1 week, 5 days ago 1.8M runs

vaibhavs10/incredibly-fast-whisper

whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗

Updated 1 year, 4 months ago 8.6M runs

openai/whisper

Convert speech in audio to text

Updated 6 months, 4 weeks ago 94.5M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 4 months, 3 weeks ago 31.8M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 2 years, 3 months ago 92.2M runs

adirik/grounding-dino

Detect everything with language!

Updated 1 year, 8 months ago 9.6M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 9 months ago 32.4M runs

Latest models

qubit999/qwen2.5-coder-32b-instruct

The Qwen2.5-Coder-32B-Instruct is a state-of-the-art, open-source large language model (LLM). It is specifically designed for coding tasks and is part of the Qwen2.5-Coder series, featuring 32 billion parameters.

Updated 6 months, 2 weeks ago 83 runs

luma/photon-flash

Accelerated variant of Photon prioritizing speed while maintaining quality

Updated 6 months, 2 weeks ago 94K runs

luma/photon

High-quality image generation model optimized for creative professional workflows and ultra-high fidelity outputs

Updated 6 months, 2 weeks ago 936.6K runs

qubit999/llama3.2-3b-instruct

The Llama 3.2 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction-tuned generative models in 1B and 3B sizes (text in/text out).

Updated 6 months, 2 weeks ago 30 runs

mattheum/flux-multi-pulid-controlnet

Hey, this is a fork of flux pulid to support multiple ids, use with a depth map and define bounding boxes for each face

Updated 6 months, 3 weeks ago 2.5K runs

haiper-ai/haiper-video-2

Generate 4s and 6s videos from a prompt or image

Updated 6 months, 3 weeks ago 10.8K runs

ulissemini/gpt2-xl-actadd

Updated 6 months, 3 weeks ago 385 runs

panjianning/gotcha

flux.1-dev: hyper-sd 8 steps + instanx ip adataper + pulid + depth controlnet

Updated 6 months, 3 weeks ago 228 runs

genmoai/mochi-1-lora

A version of mochi-1 (a text to video model) that supports fine-tuned lora inference

Updated 6 months, 3 weeks ago 100 runs

pku-yuangroup/llava-cot

Let Vision Language Models Reason Step-by-Step

Updated 6 months, 3 weeks ago 40 runs

genmoai/mochi-1

Mochi 1 preview is an open video generation model with high-fidelity motion and strong prompt adherence in preliminary evaluation

Updated 6 months, 3 weeks ago 2.8K runs

pollinations/flux-schnell-svdquant

SVDQuant Optimized Flux.Schnell

Updated 6 months, 3 weeks ago 30 runs

lucataco/smolvlm-instruct

SmolVLM-Instruct by HuggingFaceTB

Updated 6 months, 3 weeks ago 1.1K runs

toanbarcelona1998/animatediff-lightning-4-step_gif

AnimateDiff-Lightning: Cross-Model Diffusion Distillation

Updated 6 months, 3 weeks ago 46 runs

replicate/hello-concurrency

Updated 6 months, 3 weeks ago 526 runs

zsxkib/jina-clip-v2

Jina-CLIP v2: 0.9B multimodal embedding model with 89-language multilingual support, 512x512 image resolution, and Matryoshka representations

Updated 6 months, 4 weeks ago 216.1K runs

zsxkib/samurai

SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory

Updated 6 months, 4 weeks ago 151 runs

tmappdev/lang-segment-anything

Segment Anything with prompts

Updated 6 months, 4 weeks ago 1.4M runs

openai/whisper

Convert speech in audio to text

Updated 6 months, 4 weeks ago 94.5M runs

asiryan/anima-pencil-xl-v5

Anima Pencil XL v5 Model (Text2Img, Img2Img and Inpainting)

Updated 6 months, 4 weeks ago 18.8K runs

ardianfe/stable-audio-2

music generation with fine tuned stable audio

Updated 7 months ago 8.4K runs

ardianfe/stable-audio-prod

cerate music with open source

Updated 7 months ago 69K runs

chenxwh/ltx-video

DiT-based video generation model for generating high-quality videos in real-time

Updated 7 months ago 3.2K runs

asiryan/pencil-xl-v2

Pencil XL v2 Model (Text2Img, Img2Img and Inpainting)

Updated 7 months ago 4.6K runs

asiryan/unlimited-xl

Unlimited XL Model (Text2Img, Img2Img and Inpainting)

Updated 7 months ago 30.4K runs

andreasjansson/flux-schnell-redux-layers

Updated 7 months ago 1.4K runs

shreejalmaharjan-27/tiktok-short-captions

Generate Tiktok-Style Captions powered by Whisper (GPU)

Updated 7 months ago 54.2K runs

tmappdev/img2watermarkmask

A model using microsoft/Florence-2-large to create mask of watermarked images

Updated 7 months ago 60 runs

crdbello/cblabs

Updated 7 months ago 318 runs

jyoung105/playground-v2.5

Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in Text-to-Image Generation

Updated 7 months ago 53K runs

jyoung105/playground-v2

Playground v2.0: A diffusion-based text-to-image generation model trained from scratch by the research team at Playground

Updated 7 months ago 60 runs

jyoung105/kolors

Kolors: Effective Training of Diffusion Model for Photorealistic Text-to-Image Synthesis

Updated 7 months ago 78 runs

jyoung105/cogview-v3-plus

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion

Updated 7 months ago 19 runs

jyoung105/stable-cascade

Würstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models

Updated 7 months ago 71 runs

jyoung105/auraflow-v3

AuraFlow: Fully open-sourced flow-based text-to-image generation model

Updated 7 months ago 121 runs

jyoung105/auraflow-v2

AuraFlow: Fully open-sourced flow-based text-to-image generation model

Updated 7 months ago 30 runs

jyoung105/auraflow-v1

AuraFlow: Fully open-sourced flow-based text-to-image generation model

Updated 7 months ago 19 runs

jyoung105/perflow

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator

Updated 7 months ago 15 runs

jyoung105/tcd-sdxl

Trajectory Consistency Distillation: Improved Latent Consistency Distillation by Semi-Linear Consistency Function with Trajectory Mapping

Updated 7 months ago 21 runs

jyoung105/lightning-sdxl

SDXL-Lightning: Progressive Adversarial Diffusion Distillation

Updated 7 months ago 38 runs

jyoung105/flash-sdxl

Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Updated 7 months ago 35 runs

jyoung105/dmd2

Improved Distribution Matching Distillation for Fast Image Synthesis

Updated 7 months ago 81 runs

jyoung105/sdxl-turbo

Adversarial Diffusion Distillation

Updated 7 months ago 126 runs

jyoung105/slam

Updated 7 months ago 4 runs

jyoung105/pcm

Phased Consistency Model

Updated 7 months ago 30 runs

jyoung105/lcm

Latent Consistency Models: Synthesizing High-Resolution Images with Few-step Inference

Updated 7 months ago 125 runs

lee101/fast-vfx

Template Project for Running Fast Video Effects all on the GPU with fast GPU encoding and Decoding

Updated 7 months ago 18 runs

asiryan/2dn-xl

2DN XL Model (Text2Img, Img2Img and Inpainting)

Updated 7 months ago 230 runs

asiryan/mistoon-anime-xl

Mistoon Anime XL Model (Text2Img, Img2Img and Inpainting)

Updated 7 months ago 22.4K runs

asiryan/realism-xl

Realism XL Model (Text2Img, Img2Img and Inpainting)

Updated 7 months ago 278.8K runs

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Featured models

bytedance / seedream-3

bytedance / seedance-1-pro

bytedance / seedance-1-lite

kwaivgi / kling-v2.1

google / veo-3

google / imagen-4-ultra

replicate / fast-flux-trainer

black-forest-labs / flux-kontext-pro

black-forest-labs / flux-kontext-max

Official models

bytedance / seedream-3

black-forest-labs / flux-dev-lora

black-forest-labs / flux-dev

black-forest-labs / flux-schnell

bytedance / seedance-1-pro

bytedance / seedance-1-lite

black-forest-labs / flux-schnell-lora

openai / gpt-4.1-nano

openai / gpt-4.1-mini

openai / gpt-4o

meta / llama-guard-4-12b

openai / gpt-4o-mini

resemble-ai / chatterbox

kwaivgi / kling-v2.1-master

kwaivgi / kling-v2.1

minimax / video-01

minimax / video-01-live

minimax / video-01-director

resemble-ai / chatterbox-pro

google / veo-3

I want to…

Popular models

Latest models