Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Get started Learn more

Featured models

bytedance / seedream-3

A text-to-image model with support for native high-resolution (2K) image generation

2.1K runs

bytedance / seedance-1-pro

A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution

3.7K runs

bytedance / seedance-1-lite

A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution

11.2K runs

kwaivgi / kling-v2.1

Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)

12.5K runs

google / veo-3

Sound on: Google’s flagship Veo 3 text to video model, with audio

88.2K runs

google / imagen-4-ultra

Use this ultra version of Imagen 4 when quality matters more than speed and cost

46.3K runs

replicate / fast-flux-trainer

Train subjects or styles faster than ever

12.6K runs

black-forest-labs / flux-kontext-pro

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

4M runs

black-forest-labs / flux-kontext-max

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts

1.5M runs

Official models

Official models are always on, maintained, and have predictable pricing.

black-forest-labs / flux-schnell-lora

The fastest image generation model tailored for fine-tuned use

1.8M runs

openai / gpt-4.1-nano

Use LLMs

22.6K runs

openai / gpt-4.1-mini

Use LLMs

769.5K runs

openai / gpt-4o

Use LLMs

103.1K runs

meta / llama-guard-4-12b

50 runs

openai / gpt-4o-mini

Use LLMs

1.6M runs

resemble-ai / chatterbox

Generate speech

6.1K runs

kwaivgi / kling-v2.1-master

Generate videos

3.5K runs

kwaivgi / kling-v2.1

Generate videos

12.5K runs

minimax / video-01

Generate videos

520.5K runs

minimax / video-01-live

Generate videos

125.2K runs

minimax / video-01-director

Generate videos

38.8K runs

resemble-ai / chatterbox-pro

Generate speech

864 runs

google / veo-3

Generate videos

88.2K runs

View all official models

I want to…

Generate images

Models that generate images from text prompts

Generate videos

Models that create and edit videos

Edit images

Tools for editing images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Generate speech

Convert text to speech

Transcribe speech

Models that convert speech to text

Use LLMs

Models that can understand and generate text

Caption videos

Models that generate text from videos

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Generate music

Models to generate and modify music

Caption images

Models that generate text from images

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Use handy tools

Toolbelt-type models for videos and images.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Extract text from images

Optical character recognition (OCR) and text extraction

Chat with images

Ask language models about images

Sing with voices

Voice-to-voice cloning and musical prosody

Get embeddings

Models that generate embeddings from inputs

Use a face to make images

Make realistic images of people instantly

Remove backgrounds

Models that remove backgrounds from images and videos

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 3 months, 1 week ago 1B runs

prunaai/flux.1-dev

This is the fastest Flux Dev endpoint in the world, contact us for more at pruna.ai

Updated 1 week, 5 days ago 1.9M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 4 months, 3 weeks ago 31.9M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 9 months ago 32.4M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 2 years, 3 months ago 92.3M runs

openai/whisper

Convert speech in audio to text

Updated 6 months, 4 weeks ago 94.5M runs

beautyyuyanli/multilingual-e5-large

multilingual-e5-large: A multi-language text embedding model

Updated 1 year, 5 months ago 22.2M runs

vaibhavs10/incredibly-fast-whisper

whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗

Updated 1 year, 4 months ago 8.6M runs

Latest models

deepseek-ai/deepseek-v3

DeepSeek-V3-0324 is the leading non-reasoning model, a milestone for open source

Updated 3 months ago 1.8M runs

goodguy1963/good-sdxl-models-plus-loras

BROKEN - DO NOT USE!

Updated 3 months ago 189 runs

cuuupid/idm-vton

Best-in-class clothing virtual try on in the wild (non-commercial use only)

Updated 3 months ago 845.7K runs

jichengdu/spark-tts

0.5B

Updated 3 months ago 209 runs

ttsds/gptsovits_1

Updated 3 months ago 241 runs

jichengdu/llasa

8B TTS

Updated 3 months ago 77 runs

recraft-ai/recraft-v3

Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis

Updated 3 months ago 4M runs

recraft-ai/recraft-v3-svg

Recraft V3 SVG (code-named red_panda) is a text-to-image model with the ability to generate high quality SVG images including logotypes, and icons. The model supports a wide list of styles.

Updated 3 months ago 157.6K runs

recraft-ai/recraft-20b-svg

Affordable and fast vector images

Updated 3 months ago 33.4K runs

recraft-ai/recraft-20b

Affordable and fast images

Updated 3 months ago 199.4K runs

ttsds/gptsovits_2

Updated 3 months ago 400 runs

ttsds/fishspeech_1_1

The Fish Speech V1.1 model.

Updated 3 months ago 190 runs

gfodor/text2vox

Generates MagicaVoxel VOX models, using flux dev + hunyuan3d-2. Can generate high detail and low detail models at varying resolutions.

Updated 3 months ago 112 runs

hardikdava/rf-detr

RF-DETR: SOTA Real-Time Object Detection Model

Updated 3 months ago 51 runs

goodguy1963/epicrealism-naturalsinfinal-byepinikion-v2

epicrealism-naturalsinfinal-SD1.5-by-epinikion + perfectdeliberate by Desync + More Details by Lykon

Updated 3 months ago 61 runs

aisha-ai-official/eternalchampond

Updated 3 months ago 862 runs

adriiita/photoshoot

Updated 3 months ago 109 runs

aisha-ai-official/miaomiao-harem-illustrious-v1

Updated 3 months ago 56K runs

aisha-ai-official/flux.1schnell-uncensored-rasch3

Updated 3 months ago 680 runs

aisha-ai-official/flux.1dev-uncensored-newreality-a2

Updated 3 months ago 1.5K runs

aisha-ai-official/flux.1dev-uncensored-msfluxnsfw-v3

Updated 3 months ago 1.7K runs

aisha-ai-official/flux.1dev-uncensored-colossus-v5

Updated 3 months ago 3K runs

aisha-ai-official/flux.1dev-uncensored-realreveal5

Updated 3 months ago 1.4K runs

lucataco/frame-extractor

Extract the first or last frame from any video file as a high-quality image

Updated 3 months ago 789 runs

lucataco/csm-1b

CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and audio inputs

Updated 3 months ago 490 runs

jichengdu/flux

flux-1.dev

Updated 3 months ago 20 runs

ttsds/voicecraft

Updated 3 months ago 501 runs

jichengdu/fish-speech

Fish Speech V1.5-SOTA Open Source TTS

Updated 3 months ago 412 runs

bytedance/sa2va-26b-image

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Updated 3 months ago 2.8K runs

aisha-ai-official/illust3relustion

Updated 3 months ago 141.9K runs

lucataco/orpheus-3b-0.1-ft

Orpheus 3B - high quality, emotive Text to Speech

Updated 3 months ago 17.2K runs

aisha-ai-official/anillustrious-v4

Updated 3 months ago 137.5K runs

bfirsh/concatenate-videos

Stitches videos together

Updated 3 months ago 117 runs

jichengdu/cosyvoice

CosyVoice2-0.5B-Scalable Streaming Speech Synthesis with Large Language Models

Updated 3 months ago 1K runs

abdulali025/andro-upscaler

A model Flux.1-dev-Controlnet-Upscaler by www.androcoders.in

Updated 3 months ago 482 runs

simbrams/ri

Realistic Inpainting with ControlNET (M-LSD + SEG)

Updated 3 months ago 506.8K runs

tencent/hunyuan3d-2mv

Hunyuan3D-2mv is finetuned from Hunyuan3D-2 to support multiview controlled shape generation.

Updated 3 months, 1 week ago 1.5K runs

bytedance/latentsync

LatentSync: generate high-quality lip sync animations

Updated 3 months, 1 week ago 45K runs

grace-raper/resnet-rot

detect correct orientation of images

Updated 3 months, 1 week ago 18 runs

bytedance/hyper-flux-8step

Hyper FLUX 8-step by ByteDance

Updated 3 months, 1 week ago 14.2M runs

google-deepmind/shieldgemma-2-4b-it

ShieldGemma 2 is a model trained on Gemma 3's 4B IT checkpoint for image safety classification across key categories that takes in images and outputs safety labels per policy.

Updated 3 months, 1 week ago 202 runs