Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Get started Learn more

Featured models

bytedance / seedance-1-pro

A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution

1.8K runs

bytedance / seedance-1-lite

A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution

9.9K runs

kwaivgi / kling-v2.1

Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)

10.2K runs

google / veo-3

Sound on: Google’s flagship Veo 3 text to video model, with audio

86.3K runs

google / imagen-4-ultra

Use this ultra version of Imagen 4 when quality matters more than speed and cost

43.3K runs

replicate / fast-flux-trainer

Train subjects or styles faster than ever

12.2K runs

black-forest-labs / flux-kontext-pro

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

3.8M runs

black-forest-labs / flux-kontext-max

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts

1.5M runs

ideogram-ai / ideogram-v3-turbo

Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles

269.7K runs

Official models

Official models are always on, maintained, and have predictable pricing.

bytedance / seedream-3

A text-to-image model with support for native high-resolution (2K) image generation

27 runs

bytedance / seedance-1-pro

Generate videos

1.8K runs

bytedance / seedance-1-lite

Generate videos

9.9K runs

black-forest-labs / flux-schnell-lora

The fastest image generation model tailored for fine-tuned use

1.7M runs

luma / reframe-image

Change the aspect ratio of any photo using AI (not cropping)

2.3K runs

luma / reframe-video

Change the aspect ratio of any video up to 30 seconds long, outputs will be 720p

2K runs

View all official models

I want to…

Generate images

Models that generate images from text prompts

Generate videos

Models that create and edit videos

Edit images

Tools for editing images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Generate speech

Convert text to speech

Transcribe speech

Models that convert speech to text

Use LLMs

Models that can understand and generate text

Caption videos

Models that generate text from videos

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Generate music

Models to generate and modify music

Caption images

Models that generate text from images

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Use handy tools

Toolbelt-type models for videos and images.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Extract text from images

Optical character recognition (OCR) and text extraction

Chat with images

Ask language models about images

Sing with voices

Voice-to-voice cloning and musical prosody

Get embeddings

Models that generate embeddings from inputs

Use a face to make images

Make realistic images of people instantly

Remove backgrounds

Models that remove backgrounds from images and videos

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 3 months, 1 week ago 1B runs

prunaai/flux.1-dev

This is the fastest Flux Dev endpoint in the world, contact us for more at pruna.ai

Updated 1 week, 4 days ago 1.7M runs

adirik/grounding-dino

Detect everything with language!

Updated 1 year, 8 months ago 9.5M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 4 months, 3 weeks ago 31.8M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 9 months ago 32.4M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 2 years, 3 months ago 92.2M runs

openai/whisper

Convert speech in audio to text

Updated 6 months, 4 weeks ago 94.4M runs

philz1337x/clarity-upscaler

High resolution image Upscaler and Enhancer. Use at ClarityAI.co. A free Magnific alternative. Twitter/X: @philz1337x

Updated 1 year ago 15.9M runs

Latest models

jhorovitz/flux-schnell-redux-control

Change the strength of the prompt to enable editing style and content. Recommendation: keep the seed constant and tune the strength.

Updated 5 months, 1 week ago 370 runs

jhorovitz/flux-dev-redux-control

This models allow changing the strength of the Redux image prompt, which allows the text prompt to have a stronger effect. It is particularly useful at taking content from the provided image and applying style or editing changes from the prompt.

Updated 5 months, 1 week ago 1.9K runs

cuuupid/markitdown

Microsoft's tool to convert Office documents, PDFs, images, audio, and more to LLM-ready markdown.

Updated 5 months, 1 week ago 48.4K runs

wzesk/littoral_upsample

REAL-ESRGAN superresolution to upsample low resolution satellite imagery.

Updated 5 months, 1 week ago 57 runs

andreasjansson/fn-hello

Updated 5 months, 1 week ago 35 runs

andreasjansson/fn-upcase

Updated 5 months, 1 week ago 19 runs

recraft-ai/recraft-creative-upscale

Creative Upscale focuses on enhancing details and refining complex elements in the image. It doesn’t just increase resolution but adds depth by improving textures, fine details, and facial features.

Updated 5 months, 1 week ago 4.7K runs

recraft-ai/recraft-crisp-upscale

Designed to make images sharper and cleaner, Crisp Upscale increases overall quality, making visuals suitable for web use or print-ready materials.

Updated 5 months, 1 week ago 104.2K runs

zsxkib/stable-video-face-restoration

SVFR: A Unified Framework for Generalized Video Face Restoration

Updated 5 months, 1 week ago 545 runs

zeeshaan28/solovision

State-of-the-art human detection and tracking system integrated with REID

Updated 5 months, 1 week ago 19 runs

playht/play-dialog

End-to-end AI speech model designed for natural-sounding conversational speech synthesis, with support for context-aware prosody, intonation, and emotional expression.

Updated 5 months, 1 week ago 26.1K runs

vetkastar/fooocus

Image generation, Added: inpaint_strength loras_custom_urls

Updated 5 months, 1 week ago 326.6K runs

lucataco/video-merge

Simple tool to merge together separate video snippets

Updated 5 months, 1 week ago 441 runs

saysharastuff/olmo-2-1124-13b-instruct

allenai/OLMo-2-1124-13B-Instruct, text generation model

Updated 5 months, 1 week ago 115 runs

wzesk/littoral_refine

refinement module to improve satellite derived shorelines

Updated 5 months, 2 weeks ago 5 runs

bzikst/xtts-v2-fork

2025 fork of closed Coqui XTTS-v2: Multilingual Text To Speech Voice Clone

Updated 5 months, 2 weeks ago 405 runs

georgedavila/cog-ltx-video

Cog implementation of LTX video from its diffusers pipeline

Updated 5 months, 2 weeks ago 70 runs

georgedavila/ltx-img2vid

Cog implementation of LTX image to video from its diffusers pipeline

Updated 5 months, 2 weeks ago 119 runs

wzesk/littoral_segment

Island Segmentation!

Updated 5 months, 2 weeks ago 15 runs

kjjk10/lotus-diffusion-dense-prediction

SoTA depth estimation

Updated 5 months, 2 weeks ago 599 runs

pnyompen/sdxl-controlnet-lora-small

SDXL Canny controlnet with LoRA support.

Updated 5 months, 2 weeks ago 393.6K runs

zj7730/test-music

test

Updated 5 months, 2 weeks ago 16 runs

sabuhigr/sabuhi-model-v2

Whisper Model that can be use for adding domain-specific words

Updated 5 months, 2 weeks ago 33K runs

kjjk10/kokoro-82m

Kokoro is a frontier TTS model for its size of 82 million parameters (text in/audio out).

Updated 5 months, 2 weeks ago 810 runs

gougouccnu/stable-audio-open-1.0

Updated 5 months, 2 weeks ago 51 runs

lucataco/hunyuanvideo-community-lora

LoRA Inference for hunyuanvideo-community/HunyuanVideo finetunes

Updated 5 months, 2 weeks ago 78 runs

orbin-ahmed/interior_v2

Updated 5 months, 2 weeks ago 192 runs

lightricks/ltx-video

LTX-Video is the first DiT-based video generation model capable of generating high-quality videos in real-time. It produces 24 FPS videos at a 768x512 resolution faster than they can be watched.

Updated 5 months, 2 weeks ago 109.6K runs

lucataco/musubi-tuner

Finetune HunyuanVideo LoRAs with kohya-ss/musibi-tuner

Updated 5 months, 2 weeks ago 85 runs

razvandrl/subtitler

Updated 5 months, 2 weeks ago 2.2K runs

teaglis-kury/sana

Updated 5 months, 2 weeks ago 28 runs

lucataco/merge-img

Simple tool to merge a foreground and background image

Updated 5 months, 2 weeks ago 2K runs

lucataco/musubi-tuner-lora-converter

Convert musubi-tuner LoRA to ComfyUI compatible format

Updated 5 months, 3 weeks ago 47 runs

lucataco/hunyuanvideo-lora-trainer

Fine-tune HunyuanVideo via a-r-r-o-w/finetrainers (Work In Progress)

Updated 5 months, 3 weeks ago 53 runs

hiscodesmells/florence-2-base

Microsoft's Florence 2 Base

Updated 5 months, 3 weeks ago 246 runs

chenxwh/ominicontrol-subject

Minimal and Universal Control for Diffusion Transformer - demo for Subject-driven generation

Updated 5 months, 3 weeks ago 1.9K runs

chenxwh/ominicontrol-spatial

Minimal and Universal Control for Diffusion Transformer - demo for Spatially aligned control

Updated 5 months, 3 weeks ago 106 runs

declare-lab/tangoflux

Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization

Updated 5 months, 3 weeks ago 11.1K runs

orbin-ahmed/interior

Updated 5 months, 3 weeks ago 209 runs

chenxwh/onediffusion

One Diffusion to Generate Them All

Updated 5 months, 3 weeks ago 159 runs

scamai/upscaler

Upscale low resolution images to high resolution images

Updated 5 months, 3 weeks ago 3.8K runs

lucataco/flux-rf-inversion

Cog implementation of Diffusers Flux RFInversion Pipeline

Updated 5 months, 3 weeks ago 204 runs

scamai/deepfake-faceswap-detection

Detect deepfake faceswap image

Updated 5 months, 3 weeks ago 82 runs

scamai/faceswap

Swap the source face to target face

Updated 5 months, 3 weeks ago 839 runs

lucataco/hunyuanvideo

Unofficial community fork and Diffusers formatted weights of tencent/HunyuanVideo

Updated 5 months, 3 weeks ago 183 runs

chenxwh/deepseek-vl2

Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Updated 5 months, 3 weeks ago 1.1K runs

ahmdyassr/detect-crop-face

A simple model to detect and crop face found in image, made for https://outfit.fm

Updated 5 months, 3 weeks ago 7.9K runs

tomasruizt/apollo-7b-multiturn

Fork / Remix of Apollo 7B by Luis C. (https://replicate.com/lucataco/apollo-7b) to support multi-turn conversations.

Updated 5 months, 3 weeks ago 24 runs

lucataco/qvq-72b-preview

QVQ-72B-Preview by Qwen is an experimental research model focusing on enhancing visual reasoning capabilities

Updated 5 months, 4 weeks ago 272 runs

jschoormans/interior-v2

Remodels interior

Updated 5 months, 4 weeks ago 2.4K runs

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Featured models

bytedance / seedance-1-pro

bytedance / seedance-1-lite

kwaivgi / kling-v2.1

google / veo-3

google / imagen-4-ultra

replicate / fast-flux-trainer

black-forest-labs / flux-kontext-pro

black-forest-labs / flux-kontext-max

ideogram-ai / ideogram-v3-turbo

Official models

bytedance / seedream-3

bytedance / seedance-1-pro

bytedance / seedance-1-lite

black-forest-labs / flux-schnell-lora

openai / gpt-4.1-nano

openai / gpt-4.1-mini

openai / gpt-4o

meta / llama-guard-4-12b

openai / gpt-4o-mini

resemble-ai / chatterbox

kwaivgi / kling-v2.1-master

kwaivgi / kling-v2.1

minimax / video-01

minimax / video-01-live

minimax / video-01-director

resemble-ai / chatterbox-pro

google / veo-3

anthropic / claude-4-sonnet

luma / reframe-image

luma / reframe-video

I want to…

Popular models

Latest models