Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Get started Learn more

Featured models

bytedance / seedream-3

A text-to-image model with support for native high-resolution (2K) image generation

691 runs

bytedance / seedance-1-pro

A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution

2.5K runs

bytedance / seedance-1-lite

A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution

10.5K runs

kwaivgi / kling-v2.1

Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)

11.1K runs

google / veo-3

Sound on: Google’s flagship Veo 3 text to video model, with audio

87.2K runs

google / imagen-4-ultra

Use this ultra version of Imagen 4 when quality matters more than speed and cost

44.6K runs

replicate / fast-flux-trainer

Train subjects or styles faster than ever

12.3K runs

black-forest-labs / flux-kontext-pro

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

3.9M runs

black-forest-labs / flux-kontext-max

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts

1.5M runs

Official models

Official models are always on, maintained, and have predictable pricing.

black-forest-labs / flux-schnell-lora

The fastest image generation model tailored for fine-tuned use

1.8M runs

openai / gpt-4.1-nano

Use LLMs

21.3K runs

openai / gpt-4.1-mini

Use LLMs

769.2K runs

openai / gpt-4o

Use LLMs

102.9K runs

meta / llama-guard-4-12b

46 runs

openai / gpt-4o-mini

Use LLMs

1.5M runs

resemble-ai / chatterbox

Generate speech

5.6K runs

kwaivgi / kling-v2.1-master

Generate videos

3.4K runs

kwaivgi / kling-v2.1

Generate videos

11.1K runs

minimax / video-01

Generate videos

519.9K runs

minimax / video-01-live

Generate videos

124.9K runs

minimax / video-01-director

Generate videos

38.7K runs

resemble-ai / chatterbox-pro

Generate speech

846 runs

google / veo-3

Generate videos

87.2K runs

anthropic / claude-4-sonnet

Use LLMs

163.6K runs

luma / reframe-image

Change the aspect ratio of any photo using AI (not cropping)

2.3K runs

luma / reframe-video

Change the aspect ratio of any video up to 30 seconds long, outputs will be 720p

2K runs

View all official models

I want to…

Generate images

Models that generate images from text prompts

Generate videos

Models that create and edit videos

Edit images

Tools for editing images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Generate speech

Convert text to speech

Transcribe speech

Models that convert speech to text

Use LLMs

Models that can understand and generate text

Caption videos

Models that generate text from videos

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Generate music

Models to generate and modify music

Caption images

Models that generate text from images

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Use handy tools

Toolbelt-type models for videos and images.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Extract text from images

Optical character recognition (OCR) and text extraction

Chat with images

Ask language models about images

Sing with voices

Voice-to-voice cloning and musical prosody

Get embeddings

Models that generate embeddings from inputs

Use a face to make images

Make realistic images of people instantly

Remove backgrounds

Models that remove backgrounds from images and videos

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 3 months, 1 week ago 1B runs

prunaai/flux.1-dev

This is the fastest Flux Dev endpoint in the world, contact us for more at pruna.ai

Updated 1 week, 5 days ago 1.8M runs

adirik/grounding-dino

Detect everything with language!

Updated 1 year, 8 months ago 9.5M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 2 years, 3 months ago 92.2M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 9 months ago 32.4M runs

openai/whisper

Convert speech in audio to text

Updated 6 months, 4 weeks ago 94.5M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 4 months, 3 weeks ago 31.8M runs

philz1337x/clarity-upscaler

High resolution image Upscaler and Enhancer. Use at ClarityAI.co. A free Magnific alternative. Twitter/X: @philz1337x

Updated 1 year ago 15.9M runs

Latest models

chenxwh/nova-t2v

Autoregressive Video Generation without Vector Quantization

Updated 5 months, 4 weeks ago 39 runs

subhash25rawat/flawless-text

Flawless Text is a high-precision text-to-image model that generates typo-free, visually accurate images from text descriptions, ideal for seamless, error-free creative workflows.

Updated 5 months, 4 weeks ago 1.5K runs

chenxwh/nova-t2i

Autoregressive Image Generation without Vector Quantization

Updated 5 months, 4 weeks ago 15 runs

lucataco/modernbert-large

ModernBERT-large is a modernized bidirectional encoder-only Transformer model (BERT-style) pre-trained on 2 trillion tokens of English and code data

Updated 5 months, 4 weeks ago 90 runs

lucataco/modernbert-base

ModernBERT-base is a modernized bidirectional encoder-only Transformer model (BERT-style) pre-trained on 2 trillion tokens of English and code data

Updated 5 months, 4 weeks ago 73 runs

chenxwh/cosyvoice2-0.5b

Scalable Streaming Speech Synthesis with Large Language Models

Updated 5 months, 4 weeks ago 5.5K runs

fire/flux

Updated 6 months ago 39 runs

foundationvision/infinity

Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Updated 6 months ago 391 runs

meta/llama-guard-3-11b-vision

A Llama-3.2-11B pretrained model, fine-tuned for content safety classification

Updated 6 months ago 1.5K runs

ardianfe/demucs-prod

sound separation with demucs

Updated 6 months ago 47.9K runs

alexgenovese/flux-sd3-flow-edit

FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models

Updated 6 months ago 431 runs

ahm3texe/test999

Updated 6 months ago 5 runs

meta/llama-guard-3-8b

A Llama-3.1-8B pretrained model, fine-tuned for content safety classification

Updated 6 months ago 58.6K runs

meta/llamaguard-7b

A 7B parameter Llama 2-based input-output safeguard model

Updated 6 months ago 22 runs

daanelson/flux-fill-dev-big

Image inpainting with flux

Updated 6 months ago 70 runs

lucataco/qwen2-vl-7b-instruct

Latest model in the Qwen family for chatting with video and image models

Updated 6 months ago 150.2K runs

851-labs/background-remover

Remove backgrounds from images.

Updated 6 months ago 3.5M runs

ibm-granite/granite-3.1-8b-instruct

Granite-3.1-8B-Instruct is a lightweight and open-source 8B parameter model is designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

Updated 6 months ago 765.6K runs

ibm-granite/granite-3.1-2b-instruct

Granite-3.1-2B-Instruct is a lightweight and open-source 2B parameter model designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

Updated 6 months ago 9.1K runs

jzhang38/fast-hunyuan-video

Fast Hunyuan Video by Hao AI Lab

Updated 6 months ago 439 runs

ryan5453/demucs

Demucs is an audio source separator created by Facebook Research.

Updated 6 months ago 466K runs

ahm3texe/blur

Açıklama Testi

Updated 6 months ago 9 runs

arthur630-tech/mob

Updated 6 months, 1 week ago 1.2K runs

jzhang38/fast-mochi

Fast Mochi by Hao AI Lab

Updated 6 months, 1 week ago 60 runs

minimax/music-01

Quickly generate up to 1 minute of music with lyrics and vocals in the style of a reference track

Updated 6 months, 1 week ago 251.7K runs

toanbarcelona1998/dream_shaper8

Updated 6 months, 1 week ago 9 runs

lucataco/ollama-llama3.2-vision-90b

Ollama Llama 3.2 Vision 90B

Updated 6 months, 1 week ago 3K runs

vetkastar/comfy-flux

comfy with flux model,

Updated 6 months, 1 week ago 173K runs

lucataco/ollama-llama3.2-vision-11b

Ollama Llama 3.2 Vision 11B

Updated 6 months, 1 week ago 1.9K runs

lucataco/ollama-qwq

Ollama QwQ 32B

Updated 6 months, 1 week ago 56 runs

asyasyarif/ootd_masking

Clothing segmentation tool that generates masks from outfit images, separating them into top and bottom pieces with automatic background removal and edge refinement.

Updated 6 months, 1 week ago 81 runs

lucataco/ollama-llama3.3-70b

Ollama Llama 3.3 70B

Updated 6 months, 1 week ago 16.7K runs

jyoung105/hyper-sdxl

Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis

Updated 6 months, 1 week ago 123 runs

lucataco/apollo-7b

Apollo 7B - An Exploration of Video Understanding in Large Multimodal Models

Updated 6 months, 1 week ago 47.2K runs

lucataco/apollo-3b

Apollo 3B - An Exploration of Video Understanding in Large Multimodal Models

Updated 6 months, 1 week ago 119 runs

lucataco/rembg-video

Video Background Removal

Updated 6 months, 1 week ago 2K runs

turian/arxiv-llm-text

Prepare arXiv papers for processing by Large Language Models (LLMs) by converting them into a single, expanded LaTeX file.

Updated 6 months, 1 week ago 19 runs

zsyoaoa/invsr

Arbitrary-steps Image Super-resolution via Diffusion Inversion

Updated 6 months, 1 week ago 3.8K runs

lucataco/bulk-video-caption

Video Preprocessing tool for captioning multiple videos using GPT, Claude or Gemini

Updated 6 months, 1 week ago 120 runs

lucataco/video-split

Simple tool to split apart a video into snippets

Updated 6 months, 1 week ago 126 runs

subhash25rawat/logo-in-context

Create ads for marketing, social media with your own company logo on any object you want.

Updated 6 months, 1 week ago 358 runs

genmoai/mochi-1-lora-trainer

a-r-r-o-w/cogvideox-factory for Mochi-1 LoRA Training

Updated 6 months, 1 week ago 28 runs

zsxkib/instant-id

Make realistic images of real people instantly

Updated 6 months, 2 weeks ago 908.8K runs

zsxkib/hunyuan-video2video

A state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from text descriptions

Updated 6 months, 2 weeks ago 2.6K runs

impetusdesign/rqi-txc-itp

Updated 6 months, 2 weeks ago 360 runs

zurk/hunyuan-video-8bit

Hunyuan Video 8bit model API for video generation

Updated 6 months, 2 weeks ago 238 runs

chenxwh/nitrofusion

High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training

Updated 6 months, 2 weeks ago 156 runs

nvidia/sana

A fast image model with wide artistic range and resolutions up to 4096x4096

Updated 6 months, 2 weeks ago 160.7K runs

lucataco/moondream-0.5b

Moondream 0.5B, the world's smallest vision language model

Updated 6 months, 2 weeks ago 53 runs

qubit999/qwen2.5-coder-32b-instruct

The Qwen2.5-Coder-32B-Instruct is a state-of-the-art, open-source large language model (LLM). It is specifically designed for coding tasks and is part of the Qwen2.5-Coder series, featuring 32 billion parameters.

Updated 6 months, 2 weeks ago 83 runs