Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts, such as specific styles, characters, or objects, from a small set of example images. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Featured models

bytedance / seedance-1-pro

A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution

1.3K runs

bytedance / seedance-1-lite

A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution

9.2K runs

kwaivgi / kling-v2.1

Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)

9.3K runs

google / veo-3

Sound on: Google’s flagship Veo 3 text-to-video model, with audio

85.3K runs

google / imagen-4-ultra

Use this ultra version of Imagen 4 when quality matters more than speed and cost

42.1K runs

replicate / fast-flux-trainer

Train subjects or styles faster than ever

12K runs

black-forest-labs / flux-kontext-pro

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

3.8M runs

black-forest-labs / flux-kontext-max

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts

1.5M runs

ideogram-ai / ideogram-v3-turbo

Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles

267.9K runs

Official models

Official models are always on, maintained, and have predictable pricing.

black-forest-labs / flux-schnell-lora

The fastest image generation model tailored for fine-tuned use

1.7M runs

bytedance / seedance-1-pro

A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution

1.3K runs

luma / reframe-image

Change the aspect ratio of any photo using AI (not cropping)

2.3K runs

luma / reframe-video

Change the aspect ratio of any video up to 30 seconds long; outputs are 720p

2K runs

openai / o1

Use LLMs

14.4K runs

I want to…

Generate images

Models that generate images from text prompts

Generate videos

Models that create and edit videos

Edit images

Tools for editing images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Generate speech

Convert text to speech

Transcribe speech

Models that convert speech to text

Use LLMs

Models that can understand and generate text

Caption videos

Models that generate text from videos

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Generate music

Models to generate and modify music

Caption images

Models that generate text from images

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Use handy tools

Toolbelt-type models for videos and images.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Extract text from images

Optical character recognition (OCR) and text extraction

Chat with images

Ask language models about images

Sing with voices

Voice-to-voice cloning and musical prosody

Get embeddings

Models that generate embeddings from inputs

Use a face to make images

Make realistic images of people instantly

Remove backgrounds

Models that remove backgrounds from images and videos

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 3 months, 1 week ago 1B runs

prunaai/flux.1-dev

This is the fastest Flux Dev endpoint in the world. Contact us for more at pruna.ai

Updated 1 week, 4 days ago 1.6M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 4 months, 3 weeks ago 31.7M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 9 months ago 32.3M runs

851-labs/background-remover

Remove backgrounds from images.

Updated 6 months ago 3.5M runs

openai/whisper

Convert speech in audio to text

Updated 6 months, 4 weeks ago 94.4M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 2 years, 3 months ago 92.1M runs

krthr/clip-embeddings

Generate CLIP (clip-vit-large-patch14) text & image embeddings

Updated 1 year, 10 months ago 39.6M runs

Latest models

rocketdigitalai/animagine-xl-4.0

Ultimate anime-themed finetuned SDXL model and the latest installment of the Animagine XL series

Updated 4 months, 3 weeks ago 754 runs

rocketdigitalai/interior-design-sdxl

Interior Design with RealVisXL V5.0 and ControlNet (Depth & Union SDXL ProMax) to generate photorealistic, high-resolution interior designs with enhanced depth and structure.

Updated 4 months, 3 weeks ago 2K runs

zsxkib/star

STAR Video Upscaler: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Updated 4 months, 3 weeks ago 591 runs

ttsds/openvoice_2

Updated 4 months, 3 weeks ago 758 runs

cureau/force-align-wordstamps

Takes audio (mp3) and a "source-of-truth" audio transcript (string) as input and returns precise timestamps.

Updated 4 months, 3 weeks ago 1.6K runs

ttsds/metavoice

Updated 4 months, 3 weeks ago 640 runs

zeke/ai-ci-cd-example

A demo model for a guide I'm working on...

Updated 4 months, 3 weeks ago 8 runs

edoproch/deepseekr1-distilled-llama-70b-ollama

DeepSeek-R1 distilled on LLaMA3.3 70B and quantized by ollama

Updated 4 months, 3 weeks ago 24 runs

ttsds/f5

Updated 4 months, 3 weeks ago 2.3K runs

ttsds/fishspeech_1_1_large

Updated 4 months, 3 weeks ago 232 runs

edoproch/deepseekr1-distilled-llama-8b-ollama

DeepSeek-R1 distilled on LLaMA 8B

Updated 4 months, 3 weeks ago 564 runs

ttsds/hierspeechpp_1_1

Updated 4 months, 3 weeks ago 256 runs

ttsds/hierspeechpp_1

Updated 4 months, 3 weeks ago 143 runs

ttsds/hierspeechpp_lt460

Updated 4 months, 3 weeks ago 224 runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 4 months, 3 weeks ago 31.7M runs

ttsds/amphion_valle

The VALL-E models by Amphion.

Updated 4 months, 3 weeks ago 616 runs

ttsds/amphion_vevo

The Vevo model by Amphion.

Updated 4 months, 3 weeks ago 493 runs

subhash25rawat/custom-hair

Customise your hair with AI. Swap hair with anyone, copy anyone's hair color.

Updated 4 months, 3 weeks ago 507 runs

zsxkib/bsrgan

Upscale videos + images with BSRGAN

Updated 4 months, 3 weeks ago 4.2K runs

ttsds/amphion_maskgct

The MaskGCT model by Amphion.

Updated 4 months, 3 weeks ago 466 runs

deepseek-ai/deepseek-r1

A reasoning model trained with reinforcement learning, on par with OpenAI o1

Updated 4 months, 3 weeks ago 1.4M runs

ttsds/fishspeech_1_2_sft

The Fish Speech V1.2 SFT model.

Updated 4 months, 3 weeks ago 246 runs

ttsds/fishspeech_1_5

The Fish Speech V1.5 model.

Updated 4 months, 3 weeks ago 559 runs

ttsds/fishspeech_1_4

The Fish Speech V1.4 model.

Updated 4 months, 3 weeks ago 214 runs

jensbosseparra/flux1-schnell-multi-lora

Adapted to add multi-LoRA support for schnell as well: https://replicate.com/lucataco/flux-dev-multi-lora

Updated 4 months, 3 weeks ago 2.5K runs

ttsds/fishspeech_1_2

The Fish Speech V1.2 model.

Updated 4 months, 3 weeks ago 245 runs

meltred/cavalry-1

Cavalry 1 is a hello world model.

Updated 4 months, 3 weeks ago 8 runs

fofr/any-comfyui-workflow-a100

Run any ComfyUI workflow on an A100. Guide: https://github.com/fofr/cog-comfyui

Updated 4 months, 3 weeks ago 18.3K runs

ttsds/fishspeech_1_0

The Fish Speech V1.0 model.

Updated 4 months, 3 weeks ago 183 runs

lucataco/dotted-waveform-visualizer

Create a dotted waveform video from an audio file

Updated 4 months, 4 weeks ago 57 runs

ttsds/e2

Updated 4 months, 4 weeks ago 254 runs

ttsds/bark_small

The small version of the Bark model by Suno.

Updated 4 months, 4 weeks ago 229 runs

ttsds/bark

The Bark model by Suno.

Updated 4 months, 4 weeks ago 485 runs

wzesk/littoral_draw_refine

Updated 4 months, 4 weeks ago 72 runs

fottoai/remove-bg

Remove image backgrounds with a custom model for better results.

Updated 4 months, 4 weeks ago 4.4K runs

wynncjf/musicgen_balearic_house_finetune

Updated 4 months, 4 weeks ago 54 runs

tencent/hunyuan-video

A state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from text descriptions

Updated 5 months ago 106.4K runs

kjjk10/llasa-3b-long

SoTA Zero Shot Voice Cloning and TTS model

Updated 5 months ago 1.1K runs

zsxkib/hunyuan-video-lora

Hunyuan-Video LoRA Explorer + Trainer

Updated 5 months ago 42.4K runs

ttsds/amphion_naturalspeech2

The NaturalSpeech2 model by Amphion.

Updated 5 months ago 239 runs

czeslov/weather-classification

This model classifies weather conditions based on images. It uses a Convolutional Neural Network (CNN) trained on various weather phenomena to predict the weather condition of a given image.

Updated 5 months ago 8 runs