Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Get started Learn more

Featured models

bytedance / seedance-1-pro

A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution

1.7K runs

bytedance / seedance-1-lite

A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution

9.8K runs

kwaivgi / kling-v2.1

Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)

10.1K runs

google / veo-3

Sound on: Google’s flagship Veo 3 text to video model, with audio

86.2K runs

google / imagen-4-ultra

Use this ultra version of Imagen 4 when quality matters more than speed and cost

43.1K runs

replicate / fast-flux-trainer

Train subjects or styles faster than ever

12.1K runs

black-forest-labs / flux-kontext-pro

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

3.8M runs

black-forest-labs / flux-kontext-max

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts

1.5M runs

ideogram-ai / ideogram-v3-turbo

Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles

269.5K runs

Official models

Official models are always on, maintained, and have predictable pricing.

bytedance / seedream-3

A text-to-image model with support for native high-resolution (2K) image generation

23 runs

bytedance / seedance-1-pro

Generate videos

1.7K runs

bytedance / seedance-1-lite

Generate videos

9.8K runs

black-forest-labs / flux-schnell-lora

The fastest image generation model tailored for fine-tuned use

1.7M runs

luma / reframe-image

Change the aspect ratio of any photo using AI (not cropping)

2.3K runs

luma / reframe-video

Change the aspect ratio of any video up to 30 seconds long, outputs will be 720p

2K runs

View all official models

I want to…

Generate images

Models that generate images from text prompts

Generate videos

Models that create and edit videos

Edit images

Tools for editing images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Generate speech

Convert text to speech

Transcribe speech

Models that convert speech to text

Use LLMs

Models that can understand and generate text

Caption videos

Models that generate text from videos

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Generate music

Models to generate and modify music

Caption images

Models that generate text from images

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Use handy tools

Toolbelt-type models for videos and images.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Extract text from images

Optical character recognition (OCR) and text extraction

Chat with images

Ask language models about images

Sing with voices

Voice-to-voice cloning and musical prosody

Get embeddings

Models that generate embeddings from inputs

Use a face to make images

Make realistic images of people instantly

Remove backgrounds

Models that remove backgrounds from images and videos

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 3 months, 1 week ago 1B runs

prunaai/flux.1-dev

This is the fastest Flux Dev endpoint in the world, contact us for more at pruna.ai

Updated 1 week, 4 days ago 1.7M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 9 months ago 32.4M runs

adirik/grounding-dino

Detect everything with language!

Updated 1 year, 8 months ago 9.5M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 4 months, 3 weeks ago 31.8M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 2 years, 3 months ago 92.2M runs

851-labs/background-remover

Remove backgrounds from images.

Updated 6 months ago 3.5M runs

philz1337x/clarity-upscaler

High resolution image Upscaler and Enhancer. Use at ClarityAI.co. A free Magnific alternative. Twitter/X: @philz1337x

Updated 1 year ago 15.9M runs

Latest models

aisha-ai-official/perfect-rsb-mix-v15

Updated 4 months ago 67 runs

aisha-ai-official/realism-xl-v9

Updated 4 months ago 122 runs

aisha-ai-official/anillustrious-v3

Updated 4 months ago 451 runs

catacolabs/dis-background-removal

ECCV2022 Quick background removal

Updated 4 months ago 48 runs

aisha-ai-official/anillustrious-v2

A good anime merge from 12 other models

Updated 4 months ago 1.2K runs

aisha-ai-official/noobai-real-sdxl-v0.1

Updated 4 months ago 258 runs

aisha-ai-official/realism-xl-v11

Updated 4 months ago 224 runs

aisha-ai-official/animagine-xl-4.0

Great text-to-image model by Cagliostro Lab

Updated 4 months ago 3.4K runs

aisha-ai-official/cyber-realistic-pony-v8

Updated 4 months ago 815 runs

thomasmol/whisper-diarization

⚡️ Blazing fast audio transcription with speaker diarization | Whisper Large V3 Turbo | word & sentence level timestamps | prompt

Updated 4 months ago 1.6M runs

microsoft/omniparser-v2

OmniParser is a screen parsing tool to convert general GUI screen to structured elements.

Updated 4 months ago 59.2K runs

zf-kbot/inpaint-and-guess-prompt

Use a mask to inpaint the image or generate a prompt based on the mask.

Updated 4 months ago 76.4K runs

jhorovitz/omini-schnell

Place items in a scene without needing to train on them

Updated 4 months, 1 week ago 2.7K runs

jhorovitz/omini-dev

Cogified implementation of OminiControl

Updated 4 months, 1 week ago 75 runs

moonpig/dis-background-removal

Updated 4 months, 1 week ago 82 runs

mtg/music-arousal-valence

Regression of musical arousal and valence values

Updated 4 months, 1 week ago 8.8K runs

lucataco/step-audio-tts-3b

Step-Audio-TTS-3B represents the industry's first Text-to-Speech (TTS) model trained on a large-scale synthetic dataset utilizing the LLM-Chat paradigm

Updated 4 months, 1 week ago 1.1K runs

ocg2347/plksr-tiled-lowvram

Tiled inference implementation of PLKSR

Updated 4 months, 1 week ago 69 runs

ttsds/speecht5

Updated 4 months, 1 week ago 182 runs

lucataco/videollama3-7b

VideoLLaMA 3: Frontier Multimodal Foundation Models for Video Understanding

Updated 4 months, 1 week ago 2.4K runs

ostris/flex.1-alpha

Flex.1 alpha is a pre-trained base 8 billion parameter rectified flow transformer capable of generating images from text descriptions

Updated 4 months, 1 week ago 317 runs

tmappdev/change_video_bg

Change or Replace Video Background with any Image

Updated 4 months, 1 week ago 930 runs

jaaari/zonos

Zonos-v0.1 by Zyphra, voice cloning, 5 languages and emotion control

Updated 4 months, 1 week ago 1.5K runs

deepseek-ai/janus-pro-1b

Janus-Pro is a novel autoregressive framework for multimodal understanding

Updated 4 months, 1 week ago 6.7K runs

ttsds/pheme

Updated 4 months, 1 week ago 682 runs

anthropic/claude-3.5-haiku

Anthropic's fastest, most cost-effective model, with a 200K token context window (claude-3-5-haiku-20241022)

Updated 4 months, 1 week ago 1.5M runs

anthropic/claude-3.5-sonnet

Anthropic's most intelligent language model to date, with a 200K token context window and image understanding (claude-3-5-sonnet-20241022)

Updated 4 months, 1 week ago 486.5K runs

mmezhov/catvton-flux

Updated 4 months, 1 week ago 325 runs

subhash25rawat/morphix3d

Transform Images & Text into 3D Models with AI

Updated 4 months, 1 week ago 50 runs

deepseek-ai/deepseek-vl2

DeepSeek-VL2, an advanced series of large Mixture-of-Experts (MoE) Vision-Language Models that significantly improves upon its predecessor, DeepSeek-VL

Updated 4 months, 1 week ago 57.1K runs

deepseek-ai/deepseek-vl2-small

DeepSeek-VL2-small, an advanced series of large Mixture-of-Experts (MoE) Vision-Language Models that significantly improves upon its predecessor, DeepSeek-VL

Updated 4 months, 1 week ago 1K runs