Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Get started Learn more

Featured models

luma / reframe-video

Change the aspect ratio of any video up to 30 seconds long, outputs will be 720p

1.1K runs

google / imagen-4-fast

Use this fast version of Imagen 4 when speed and cost are more important than quality

6.4K runs

google / imagen-4-ultra

Use this ultra version of Imagen 4 when quality matters more than speed and cost

14.8K runs

google / imagen-4

Google's Imagen 4 flagship model

333.6K runs

replicate / fast-flux-trainer

Train subjects or styles faster than ever

7.8K runs

google / veo-3

Sound on: Google’s flagship Veo 3 text to video model, with audio

59.5K runs

black-forest-labs / flux-kontext-pro

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

2.2M runs

black-forest-labs / flux-kontext-max

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts

1.3M runs

ideogram-ai / ideogram-v3-turbo

Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles

210K runs

Official models

Official models are always on, maintained, and have predictable pricing.

anthropic / claude-4-sonnet

Claude Sonnet 4 is a significant upgrade to 3.7, delivering superior coding and reasoning while responding more precisely to your instructions

104.8K runs

luma / reframe-image

Change the aspect ratio of any photo using AI (not cropping)

286 runs

luma / reframe-video

Change the aspect ratio of any video up to 30 seconds long, outputs will be 720p

1.1K runs

flux-kontext-apps / face-to-many-kontext

Become a character, in style

4.3K runs

flux-kontext-apps / renaissance

Turn yourself into a renaissance-era painting for those renaissance moments

1.1K runs

flux-kontext-apps / multi-image-list

FLUX Kontext max with list input for multiple images

11.4K runs

kwaivgi / kling-lip-sync

Add lip-sync to any video with an audio file or text

2.1K runs

View all official models

I want to…

Generate images

Models that generate images from text prompts

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Generate videos

Models that create and edit videos

Caption images

Models that generate text from images

Transcribe speech

Models that convert speech to text

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Remove backgrounds

Models that remove backgrounds from images and videos

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Caption videos

Models that generate text from videos

Edit images

Tools for manipulating images.

Use a face to make images

Make realistic images of people instantly

Get embeddings

Models that generate embeddings from inputs

Generate speech

Convert text to speech

Generate music

Models to generate and modify music

Generate text

Models that can understand and generate text

Use handy tools

Toolbelt-type models for videos and images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Sing with voices

Voice-to-voice cloning and musical prosody

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Chat with images

Ask language models about images

Extract text from images

Optical character recognition (OCR) and text extraction

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 2 months, 4 weeks ago 993.4M runs

krthr/clip-embeddings

Generate CLIP (clip-vit-large-patch14) text & image embeddings

Updated 1 year, 10 months ago 39.1M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 4 months, 2 weeks ago 30.5M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 2 years, 3 months ago 91.1M runs

prunaai/flux-schnell

This is an optimised version of the FLUX.1 [schnell] model from Black Forest Labs made with Pruna. We achieve a ~3x speedup over the original model with minimal quality loss.

Updated 1 week, 5 days ago 1.4M runs

vaibhavs10/incredibly-fast-whisper

whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗

Updated 1 year, 4 months ago 8M runs

salesforce/blip

Generate image captions

Updated 2 years, 8 months ago 166.1M runs

adirik/grounding-dino

Detect everything with language!

Updated 1 year, 7 months ago 9M runs

Latest models

jyoung105/imp

a family of multimodal small language models

Updated 1 year, 4 months ago 70 runs

ltejedor/prolific-splats

image to 3D

Updated 1 year, 4 months ago 45 runs

spuuntries/flatdolphinmaid-8x7b-gguf

Undi95's FlatDolphinMaid 8x7B Mixtral Merge, GGUF Q5_K_M quantized by TheBloke.

Updated 1 year, 4 months ago 414.8K runs

tgohblio/instant-id-albedobase-xl

InstantID : Zero-shot Identity-Preserving Generation in Seconds with ⚡️LCM-LoRA⚡️. Using AlbedoBase-XL v2.0 as base model.

Updated 1 year, 4 months ago 178.8K runs

lucataco/img-and-audio2video

Take an image and an audio file and create a video clip

Updated 1 year, 4 months ago 3.6K runs

lucataco/watermark_detector

amrul-hzz's fine-tuned version of vit-base-patch16-224-in21k for watermark image detection

Updated 1 year, 4 months ago 355 runs

asiryan/proteus-v0.2

Proteus v0.2 Model (Text2Img, Img2Img and Inpainting)

Updated 1 year, 4 months ago 13.6K runs

fofr/txt2img

Many models: RealVisXL, Juggernaut, Proteus, DreamShaper, etc.

Updated 1 year, 4 months ago 10.6K runs

dsingal0/mixtral-single-gpu

Runs Mixtral 8x7B on a single A40 GPU

Updated 1 year, 4 months ago 88 runs

sakemin/musicgen-remixer

Remix the music into another styles with MusicGen Chord

Updated 1 year, 4 months ago 15.8K runs

lucataco/moondream1

(Research only) Moondream1 is a vision language model that performs on par with models twice its size

Updated 1 year, 4 months ago 11.5K runs

datacte/proteus-v0.2

Proteus v0.2 shows subtle yet significant improvements over Version 0.1. It demonstrates enhanced prompt understanding that surpasses MJ6, while also approaching its stylistic capabilities.

Updated 1 year, 4 months ago 10.1M runs

jyoung105/moondream

Tiny vision language model

Updated 1 year, 4 months ago 306 runs

spuuntries/borealis-10.7b-dpo-gguf

Undi95's Borealis 10.7B Mistral DPO Finetune, GGUF Q5_K_M quantized by Undi95.

Updated 1 year, 4 months ago 76 runs

grandlineai/instant-id-photorealistic

InstantID : Zero-shot Identity-Preserving Generation in Seconds. Using Juggernaut-XL v8 as the base model to encourage photorealism

Updated 1 year, 4 months ago 42.7K runs

grandlineai/instant-id-artistic

InstantID : Zero-shot Identity-Preserving Generation in Seconds. Using Dreamshaper-XL as the base model to encourage artistic generations

Updated 1 year, 4 months ago 10.9K runs

nateraw/musicgen-songstarter-v0.1

Generate song ideas!

Updated 1 year, 4 months ago 586 runs

cjwbw/depth-anything

Highly practical solution for robust monocular depth estimation by training on a combination of 1.5M labeled images and 62M+ unlabeled images

Updated 1 year, 4 months ago 12.1K runs

lucataco/siglip

SigLIP proposes to replace the loss function used in CLIP by a simple pairwise sigmoid loss

Updated 1 year, 4 months ago 764 runs

lucataco/wizardcoder-33b-v1.1-gguf

WizardCoder: Empowering Code Large Language Models with Evol-Instruct

Updated 1 year, 4 months ago 17K runs

cjwbw/tokenflow

Consistent Diffusion Features for Consistent Video Editing

Updated 1 year, 4 months ago 2K runs

artificialguybr/nebul.redmond

Nebul.Redmond - Stable Diffusion SD XL Finetuned Model

Updated 1 year, 4 months ago 16.9K runs

tencentarc/photomaker-style

Create photos, paintings and avatars for anyone in any style within seconds. (Stylization version)

Updated 1 year, 4 months ago 1.4M runs

pollinations/amt

Video Smoother: AMT All-Pairs Multi-Field Transforms for Efficient Frame Interpolation

Updated 1 year, 4 months ago 16.6K runs

meepo-pro-player/invoker

Updated 1 year, 4 months ago 34.8K runs

sontungpytn/comfyui-lora-upscaler

Updated 1 year, 4 months ago 85.5K runs

kcaverly/neuralbeagle14-7b-gguf

NeuralBeagle14-7B is (probably) the best 7B model you can find!

Updated 1 year, 4 months ago 12.2K runs

voku682/video_style_transfer

Updated 1 year, 4 months ago 277 runs

tomasmcm/sensei-7b-v1

Source: SciPhi/Sensei-7B-V1 ✦ Quant: TheBloke/Sensei-7B-V1-AWQ ✦ Sensei is specialized in performing RAG over detailed web search results

Updated 1 year, 4 months ago 37 runs

tomasmcm/whiterabbitneo-13b

Source: WhiteRabbitNeo/WhiteRabbitNeo-13B-v1 ✦ TheBloke/WhiteRabbitNeo-13B-AWQ ✦ WhiteRabbitNeo is a model series that can be used for offensive and defensive cybersecurity

Updated 1 year, 4 months ago 126 runs

mbukerepo/photomaker

PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

Updated 1 year, 4 months ago 4.9K runs

tencentarc/photomaker

Create photos, paintings and avatars for anyone in any style within seconds.

Updated 1 year, 4 months ago 7.1M runs

konieshadow/fooocus-api-anime

Third party Fooocus replicate model with preset 'anime'

Updated 1 year, 4 months ago 286.9K runs

konieshadow/fooocus-api-realistic

Third party Fooocus replicate model with preset 'realistic'

Updated 1 year, 4 months ago 775.2K runs

konieshadow/fooocus-api

Third party Fooocus replicate model

Updated 1 year, 4 months ago 1.4M runs

lucataco/whisperspeech-small

An Open Source text-to-speech system built by inverting Whisper

Updated 1 year, 4 months ago 1.6K runs

zsxkib/moore-animateanyone

Unofficial Re-Trained AnimateAnyone (Image + DWPose Video → Animated Video of Image)

Updated 1 year, 4 months ago 873 runs

dhanushreddy291/arthemy-comics

Updated 1 year, 4 months ago 280 runs

adirik/realvisxl-v3.0-turbo

Photorealism with RealVisXL V3.0 Turbo based on SDXL

Updated 1 year, 4 months ago 312.4K runs

adirik/imagedream

Image-Prompt Multi-view Diffusion for 3D Generation

Updated 1 year, 4 months ago 1.5K runs

smoosh-sh/baby-mystic

Implementation of Realistic Vision v5.1 to conjure up images of the potential baby using a single photo from each parent

Updated 1 year, 4 months ago 3.2M runs

lucataco/magnet

MAGNeT: Masked Audio Generation using a Single Non-Autoregressive Transformer

Updated 1 year, 4 months ago 2.4K runs

dhanushreddy291/forge-saga-landscape

ForgeSaga Landscape

Updated 1 year, 4 months ago 110 runs

dhanushreddy291/manmaru-mix-v3

Manmaru mix v3.0

Updated 1 year, 4 months ago 697 runs

tomasmcm/digital-socrates-13b

Source: allenai/digital-socrates-13b ✦ Quant: TheBloke/digital-socrates-13B-AWQ ✦ Digital Socrates is an open-source, automatic explanation-critiquing model

Updated 1 year, 4 months ago 19 runs

tomasmcm/towerinstruct-7b-v0.1

Source: Unbabel/TowerInstruct-7B-v0.1 ✦ Quant: TheBloke/TowerInstruct-7B-v0.1-AWQ ✦ This model is trained to handle several translation-related tasks, such as general machine translation, gramatical error correction, and paraphrase generation

Updated 1 year, 4 months ago 191 runs

csslc/ccsr

Improving the Stability of Diffusion Models for Content Consistent Super-Resolution

Updated 1 year, 5 months ago 3.4K runs

datacte/proteus-v0.1

ProteusV0.1 uses OpenDalleV1.1 as a base and further refines prompt adherence and stylistic capabilities to a measurable degree

Updated 1 year, 5 months ago 6.7K runs

fermatresearch/sdxl-improved-refiner

Great image quality, good old SDXL with a new and improved Tile refiner.

Updated 1 year, 5 months ago 830 runs

chenxwh/video-retalking

Audio-based Lip Synchronization for Talking Head Video

Updated 1 year, 5 months ago 30.3K runs

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Featured models

luma / reframe-video

google / imagen-4-fast

google / imagen-4-ultra

google / imagen-4

replicate / fast-flux-trainer

google / veo-3

black-forest-labs / flux-kontext-pro

black-forest-labs / flux-kontext-max

ideogram-ai / ideogram-v3-turbo

Official models

anthropic / claude-4-sonnet

luma / reframe-image

luma / reframe-video

openai / o1

openai / o4-mini

openai / gpt-4o-mini

openai / gpt-4o

openai / gpt-4.1

openai / gpt-4.1-mini

openai / gpt-4.1-nano

google / imagen-4-fast

google / imagen-4-ultra

google / imagen-4

google / imagen-3-fast

google / imagen-3

google / veo-3

flux-kontext-apps / face-to-many-kontext

flux-kontext-apps / renaissance

flux-kontext-apps / multi-image-list

kwaivgi / kling-lip-sync

I want to…

Popular models

Latest models