Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Get started Learn more

Featured models

google / veo-3

Sound on: Google’s flagship Veo 3 text to video model, with audio

61.5K runs

luma / reframe-video

Change the aspect ratio of any video up to 30 seconds long, outputs will be 720p

1.2K runs

google / imagen-4-fast

Use this fast version of Imagen 4 when speed and cost are more important than quality

7.3K runs

google / imagen-4-ultra

Use this ultra version of Imagen 4 when quality matters more than speed and cost

16.8K runs

google / imagen-4

Google's Imagen 4 flagship model

344.7K runs

replicate / fast-flux-trainer

Train subjects or styles faster than ever

8.1K runs

black-forest-labs / flux-kontext-pro

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

2.3M runs

black-forest-labs / flux-kontext-max

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts

1.3M runs

ideogram-ai / ideogram-v3-turbo

Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles

216.1K runs

Official models

Official models are always on, maintained, and have predictable pricing.

google / veo-3

Generate videos

61.5K runs

anthropic / claude-4-sonnet

Claude Sonnet 4 is a significant upgrade to 3.7, delivering superior coding and reasoning while responding more precisely to your instructions

107.4K runs

luma / reframe-image

Change the aspect ratio of any photo using AI (not cropping)

420 runs

luma / reframe-video

Change the aspect ratio of any video up to 30 seconds long, outputs will be 720p

1.2K runs

flux-kontext-apps / face-to-many-kontext

Become a character, in style

4.4K runs

flux-kontext-apps / renaissance

Turn yourself into a renaissance-era painting for those renaissance moments

1.1K runs

flux-kontext-apps / multi-image-list

FLUX Kontext max with list input for multiple images

12.5K runs

kwaivgi / kling-lip-sync

Add lip-sync to any video with an audio file or text

2.1K runs

View all official models

I want to…

Generate images

Models that generate images from text prompts

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Generate videos

Models that create and edit videos

Caption images

Models that generate text from images

Transcribe speech

Models that convert speech to text

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Remove backgrounds

Models that remove backgrounds from images and videos

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Caption videos

Models that generate text from videos

Edit images

Tools for manipulating images.

Use a face to make images

Make realistic images of people instantly

Get embeddings

Models that generate embeddings from inputs

Generate speech

Convert text to speech

Generate music

Models to generate and modify music

Generate text

Models that can understand and generate text

Use handy tools

Toolbelt-type models for videos and images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Sing with voices

Voice-to-voice cloning and musical prosody

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Chat with images

Ask language models about images

Extract text from images

Optical character recognition (OCR) and text extraction

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 2 months, 4 weeks ago 994.2M runs

prunaai/flux-schnell

This is an optimised version of the FLUX.1 [schnell] model from Black Forest Labs made with Pruna. We achieve a ~3x speedup over the original model with minimal quality loss.

Updated 1 week, 6 days ago 1.6M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 4 months, 2 weeks ago 30.6M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 2 years, 3 months ago 91.1M runs

salesforce/blip

Generate image captions

Updated 2 years, 8 months ago 166.1M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 8 months ago 31.6M runs

adirik/grounding-dino

Detect everything with language!

Updated 1 year, 7 months ago 9.1M runs

philz1337x/clarity-upscaler

High resolution image Upscaler and Enhancer. Use at ClarityAI.co. A free Magnific alternative. Twitter/X: @philz1337x

Updated 11 months, 3 weeks ago 15.5M runs

Latest models

tomasmcm/docsgpt-7b-mistral

Source: Arc53/docsgpt-7b-mistral ✦ Quant: TheBloke/docsgpt-7B-mistral-AWQ ✦ DocsGPT is optimized for Documentation (RAG), fine-tuned for providing answers that are based on context

Updated 1 year, 5 months ago 77 runs

alexgenovese/upscaler

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration

Updated 1 year, 5 months ago 4.3M runs

fermatresearch/open-dalle-1.1-lora

Better than SDXL at both prompt adherence and image quality, by dataautogpt3

Updated 1 year, 5 months ago 132.4K runs

fictions-ai/autocaption

Automatically add captions to a video

Updated 1 year, 5 months ago 43.6K runs

bawgz/stable-dripfusion-2

Updated 1 year, 5 months ago 262 runs

musicly-ai/singing_voice_conversion

this is the replicate version of singing_voice_conversion from amphion

Updated 1 year, 5 months ago 571 runs

charlesmccarthy/animagine-xl

Animagine XL 2.0 is an advanced latent text-to-image diffusion model designed to create high-resolution, detailed anime images.

Updated 1 year, 5 months ago 9.3K runs

moayedhajiali/elasticdiffusion

ElasticDiffusion: Training-free Arbitrary Size Image Generation

Updated 1 year, 5 months ago 170 runs

zsxkib/patch-fusion

Super High Quality Depth Maps 🗺️: An End-to-End Tile-Based Framework 🏗️ for High-Resolution Monocular Metric Depth Estimation 🔍📏

Updated 1 year, 5 months ago 369 runs

lucataco/open-dalle-v1.1

A unique fusion that showcases exceptional prompt adherence and semantic understanding, it seems to be a step above base SDXL and a step closer to DALLE-3 in terms of prompt comprehension

Updated 1 year, 5 months ago 127.2K runs

lucataco/diffusion-motion-transfer

Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer

Updated 1 year, 5 months ago 178 runs

kcaverly/nous-hermes-2-yi-34b-gguf

Nous Hermes 2 - Yi-34B is a state of the art Yi Fine-tune, fine tuned on GPT-4 generated synthetic data

Updated 1 year, 5 months ago 11.6K runs

charlesmccarthy/terminus-xl-otaku-v1

Terminus XL Otaku is a latent diffusion model that uses zero-terminal SNR noise schedule and velocity prediction objective at training and inference time.

Updated 1 year, 5 months ago 42 runs

usamaehsan/controlnet-x-majic-mix-realistic-x-ip-adapter

works with inpainting and multi-controlnet + single-controlnet || ip-adapter + without ip adapter

Updated 1 year, 5 months ago 23.8K runs

cjwbw/faster-diffusion

Rethinking the Role of UNet Encoder in Diffusion Models

Updated 1 year, 5 months ago 133 runs

meepo-pro-player/winter-wyvern

Updated 1 year, 5 months ago 256.1K runs

charlesmccarthy/terminus-xl-gamma-v2

Terminus XL Gamma is a new state-of-the-art latent diffusion model that uses zero-terminal SNR noise schedule and velocity prediction objective at training and inference time.

Updated 1 year, 5 months ago 279 runs

tomasmcm/sam-7b

Source: SuperAGI/SAM ✦ Quant: TheBloke/SAM-AWQ ✦ SAM (Small Agentic Model), a 7B model that demonstrates impressive reasoning abilities despite its smaller size

Updated 1 year, 5 months ago 78 runs

fofr/sdxl-multi-controlnet-lora

Multi-controlnet, lora loading, img2img, inpainting

Updated 1 year, 5 months ago 211.3K runs

jd7h/luciddreamer

High-Fidelity Text-to-3D Generation via Interval Score Matching

Updated 1 year, 5 months ago 71 runs

intentface/poro-34b-gguf-checkpoint

Try out akx/Poro-34B-gguf, Q5_K, This is 1000B checkpoint model

Updated 1 year, 5 months ago 26 runs

cloversid099/deepfake

DeepFake AI

Updated 1 year, 5 months ago 63.1K runs

lucataco/singing_voice_conversion

Amphion Singing Voice Conversion: DiffWaveNetSVC

Updated 1 year, 5 months ago 973 runs

zelenioncode/custum_model_safetonsors

DreamBooth safetensors model use RealVisXL

Updated 1 year, 5 months ago 755 runs

fofr/realvisxl-v3

Amazing photorealism with RealVisXL_V3.0, based on SDXL, trainable

Updated 1 year, 5 months ago 743.7K runs

sakemin/all-in-one-music-structure-analyzer

Cog implementation of mir-aidj(Taejun Kim)'s 'All-In-One Music Structure Analyzer'

Updated 1 year, 5 months ago 23.9K runs

lucataco/ip-adapter-faceid

(Research only) IP-Adapter-FaceID can generate various style images conditioned on a face with only text prompts

Updated 1 year, 5 months ago 30.7K runs

culturecloud/dreamshaper-xl-turbo

DreamShaper is a general purpose SD model that aims at doing everything well, photos, art, anime, manga. It's designed to go against other general purpose models and pipelines like Midjourney and DALL-E.

Updated 1 year, 5 months ago 1.4K runs

fermatresearch/dpo-sdxl-controlnet-lora

DPO-SDXL Canny controlnet with LoRA support.

Updated 1 year, 5 months ago 769 runs

leandroamaral/segmentanything

Segment Anything MASK

Updated 1 year, 5 months ago 1.2K runs

lucataco/dreamshaper-xl-turbo

DreamShaper is a general purpose SD model that aims at doing everything well, photos, art, anime, manga. It's designed to match Midjourney and DALL-E.

Updated 1 year, 5 months ago 221.6K runs

lucataco/dpo-sdxl

Direct Preference Optimization (DPO) is a method to align diffusion models to text human preferences by directly optimizing on human comparison data

Updated 1 year, 5 months ago 2.2K runs

zust-ai/zust-diffusion

auto1111_ds8

Updated 1 year, 5 months ago 61.8K runs

lucataco/seamless_communication

FacebookResearch/SeamlessM4T v2 - Massively Multilingual & Multimodal Machine Translation

Updated 1 year, 5 months ago 802 runs

anotherjesse/sdxl-lcm-testing

Updated 1 year, 5 months ago 364 runs

kcaverly/openchat-3.5-1210-gguf

The "Overall Best Performing Open Source 7B Model" for Coding + Generalization or Mathematical Reasoning

Updated 1 year, 5 months ago 26.3K runs

carcruz97/scaling-model-v3

Updated 1 year, 5 months ago 64 runs

tomasmcm/prometheus-13b-v1.0

Source: kaist-ai/prometheus-13b-v1.0 ✦ Quant: TheBloke/prometheus-13B-v1.0-AWQ ✦ An alternative to GPT-4 when evaluating LLMs & Reward models for RLHF

Updated 1 year, 5 months ago 54K runs