Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Get started Learn more

Featured models

luma / reframe-video

Change the aspect ratio of any video up to 30 seconds long, outputs will be 720p

344 runs

google / imagen-4-fast

Use this fast version of Imagen 4 when speed and cost are more important than quality

2.2K runs

google / imagen-4-ultra

Use this ultra version of Imagen 4 when quality matters more than speed and cost

7.1K runs

google / imagen-4

Google's Imagen 4 flagship model

294.8K runs

replicate / fast-flux-trainer

Train subjects or styles faster than ever

6.6K runs

google / veo-3

Sound on: Google’s flagship Veo 3 text to video model, with audio

48.3K runs

black-forest-labs / flux-kontext-pro

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

1.8M runs

black-forest-labs / flux-kontext-max

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts

1.1M runs

ideogram-ai / ideogram-v3-turbo

Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles

181K runs

Official models

Official models are always on, maintained, and have predictable pricing.

anthropic / claude-4-sonnet

Claude Sonnet 4 is a significant upgrade to 3.7, delivering superior coding and reasoning while responding more precisely to your instructions

96K runs

luma / reframe-image

Change the aspect ratio of any photo using AI (not cropping)

85 runs

luma / reframe-video

Change the aspect ratio of any video up to 30 seconds long, outputs will be 720p

344 runs

flux-kontext-apps / face-to-many-kontext

Become a character, in style

3.5K runs

flux-kontext-apps / renaissance

Turn yourself into a renaissance-era painting for those renaissance moments

884 runs

flux-kontext-apps / multi-image-list

FLUX Kontext max with list input for multiple images

8.8K runs

kwaivgi / kling-lip-sync

Add lip-sync to any video with an audio file or text

1.9K runs

View all official models

I want to…

Generate images

Models that generate images from text prompts

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Generate videos

Models that create and edit videos

Caption images

Models that generate text from images

Transcribe speech

Models that convert speech to text

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Remove backgrounds

Models that remove backgrounds from images and videos

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Caption videos

Models that generate text from videos

Edit images

Tools for manipulating images.

Use a face to make images

Make realistic images of people instantly

Get embeddings

Models that generate embeddings from inputs

Generate speech

Convert text to speech

Generate music

Models to generate and modify music

Generate text

Models that can understand and generate text

Use handy tools

Toolbelt-type models for videos and images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Sing with voices

Voice-to-voice cloning and musical prosody

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Chat with images

Ask language models about images

Extract text from images

Optical character recognition (OCR) and text extraction

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 2 months, 3 weeks ago 988.7M runs

beautyyuyanli/multilingual-e5-large

multilingual-e5-large: A multi-language text embedding model

Updated 1 year, 5 months ago 21.3M runs

prunaai/flux-schnell

This is an optimised version of the FLUX.1 [schnell] model from Black Forest Labs made with Pruna. We achieve a ~3x speedup over the original model with minimal quality loss.

Updated 1 week, 2 days ago 763.1K runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 4 months, 2 weeks ago 29.9M runs

adirik/grounding-dino

Detect everything with language!

Updated 1 year, 7 months ago 8.8M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 2 years, 3 months ago 90.8M runs

vaibhavs10/incredibly-fast-whisper

whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗

Updated 1 year, 3 months ago 7.9M runs

krthr/clip-embeddings

Generate CLIP (clip-vit-large-patch14) text & image embeddings

Updated 1 year, 9 months ago 38.6M runs

Latest models

mattt/shh

Updated 1 year ago 21 runs

hadilq/hair-segment

This is an ML model to segment hairs in pictures.

Updated 1 year ago 352 runs

swook/inspyrenet

Segment foreground objects with high resolution and matting, using InSPyReNet

Updated 1 year ago 692.2K runs

chenxwh/openvoice

Updated to OpenVoice v2: Versatile Instant Voice Cloning

Updated 1 year ago 60.8K runs

tonyhopkins994/sdxl-prod

Updated 1 year ago 5.9K runs

asiryan/triple-absolute-dreamshaper-meina

Three models in one Cog: Absolute Reality v1.8.1, DreamShaper v8 and Meina V4

Updated 1 year ago 22K runs

deepeshsharma2003/3dmg

Updated 1 year ago 432 runs

tomasmcm/llama-3-8b-instruct-gradient-4194k

Source: gradientai/Llama-3-8B-Instruct-Gradient-4194k ✦ Quant: solidrust/Llama-3-8B-Instruct-Gradient-4194k-AWQ ✦ Extending LLama-3 8B's context length from 8k to 4194K

Updated 1 year ago 142 runs

lucataco/sdxl-clip-interrogator

CLIP Interrogator for SDXL optimizes text prompts to match a given image

Updated 1 year ago 847.3K runs

bytedance/pulid

📖 PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Updated 1 year ago 2.9M runs

jluse/api-model

An example model created from cli

Updated 1 year ago 24 runs

lucataco/paligemma-3b-pt-224

PaliGemma 3B, an open VLM by Google, pre-trained with 224*224 input images and 128 token input/output text sequences

Updated 1 year ago 1.4K runs

daanelson/minigpt-4

A model which generates text in response to an input image and prompt.

Updated 1 year ago 1.7M runs

aicapcut/anima-pencil-v310-with-layer-diffuse

Generate image with transparent background

Updated 1 year ago 632 runs

lucataco/yi-1.5-6b

Yi-1.5 is continuously pre-trained on Yi with a high-quality corpus of 500B tokens and fine-tuned on 3M diverse fine-tuning samples

Updated 1 year, 1 month ago 64 runs

aryamansital/instant_mesh

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view LRMs

Updated 1 year, 1 month ago 262.9K runs

zsxkib/blip-3

Blip 3 / XGen-MM, Answers questions about images ({blip3,xgen-mm}-phi3-mini-base-r-v1)

Updated 1 year, 1 month ago 1.3M runs

mikeei/dolphin-2.9.1-llama3-8b-gguf

Dolphin is uncensored. I have filtered the dataset to remove alignment and bias. This makes the model more compliant.

Updated 1 year, 1 month ago 2K runs

aliakbarghayoori/dfn5b-clip-vit-h-14-384

return CLIP features for the dfn5b-clip-vit-h-14-384, current highest average perf. in openclip models leaderboard.

Updated 1 year, 1 month ago 392 runs

mikeei/dolphin-2.9-llama3-70b-gguf

Dolphin is uncensored. I have filtered the dataset to remove alignment and bias. This makes the model more compliant.

Updated 1 year, 1 month ago 77.3K runs

mikeei/dolphin-2.9-llama3-8b-gguf

Dolphin is uncensored. I have filtered the dataset to remove alignment and bias. This makes the model more compliant.

Updated 1 year, 1 month ago 6.1K runs

expa-ai/cloudflare-hack

Updated 1 year, 1 month ago 6.3K runs

re-mix-1/rembg

Implementation of the RemBG library

Updated 1 year, 1 month ago 351 runs

lucataco/blip3-phi3-mini-instruct-r-v1

BLIP3(XGen-MM) is a series of foundational Large Multimodal Models (LMMs) developed by Salesforce AI Research

Updated 1 year, 1 month ago 383 runs

hovevideo/stable-whisper

Transcribe audios using OpenAI's Whisper with stabilizing timestamps by stable-ts python package.

Updated 1 year, 1 month ago 152 runs

sourav-sarkar-doc32/smile-correct

Updated 1 year, 1 month ago 1.5K runs

fofr/pulid-lightning

Use a face to instantly make images. Uses SDXL Lightning checkpoints.

Updated 1 year, 1 month ago 134.7K runs

asiryan/dark-sushi-mix-225d

Dark Sushi Mix 2.25D Model with vae-ft-mse-840000-ema (Text2Img, Img2Img and Inpainting)

Updated 1 year, 1 month ago 59.3K runs

deepseek-ai/deepseek-67b-base

DeepSeek LLM, an advanced language model comprising 67 billion parameters. Trained from scratch on a vast dataset of 2 trillion tokens in both English and Chinese

Updated 1 year, 1 month ago 466 runs

remodela-ai/style-transfer-i

Updated 1 year, 1 month ago 1.7K runs

georgedavila/cog-tex2pdf

turns text into pdf files with TeX

Updated 1 year, 1 month ago 257 runs

meta/meta-llama-guard-2-8b

A llama-3 based moderation and safeguarding language model

Updated 1 year, 1 month ago 734.9K runs

muqtadar08/llm_finetuning_dataset_generator

Updated 1 year, 1 month ago 9 runs

hadilq/dragon-notdragon

a fine-tuned model to detect dragon in images.

Updated 1 year, 1 month ago 32 runs

tgohblio/instant-id-multicontrolnet

InstantID. ControlNets. More base SDXL models. And the latest ByteDance's ⚡️SDXL-Lightning !⚡️

Updated 1 year, 1 month ago 295.1K runs

kitaef/mytestmodel

The img2img pipeline that makes an anime-style image of a person. It uses one of sd1.5 models as a base, depth-estimation as a ControleNet and IPadapter model for face consistency.

Updated 1 year, 1 month ago 121 runs