Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Get started Learn more

Featured models

minimax / hailuo-02

Hailuo 2 is a text-to-video and image-to-video model that can make 6s or 10s videos at 768p (standard) or 1080p (pro). It excels at real world physics.

22K runs

minimax / hailuo-02-fast

A low cost and fast version of Hailuo 02. Generate 6s and 10s videos in 512p

399 runs

bytedance / omni-human

Turns your audio/video/images into professional-quality animated videos

521 runs

google / veo-3-fast

A faster and cheaper version of Google’s Veo 3 video model, with audio

14.6K runs

google / veo-3

Sound on: Google’s flagship Veo 3 text to video model, with audio

129.4K runs

flux-kontext-apps / kontext-emoji-maker

Use kontext to turn any image into an emoji, using a lora by starsfriday

464 runs

wan-video / wan-2.2-t2v-fast

A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B text-to-video

5K runs

black-forest-labs / flux-krea-dev

An opinionated text-to-image model from Black Forest Labs in collaboration with Krea that excels in photorealism. Creates images that avoid the oversaturated "AI look".

13.6K runs

wan-video / wan-2.2-i2v-a14b

Image-to-video at 720p and 480p with Wan 2.2 A14B

2.9K runs

Official models

Official models are always on, maintained, and have predictable pricing.

minimax / hailuo-02

Generate videos, and Videos from images

22K runs

bytedance / omni-human

Turns your audio/video/images into professional-quality animated videos

521 runs

openai / clip

Official CLIP models, generate CLIP (clip-vit-large-patch14) text & image embeddings

90 runs

ibm-granite / granite-speech-3.3-8b

Granite-speech-3.3-8b is a compact and efficient speech-language model, specifically designed for automatic speech recognition (ASR) and automatic speech translation (AST).

733 runs

black-forest-labs / flux-krea-dev

An opinionated text-to-image model from Black Forest Labs in collaboration with Krea that excels in photorealism. Creates images that avoid the oversaturated "AI look".

13.6K runs

wan-video / wan-2.2-i2v-a14b

Generate videos, Videos from images, and Make videos with Wan

2.9K runs

minimax / video-01

Generate videos, and Videos from images

554.8K runs

ibm-granite / granite-3.3-8b-instruct

Use LLMs

855.4K runs

ibm-granite / granite-vision-3.3-2b

Granite-vision-3.3-2b is a compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.

5.3K runs

bytedance / seedance-1-lite

Generate videos, and Videos from images

236.7K runs

bytedance / seedance-1-pro

Generate videos, and Videos from images

167.7K runs

luma / photon-flash

Generate images

128.2K runs

luma / ray-2-540p

Generate videos, and Videos from images

9.8K runs

luma / ray-2-720p

Generate videos, and Videos from images

24.2K runs

luma / ray-flash-2-720p

Generate videos, and Videos from images

25.5K runs

luma / reframe-image

Change the aspect ratio of any photo using AI (not cropping)

6K runs

View all official models

I want to…

Generate images

Use AI To Generate Images & Photos with an API

Caption videos

Use AI To Caption Videos with an API

Generate speech

Convert text to speech

Use a face to make images

Make realistic images of people instantly

Generate videos

Use AI To Generate Videos with an API

Upscale images

Upscaling models that create high-quality images from low-quality images

Generate music

Use AI To Generate Music with an API

Edit images

Use AI To Edit Any Image with an API

Transcribe speech

Models that convert speech to text

Extract text from images

Optical character recognition (OCR) and text extraction

Remove backgrounds

Models that remove backgrounds from images and videos

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Upscaling models that create high-quality video from low-quality videos

Edit Videos

Tools for editing videos.

Videos from images

Use AI To Generate Videos from images with an API

Make videos with Wan

Generate videos with Wan, the fastest and highest quality open-source video generation model.

Use Kontext fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Caption images

Use AI To Caption Images with an API

Chat with images

Ask language models about images

Use LLMs

Models that can understand and generate text

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use handy tools

Toolbelt-type models for videos and images.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Sing with voices

Voice-to-voice cloning and musical prosody

Get embeddings

Models that generate embeddings from inputs

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Use official models

Official models are always on, maintained, and have predictable pricing.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Popular models

openai/whisper

Convert speech in audio to text

Updated 8 months, 1 week ago 109.8M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 2 years, 4 months ago 97.3M runs

prunaai/flux.1-dev

This is the fastest Flux Dev endpoint in the world, contact us for more at pruna.ai

Updated 1 week, 1 day ago 11.8M runs

turian/insanely-fast-whisper-with-video

whisper-large-v3, incredibly fast, with video transcription

Updated 1 year, 6 months ago 2.5M runs

851-labs/background-remover

Remove backgrounds from images.

Updated 7 months, 2 weeks ago 5.4M runs

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 4 months, 2 weeks ago 1B runs

salesforce/blip

Generate image captions

Updated 2 years, 10 months ago 167.3M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 10 months ago 36M runs

Latest models

aodianyun/minicpm-v-26

Updated 10 months, 2 weeks ago 13 runs

aodianyun/minicpm-v-26-int4

Updated 10 months, 2 weeks ago 12 runs

remodela-ai/style-transfer-ii

Updated 10 months, 3 weeks ago 323 runs

hexiaochun/video_merge

视频合并

Updated 10 months, 3 weeks ago 1.7K runs

hexiaochun/img2video

输入图片和音频合并关键帧视频

Updated 10 months, 3 weeks ago 6K runs

hexiaochun/video_uitls

视频转换工具包

Updated 10 months, 3 weeks ago 8 runs

lucataco/ollama-reflection-70b

Ollama Reflection 70b

Updated 10 months, 3 weeks ago 1.6K runs

hexiaochun/minicpm_v26

minicpm 视频理解

Updated 10 months, 3 weeks ago 515 runs

0xroyce/plutus

Fine-tuned version of the LLaMA-3.1-8B model, specifically optimized for tasks in finance, economics, trading, psychology, and social engineering.

Updated 10 months, 3 weeks ago 66 runs

nicknaskida/incredibly-fast-whisper

whisper-large-v3, incredibly fast, with speaker diarization, powered by Hugging Face Transformers! 🤗

Updated 10 months, 3 weeks ago 218 runs

aodianyun/qwen2-vl-7b

Updated 10 months, 3 weeks ago 1.6K runs

aodianyun/qwen2-vl-2b

Updated 10 months, 3 weeks ago 129 runs

lucataco/flux-schnell-lora

FLUX.1-Schnell LoRA Explorer

Updated 10 months, 3 weeks ago 1.5M runs

xavriley/beat_this

Detect beats in music

Updated 10 months, 3 weeks ago 53 runs

fofr/nsfw-model-comparison

Compare nsfw models against inputs

Updated 10 months, 4 weeks ago 157 runs

pipi32167/minicpm-v-26

Chat with image or video.

Updated 10 months, 4 weeks ago 1K runs

argildotai/sam2removevideobackground

This project uses the Segment Anything 2 (SAM2) model to remove backgrounds from videos.

Updated 11 months ago 1.2K runs

usamaehsan/flux-multicontrolnet

multi controlnet union pro <-

Updated 11 months ago 92 runs

helios-infotech/sketch_to_image

AI that transforms sketches into realistic images. Upload your drawing and describe it in the prompt. You can also adjust the ControlNet parameters and scale the image to a higher resolution for better results

Updated 11 months ago 2.3K runs

asiryan/kolors

Kolors Model (Text2Img and Img2Img)

Updated 11 months ago 20.4K runs

cuuupid/qwen2-vl-2b

SOTA open-source model for chatting with videos and the newest model in the Qwen family

Updated 11 months ago 576 runs

victor-upmeet/whisperx-a40-large

Accelerated transcription, word-level timestamps and diarization with whisperX large-v3 for large audio files

Updated 11 months ago 299.5K runs

victor-upmeet/whisperx

Accelerated transcription, word-level timestamps and diarization with whisperX large-v3

Updated 11 months ago 3.4M runs

pnyompen/dreamshaper-controlnet

Dreamshaper canny controlnet

Updated 11 months ago 234 runs

pipi32167/joy-caption

Caption any images.

Updated 11 months ago 41.1K runs

bytedance/hyper-flux-16step

Hyper FLUX 16-step by ByteDance

Updated 11 months ago 6.6M runs

tahercoolguy/diffutoon

This will convert any video to anime

Updated 11 months ago 50 runs

wolverinn/ecommerce-model

Updated 11 months ago 77 runs

cuuupid/cogvideox-5b

Generate high quality videos from a prompt

Updated 11 months ago 2.2K runs

lucataco/controlnet-union-pro

ControlNet for FLUX.1-dev model jointly released by InstantX and Shakker Labs

Updated 11 months ago 2K runs

samim23/internlm-xcomposer2

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Updated 11 months, 1 week ago 95 runs

skytells-research/flux

Flux-dev inference & LoRa training

Updated 11 months, 1 week ago 411 runs

wolverinn/ecommerce-virtual-try-on

Virtual try-on using Stable Diffusion and IP-Adapter

Updated 11 months, 1 week ago 2.5K runs

wolverinn/realistic-background

replace background with Stable Diffusion and ControlNet

Updated 11 months, 1 week ago 138K runs

wolverinn/realisticoutpainter

outpaint with stable diffusion and ControlNet

Updated 11 months, 1 week ago 21.5K runs

xuhongming251/comfyui

run comfyui flow

Updated 11 months, 1 week ago 22 runs

ibm-granite/granite-8b-code-instruct-128k

Join the Granite community where you can find numerous recipe workbooks to help you get started with a wide variety of use cases using this model. https://github.com/ibm-granite-community

Updated 11 months, 1 week ago 548.4K runs

rubenamtz/law2entity

Turn text from the law in Spanish to structured data to feed support building a knowledge graph

Updated 11 months, 1 week ago 5 runs

andreasjansson/random-shapes

Random rectangles, ellipses, lines, in random colors

Updated 11 months, 1 week ago 91 runs

okaris/omni-zero

Omni-Zero: A diffusion pipeline for zero-shot stylized portrait creation.

Updated 11 months, 1 week ago 227K runs

hexiaochun/pp-ocr-v4

图文识别

Updated 11 months, 1 week ago 263.6K runs

pixelprotest/flux-monkey-island

Flux LoRa trained on Secret of Monkey Island

Updated 11 months, 1 week ago 593 runs

rostikl/my-ai-model

Test model

Updated 11 months, 1 week ago 7 runs

hexiaochun/video_split

视频识别自动分割场景

Updated 11 months, 1 week ago 26 runs

hexiaochun/video2img

提取视频中的图片

Updated 11 months, 1 week ago 7 runs

hexiaochun/video2mp3

提取视频中的音频

Updated 11 months, 1 week ago 80 runs

lucataco/flux-dev

Flux Dev diffusers implementation

Updated 11 months, 1 week ago 1K runs

ibm-granite/granite-20b-code-instruct-8k

Join the Granite community where you can find numerous recipe workbooks to help you get started with a wide variety of use cases using this model. https://github.com/ibm-granite-community

Updated 11 months, 1 week ago 110K runs

omniedgeio/cog-flux

Run FLUX.1 with lora and controlnet

Updated 11 months, 2 weeks ago 625 runs

zsxkib/flux-dev-inpainting

🎨 Fill in masked parts of images with FLUX.1-dev 🖌️

Updated 11 months, 2 weeks ago 385.5K runs

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Featured models

minimax / hailuo-02

minimax / hailuo-02-fast

bytedance / omni-human

google / veo-3-fast

google / veo-3

flux-kontext-apps / kontext-emoji-maker

wan-video / wan-2.2-t2v-fast

black-forest-labs / flux-krea-dev

wan-video / wan-2.2-i2v-a14b

Official models

minimax / hailuo-02

bytedance / omni-human

google / veo-3-fast

google / veo-3

wan-video / wan-2.2-t2v-fast

bytedance / seededit-3.0

openai / clip

ibm-granite / granite-speech-3.3-8b

black-forest-labs / flux-krea-dev

wan-video / wan-2.2-i2v-a14b

minimax / video-01

ibm-granite / granite-3.3-8b-instruct

ibm-granite / granite-vision-3.3-2b

bytedance / seedance-1-lite

bytedance / seedance-1-pro

luma / photon-flash

luma / ray-2-540p

luma / ray-2-720p

luma / ray-flash-2-720p

luma / reframe-image

I want to…

Popular models

Latest models