Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.


Featured models

google / veo-3

Sound on: Google’s flagship Veo 3 text-to-video model, with audio

63.3K runs

luma / reframe-video

Change the aspect ratio of any video up to 30 seconds long. Outputs are 720p.

1.3K runs

google / imagen-4-ultra

Use this ultra version of Imagen 4 when quality matters more than speed and cost

18.7K runs

google / imagen-4

Google's Imagen 4 flagship model

351.2K runs

replicate / fast-flux-trainer

Train subjects or styles faster than ever

8.3K runs

black-forest-labs / flux-kontext-pro

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

2.4M runs

black-forest-labs / flux-kontext-max

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts

1.3M runs

ideogram-ai / ideogram-v3-turbo

Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles

220.1K runs

anthropic / claude-3.7-sonnet

The most intelligent Claude model and the first hybrid reasoning model on the market (claude-3-7-sonnet-20250219)

1.5M runs

Official models

Official models are always on, maintained, and have predictable pricing.

google / veo-3

Generate videos

63.3K runs

anthropic / claude-4-sonnet

Claude Sonnet 4 is a significant upgrade to 3.7, delivering superior coding and reasoning while responding more precisely to your instructions

109.1K runs

luma / reframe-image

Change the aspect ratio of any photo using AI (not cropping)

544 runs

flux-kontext-apps / face-to-many-kontext

Become a character, in style

4.6K runs

flux-kontext-apps / renaissance

Turn yourself into a renaissance-era painting for those renaissance moments

1.2K runs

flux-kontext-apps / multi-image-list

FLUX Kontext max with list input for multiple images

13.2K runs

kwaivgi / kling-lip-sync

Add lip-sync to any video with an audio file or text

2.2K runs


I want to…

Generate images

Models that generate images from text prompts

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Generate videos

Models that create and edit videos

Caption images

Models that generate text from images

Transcribe speech

Models that convert speech to text

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

## Video generation

- [**minimax/video-01**](https://replicate.com/minimax/video-01): Generate videos from text or images. Use a reference image to create consistent characters.
- [**luma/reframe-video**](https://replicate.com/luma/reframe-video): Reframe and resize videos up to 30 seconds long. Great for social posts.
- [**topazlabs/video-upscale**](https://replicate.com/topazlabs/video-upscale): Enhance and upscale low-res videos using Topaz’s upscaling models.

## Image generation

- [**google/imagen-4**](https://replicate.com/google/imagen-4): Google’s latest text-to-image model. High quality and easy to prompt.
- [**black-forest-labs/flux-kontext-pro**](https://replicate.com/black-forest-labs/flux-kontext-pro): Strong prompt following and style control for both photoreal and illustrated outputs.
- [**ideogram-ai/ideogram-v3-turbo**](https://replicate.com/ideogram-ai/ideogram-v3-turbo): Fast, creative generation. Good for posters, products, or anything with text.
- [**black-forest-labs/flux-1.1-pro**](https://replicate.com/black-forest-labs/flux-1.1-pro): Improved FLUX model with more consistent image quality and diversity.
- [**black-forest-labs/flux-dev**](https://replicate.com/black-forest-labs/flux-dev): Experimental 12B parameter model. Good for testing edge cases.

## Image upscaling + restoration

- [**topazlabs/image-upscale**](https://replicate.com/topazlabs/image-upscale): Professional-grade image upscaling. Clean results with minimal artifacts.
- [**sczhou/codeformer**](https://replicate.com/sczhou/codeformer): Restore low-quality or AI-generated faces.
- [**tencentarc/gfpgan**](https://replicate.com/tencentarc/gfpgan): Fast face restoration, especially for old or damaged photos.
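As a rough illustration of how one of the models above might be called programmatically, the sketch below builds (but does not send) an HTTP request for Replicate's model-predictions endpoint using only the Python standard library. The helper name `build_prediction_request`, the example prompt, and the assumption that the model accepts a `prompt` input are illustrative and not taken from this page; check each model's own page for its actual input schema. An API token is still assumed to be required even when no credit card is.

```python
import json
import urllib.request

API_ROOT = "https://api.replicate.com/v1"


def build_prediction_request(model: str, model_input: dict, token: str) -> urllib.request.Request:
    """Build a POST request that would create a prediction for `model`.

    `model` is an "owner/name" string such as "black-forest-labs/flux-dev".
    `model_input` is the model-specific input dict (assumed here, not verified).
    """
    owner, name = model.split("/", 1)
    url = f"{API_ROOT}/models/{owner}/{name}/predictions"
    body = json.dumps({"input": model_input}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    # "<your-token>" is a placeholder; the "prompt" key is an assumed input name.
    req = build_prediction_request(
        "black-forest-labs/flux-dev",
        {"prompt": "a lighthouse at dusk"},
        token="<your-token>",
    )
    # Only inspect the request here; passing it to urllib.request.urlopen
    # would actually submit it.
    print(req.get_method(), req.full_url)
```

Keeping request construction separate from sending makes it easy to inspect the URL and payload before spending any credits.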

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Remove backgrounds

Models that remove backgrounds from images and videos

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Caption videos

Models that generate text from videos

Edit images

Tools for manipulating images.

Use a face to make images

Make realistic images of people instantly

Get embeddings

Models that generate embeddings from inputs

Generate speech

Convert text to speech

Generate music

Models to generate and modify music

Generate text

Models that can understand and generate text

Use handy tools

Toolbelt-type models for videos and images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Sing with voices

Voice-to-voice cloning and musical prosody

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Chat with images

Ask language models about images

Extract text from images

Optical character recognition (OCR) and text extraction

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 3 months ago 995M runs

beautyyuyanli/multilingual-e5-large

multilingual-e5-large: A multi-language text embedding model

Updated 1 year, 5 months ago 21.5M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 4 months, 2 weeks ago 30.7M runs

prunaai/flux-schnell

This is an optimised version of the FLUX.1 [schnell] model from Black Forest Labs made with Pruna. We achieve a ~3x speedup over the original model with minimal quality loss.

Updated 1 week, 6 days ago 1.7M runs

vaibhavs10/incredibly-fast-whisper

whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗

Updated 1 year, 4 months ago 8.1M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 2 years, 3 months ago 91.2M runs

allenhooo/lama

🦙 LaMa: Resolution-robust Large Mask Inpainting with Fourier Convolutions

Updated 2 years ago 8.8M runs

krthr/clip-embeddings

Generate CLIP (clip-vit-large-patch14) text & image embeddings

Updated 1 year, 10 months ago 39.2M runs

Latest models

cuuupid/seamless_expressive

Translate audio while keeping the original style, pronunciation and tone of your original audio.

Updated 1 year, 6 months ago 791 runs

lucataco/vid2densepose

Convert your videos to DensePose and use it with MagicAnimate

Updated 1 year, 6 months ago 6.1K runs

collectiveai-team/whisper-wordtimestamps

API for enhanced word-level timestamp accuracy using OpenAI's Whisper model

Updated 1 year, 6 months ago 1.3K runs

charlesmccarthy/addwatermark

Add a watermark to your videos using the power of Replicate, brought to you by your friends at FullJourney.AI

Updated 1 year, 6 months ago 403.3K runs

chigozienri/posy-motion-extraction

Extracts motion from video

Updated 1 year, 6 months ago 188 runs

replicate/train-rvc-model

Train your own custom RVC model

Updated 1 year, 6 months ago 253.3K runs

asiryan/deliberate-v5

Deliberate V5 Model (Text2Img, Img2Img and Inpainting)

Updated 1 year, 6 months ago 14.8K runs

asiryan/counterfeit-xl-v2

Counterfeit XL v2 Model (Text2Img, Img2Img and Inpainting)

Updated 1 year, 6 months ago 33.7K runs

adirik/masactrl-anything-v4-0

Edit real or generated images

Updated 1 year, 6 months ago 1.3K runs

adirik/masactrl-stable-diffusion-v1-4

Edit real or generated images

Updated 1 year, 6 months ago 2.6K runs

wglint/1_test

Simple model that performs addition and sends the answer to Supabase

Updated 1 year, 6 months ago 23 runs

daffaakhlaric2424/daffa

Highest-resolution image

Updated 1 year, 6 months ago 100 runs

asiryan/juggernaut-xl-v7

Juggernaut XL v7 Model (Text2Img, Img2Img and Inpainting)

Updated 1 year, 6 months ago 377.4K runs

lucataco/magic-animate

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Updated 1 year, 6 months ago 56.2K runs

yuni-eng/image-to-color

Generate color codes for prominent colors in the image

Updated 1 year, 6 months ago 161 runs

tomasmcm/loyal-piano-m7

Source: chargoddard/loyal-piano-m7 ✦ Quant: TheBloke/loyal-piano-m7-AWQ ✦ Intended to be a roleplay-focused model with some smarts and good long-context recall

Updated 1 year, 6 months ago 43 runs

titocosta/notus-7b-v1

Notus-7b-v1 model

Updated 1 year, 6 months ago 130 runs

jd7h/edit-video-by-editing-text

A pipeline for superfast video editing! Make cuts to a video by editing its transcript.

Updated 1 year, 6 months ago 747 runs

asiryan/juggernaut-aftermath

Juggernaut Aftermath Model with original TRCVAE (Text2Img, Img2Img and Inpainting)

Updated 1 year, 6 months ago 2.5K runs

titocosta/starling

Starling-LM-7B-alpha

Updated 1 year, 6 months ago 48 runs

lucataco/pixart-xl-2

PixArt-Alpha 1024px is a transformer-based text-to-image diffusion system trained on text embeddings from T5

Updated 1 year, 6 months ago 77.1K runs

asiryan/deliberate-v4

Deliberate V4 Model (Text2Img, Img2Img and Inpainting)

Updated 1 year, 6 months ago 1.2K runs

chigozienri/visual-anagrams

Generates multi-view optical illusions

Updated 1 year, 6 months ago 1.3K runs

lucataco/demofusion

DemoFusion: Democratising High-Resolution Image Generation With No 💰

Updated 1 year, 6 months ago 9.3K runs

egsakash/akashi-v2

Updated 1 year, 6 months ago 384 runs

cutzudev/whisper-x

Generates subtitles

Updated 1 year, 6 months ago 403 runs

jimothyjohn/demixing

Separate instruments and/or vocals from any song.

Updated 1 year, 6 months ago 1.1K runs

tomasmcm/juanako-7b-una

Source: fblgit/juanako-7b-UNA ✦ Quant: TheBloke/juanako-7B-UNA-AWQ ✦ juanako uses UNA (Uniform Neural Alignment), a training technique, yet to be published, that eases alignment between transformer layers

Updated 1 year, 6 months ago 40 runs

kovrichard/bertiment

Simple binary sentiment analysis with BERT

Updated 1 year, 6 months ago 339 runs

xiankgx/video-retalking

VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing in the Wild

Updated 1 year, 6 months ago 3.1K runs

mtg/essentia-bpm

Tempo BPM estimation with Essentia

Updated 1 year, 6 months ago 1.1K runs

tomasmcm/starling-lm-7b-alpha

Source: berkeley-nest/Starling-LM-7B-alpha ✦ Quant: TheBloke/Starling-LM-7B-alpha-AWQ ✦ An open large language model (LLM) trained by Reinforcement Learning from AI Feedback (RLAIF)

Updated 1 year, 6 months ago 63.3K runs

xiankgx/short-to-long-video-diffusion

SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Updated 1 year, 6 months ago 657 runs

cjwbw/cogvlm

Powerful open-source visual language model

Updated 1 year, 6 months ago 1.5M runs

lucataco/interpany-clearer

InterpAny-Clearer: Clearer anytime frame interpolation & Manipulated interpolation

Updated 1 year, 6 months ago 11.6K runs

asiryan/sdxl

SDXL Model (Text2Img, Img2Img and Inpainting)

Updated 1 year, 6 months ago 2K runs

sakemin/musicgen-stereo-chord

Generate music in stereo, restricted to chord sequences and tempo

Updated 1 year, 6 months ago 3.2K runs

tomasmcm/openinstruct-mistral-7b

Source: monology/openinstruct-mistral-7b ✦ Quant: TheBloke/openinstruct-mistral-7B-AWQ ✦ Commercially-usable 7B model, based on mistralai/Mistral-7B-v0.1 and finetuned on VMware/open-instruct

Updated 1 year, 6 months ago 296 runs

lucataco/xtts-v2

Coqui XTTS-v2: Multilingual Text To Speech Voice Cloning

Updated 1 year, 6 months ago 2.8M runs

tomasmcm/evolved-seeker-1.3b

Source: TokenBender/evolvedSeeker_1_3 ✦ Quant: TheBloke/evolvedSeeker_1_3-AWQ ✦ A fine-tuned version of deepseek-ai/deepseek-coder-1.3b-base on 50k instructions for 3 epochs

Updated 1 year, 6 months ago 29 runs