Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Get started Learn more

Featured models

luma / reframe-video

Change the aspect ratio of any video up to 30 seconds long, outputs will be 720p

416 runs

google / imagen-4-fast

Use this fast version of Imagen 4 when speed and cost are more important than quality

3.3K runs

google / imagen-4-ultra

Use this ultra version of Imagen 4 when quality matters more than speed and cost

9.2K runs

google / imagen-4

Google's Imagen 4 flagship model

300.8K runs

replicate / fast-flux-trainer

Train subjects or styles faster than ever

6.9K runs

google / veo-3

Sound on: Google’s flagship Veo 3 text to video model, with audio

50.3K runs

black-forest-labs / flux-kontext-pro

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

1.9M runs

black-forest-labs / flux-kontext-max

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts

1.2M runs

ideogram-ai / ideogram-v3-turbo

Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles

186.3K runs

Official models

Official models are always on, maintained, and have predictable pricing.

anthropic / claude-4-sonnet

Claude Sonnet 4 is a significant upgrade to 3.7, delivering superior coding and reasoning while responding more precisely to your instructions

98.1K runs

luma / reframe-image

Change the aspect ratio of any photo using AI (not cropping)

98 runs

luma / reframe-video

Change the aspect ratio of any video up to 30 seconds long, outputs will be 720p

416 runs

flux-kontext-apps / face-to-many-kontext

Become a character, in style

3.7K runs

flux-kontext-apps / renaissance

Turn yourself into a renaissance-era painting for those renaissance moments

926 runs

flux-kontext-apps / multi-image-list

FLUX Kontext max with list input for multiple images

9.1K runs

kwaivgi / kling-lip-sync

Add lip-sync to any video with an audio file or text

1.9K runs

View all official models

I want to…

Generate images

Models that generate images from text prompts

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Generate videos

Models that create and edit videos

Caption images

Models that generate text from images

Transcribe speech

Models that convert speech to text

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Remove backgrounds

Models that remove backgrounds from images and videos

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Caption videos

Models that generate text from videos

Edit images

Tools for manipulating images.

Use a face to make images

Make realistic images of people instantly

Get embeddings

Models that generate embeddings from inputs

Generate speech

Convert text to speech

Generate music

Models to generate and modify music

Generate text

Models that can understand and generate text

Use handy tools

Toolbelt-type models for videos and images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Sing with voices

Voice-to-voice cloning and musical prosody

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Chat with images

Ask language models about images

Extract text from images

Optical character recognition (OCR) and text extraction

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 2 months, 3 weeks ago 989.6M runs

prunaai/flux-schnell

This is an optimised version of the FLUX.1 [schnell] model from Black Forest Labs made with Pruna. We achieve a ~3x speedup over the original model with minimal quality loss.

Updated 1 week, 3 days ago 911K runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 4 months, 2 weeks ago 30M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 8 months ago 31.4M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 2 years, 3 months ago 90.8M runs

adirik/grounding-dino

Detect everything with language!

Updated 1 year, 7 months ago 8.9M runs

philz1337x/clarity-upscaler

High resolution image Upscaler and Enhancer. Use at ClarityAI.co. A free Magnific alternative. Twitter/X: @philz1337x

Updated 11 months, 3 weeks ago 15.3M runs

allenhooo/lama

🦙 LaMa: Resolution-robust Large Mask Inpainting with Fourier Convolutions

Updated 1 year, 11 months ago 8.6M runs

Latest models

muqtadar08/llava_phi-3-mini

Updated 1 year, 1 month ago 11 runs

lucataco/qwen1.5-110b

Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data

Updated 1 year, 1 month ago 2.7K runs

cjwbw/hyper-sdxl-1step-t2i

Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis

Updated 1 year, 1 month ago 1.4K runs

hayooucom/llm-60k

llm model ,for CN

Updated 1 year, 1 month ago 215 runs

asiryan/reliberate-v3

Reliberate v3 Model (Text2Img, Img2Img and Inpainting)

Updated 1 year, 1 month ago 2.3M runs

asiryan/deliberate-v6

Deliberate V6 Model (Text2Img, Img2Img and Inpainting)

Updated 1 year, 1 month ago 11.6K runs

asiryan/absolutereality-v1.8.1

AbsoluteReality V1.8.1 Model (Text2Img, Img2Img and Inpainting)

Updated 1 year, 1 month ago 87.3K runs

microsoft/phi-3-mini-128k-instruct

Phi-3-Mini-128K-Instruct is a 3.8 billion-parameter, lightweight, state-of-the-art open model trained using the Phi-3 datasets

Updated 1 year, 1 month ago 65.1K runs

sesamo-srl/bge-reranker-v2-m3

Newest reranker model from BAAI (https://huggingface.co/BAAI/bge-reranker-v2-m3). FP16 inference enabled. Normalize param available

Updated 1 year, 1 month ago 1.4K runs

mopineyro/resnet_breeds_finetuned

ResNet Fine-Tuned on 37 dog & cat breeds

Updated 1 year, 1 month ago 176 runs

fofr/video-morpher

Generate a video that morphs between subjects, with an optional style

Updated 1 year, 1 month ago 14.6K runs

snowflake/snowflake-arctic-instruct

An efficient, intelligent, and truly open-source language model

Updated 1 year, 1 month ago 2M runs

fofr/sticker-maker

Make stickers with AI. Generates graphics with transparent backgrounds.

Updated 1 year, 1 month ago 1.1M runs

ieit-yuan/yuan2.0-2b-mars

yuan2.0-2b-mars是源2.0-2B模型的2024年3月版本，源2.0 是浪潮信息发布的新一代基础语言大模型。我们开源了全部的3个模型源2.0-102B，源2.0-51B和源2.0-2B。并且我们提供了预训练，微调，推理服务的相关脚本，以供研发人员做进一步的开发。源2.0是在源1.0的基础上，利用更多样的高质量预训练数据和指令微调数据集，令模型在语义、数学、推理、代码、知识等不同方面具备更强的理解能力。

Updated 1 year, 1 month ago 19 runs

lucataco/idefics-8b

Idefics2 is an open multimodal model that accepts arbitrary sequences of image and text inputs and produces text outputs

Updated 1 year, 1 month ago 1.1K runs

camenduru/comfyui-ipadapter-latentupscale

IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models

Updated 1 year, 1 month ago 383 runs

zsxkib/flash-face

FlashFace: Human Image Personalization with High-fidelity Identity Preservation

Updated 1 year, 1 month ago 4.5K runs

muqtadar08/finetuned_gemma_sql_generator

Updated 1 year, 1 month ago 19 runs

omniedgeio/virtual-dressing

Updated 1 year, 1 month ago 473 runs

qiweiii/oot_diffusion_dc

full body version

Updated 1 year, 1 month ago 807 runs

cakirilker/hello-world

Updated 1 year, 1 month ago 186 runs

ai-forever/kandinsky-2

text2img model trained on LAION HighRes and fine-tuned on internal datasets

Updated 1 year, 1 month ago 6.2M runs

lucataco/snowflake-arctic-embed-l

snowflake-arctic-embed is a suite of text embedding models that focuses on creating high-quality retrieval models optimized for performance

Updated 1 year, 1 month ago 398.4K runs

hazxone/eye-color

Change eye (iris) color

Updated 1 year, 1 month ago 630 runs

fofr/style-transfer

Transfer the style of one image to another

Updated 1 year, 1 month ago 946.3K runs

yeguangsuixing/hello

input your name, and this model will print the most handsome man

Updated 1 year, 1 month ago 24 runs

nateraw/autotune

pitch correction on your voice

Updated 1 year, 1 month ago 376 runs

camenduru/colorize-line-art

ControlNet Line Art Anime

Updated 1 year, 1 month ago 43.7K runs

meta/meta-llama-3-70b

Base version of Llama 3, a 70 billion parameter language model from Meta.

Updated 1 year, 1 month ago 830.5K runs

meta/meta-llama-3-70b-instruct

A 70 billion parameter language model from Meta, fine tuned for chat completions

Updated 1 year, 1 month ago 153.9M runs

meta/meta-llama-3-8b-instruct

An 8 billion parameter language model from Meta, fine tuned for chat completions

Updated 1 year, 1 month ago 365.1M runs

meta/meta-llama-3-8b

Base version of Llama 3, an 8 billion parameter language model from Meta.

Updated 1 year, 1 month ago 50.9M runs

jordillull/acme-assistant

An example of a rudimentary Q&A assistant for ACME SL

Updated 1 year, 1 month ago 11 runs

camenduru/zest

ZeST: Zero-Shot Material Transfer from a Single Image

Updated 1 year, 1 month ago 1.5K runs

camenduru/instantmesh

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Updated 1 year, 1 month ago 42.1K runs

cjwbw/parler-tts

lightweight text-to-speech (TTS) model, trained on 10.5K hours of audio data

Updated 1 year, 1 month ago 2.5K runs

camenduru/zephyr-orpo-141b-a35b-v0.1

Mixtral 8x22b v0.1 Zephyr Orpo 141b A35b v0.1

Updated 1 year, 1 month ago 137 runs

cjwbw/pixart-sigma

Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Updated 1 year, 2 months ago 6.6K runs

camenduru/magictime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Updated 1 year, 2 months ago 368 runs

holywalley/stt_be_ctc

Updated 1 year, 2 months ago 77 runs

nateraw/musicgen-songstarter-v0.2

A large, stereo MusicGen that acts as a useful tool for music producers

Updated 1 year, 2 months ago 4.1K runs

camenduru/mixtral-8x22b-v0.1-4bit

Mixtral-8x22b-v0.1-4bit

Updated 1 year, 2 months ago 364 runs

manu-sapiens/python-pptx

Use a subset of https://github.com/barun-saha/slide-deck-ai to create powerpoint slides from a json description - using python-pptx (https://github.com/scanny/python-pptx)

Updated 1 year, 2 months ago 289 runs