Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model on a small set of example images so it can recognize and generate new concepts, such as specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Official models

Official models are always on, maintained, and have predictable pricing.


I want to…

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Latest models

The Yi series models are large language models trained from scratch by developers at 01.AI.

Updated 8.2K runs

Image-to-video - SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Updated 13.2K runs

Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model

Updated 1K runs

Genshin Impact Image SD

Updated 6.1K runs

Controllable Text-to-Music Generation

Updated 8.7K runs

Multi-Controlnet + consistency-decoder + INPAINTING + realestic-vision-v5 + Prompt-Weight + Single-Controlnet

Updated 3.4K runs

controlnet-lineart-brightness-tile-inpainting + low res fix with tile

Updated 710 runs

Mockup generator (bags, t-shirts, mugs, billboard etc) using Stable Diffusion in-painting

Updated 249 runs

WhisperX model for the Spanish language.

Updated 44.8K runs

High-Quality Video Generation with Cascaded Latent Diffusion Models

Updated 13.9K runs

Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

Updated 61.9M runs

BAAI/bge-reranker-large Model with fp16

Updated 286 runs

BAAI/bge-reranker-base Model with fp16

Updated 8.1K runs

Create your own Realistic Voice Cloning (RVC v2) dataset using a YouTube link

Updated 9.7K runs

Performs speaker identity verification

Updated 297 runs

Answers questions about images

Updated 30.2M runs

Simple version of https://huggingface.co/teknium/OpenHermes-2-Mistral-7B

Updated 244 runs

Very fast img2img for a collaboration with AI in real time

Updated 3.5K runs

Updated 49K runs

Fast animation using a latent consistency model

Updated 30.7K runs

Fast video2video with a latent consistency model

Updated 2.4K runs

Updated 60.5K runs

Removes background

Updated 9.8K runs

Source: NousResearch/Obsidian-3B-V0.5 ✦ World's smallest multimodal LLM

Updated 119 runs

Source: PocketDoc/Dans-AdventurousWinds-Mk2-7b ✦ Quant: TheBloke/Dans-AdventurousWinds-Mk2-7B-AWQ ✦ This model is proficient in crafting text-based adventure games

Updated 132 runs

Source: Intel/neural-chat-7b-v3-1 ✦ Quant: TheBloke/neural-chat-7B-v3-1-AWQ ✦ Fine-tuned model based on mistralai/Mistral-7B-v0.1

Updated 777 runs

Animate Your Personalized Text-to-Image Diffusion Models with SDXL and LCM

Updated 332 runs

Text to video diffusion model with variable length frame conditioning for infinite length video

Updated 423 runs

Dreamshaper-7 img2img with LCM LoRA for faster inference

Updated 55.2K runs

Updated 313 runs

An auto-regressive causal LM created by combining 2x finetuned Llama-2 70B into one.

Updated 237.5K runs

RealvisXL-v2.0 with LCM LoRA - requires fewer steps (4 to 8 instead of the original 40 to 50)

Updated 293.1K runs

Updated 40 runs

Take a list of image URLs as frames and output a video

Updated 1.2K runs

Automatically fuses a user's face onto a template image so the result resembles the user

Updated 12.5K runs

Generate 3D assets using text descriptions

Updated 1K runs

Detects objects in an image

Updated 1.6K runs

Text to image prompt

Updated 1.1K runs

Create song covers with any RVC v2 trained AI voice from audio files.

Updated 829.6K runs

Ultimate SD Upscale with ControlNet Tile

Updated 161.9K runs

A combination of ip_adapter SDv1.5 and mediapipe-face to inpaint a face

Updated 4K runs

The Yi series models are large language models trained from scratch by developers at 01.AI.

Updated 1.7K runs

The Yi series models are large language models trained from scratch by developers at 01.AI.

Updated 3.7K runs

The Yi series models are large language models trained from scratch by developers at 01.AI.

Updated 161.1K runs

Creates an SD illusion from drawing + adds depth

Updated 330 runs

Custom improvements, such as a custom callback to enhance inference | It's a WIP and may cause some wrong outputs

Updated 1.3K runs

An extremely fast all-in-one model to use LCM with SDXL, ControlNet and custom LoRA URLs!

Updated 14.8K runs

Create variations of an uploaded image. Please see README for more details

Updated 1.1K runs

Whisper is a general-purpose speech recognition model.

Updated 4.4K runs