Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Official models

Official models are always on, maintained, and have predictable pricing.

I want to…

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Latest models

Upscale | Enhancer | Ultra-Resolution | Restoration

Updated 1.8K runs

Bring your subjects into focus with FLUX.1 Kontext [pro]

Updated 1.2K runs

Give the model an image and it will attempt to generate a copy using Claude and Flux dev

Updated 22 runs

Pure compute-driven AI workflow that removes flaws, enriches depth, refines detail, and upscales output 2x or 4x into unmatched photorealistic quality. Read the description. Runtime: 4 minutes.

Updated 600 runs

Convert any image to a Hilbert space-filling curve effect.

Updated 32 runs

Leonardo AI’s first foundational model produces images up to 5 megapixels (fast, quality and ultra modes)

Updated 8K runs

Create 5s 480p videos from a text prompt

Updated 4.1K runs

Vectorized dot grid - by Brett from Designjoy

Updated 164 runs

RealVisXL V3 with multi-ControlNet, LoRA loading, img2img, inpainting, misto anyline ControlNet

Updated 67 runs

This is the VACE-1.3B model optimised with Pruna AI. Wan2.1 VACE is an all-in-one model for video creation and editing.

Updated 399 runs

Image editing with Flux-dev model

Updated 50 runs

Optimizes audio files with speech

Updated 36 runs

Lyria 2 is a music generation model that produces 48kHz stereo audio through text-based prompts

Updated 7.4K runs

Generate 5s and 9s 720p videos, faster and cheaper than Ray 2

Updated 20.9K runs

Generate 5s and 9s 720p videos

Updated 21.7K runs

Generate 5s and 9s 540p videos

Updated 9.2K runs

Fast, high-quality text-to-video and image-to-video (also known as Dream Machine)

Updated 42.7K runs

🥯ByteDance Seed's Bagel: a unified multimodal AI that generates images, edits images, and understands images in one 7B parameter model🥯

Updated 42.3K runs

State of the art video generation model. Veo 2 can faithfully follow simple and complex instructions, and convincingly simulates real-world physics as well as a wide range of visual styles.

Updated 84.8K runs

Transform PDFs into AI podcasts for engaging on-the-go audio content.

Updated 341 runs

Quickly make 5s or 8s videos at 540p, 720p or 1080p. It has enhanced motion, prompt coherence and handles complex actions well.

Updated 59K runs

Quickly generate smooth 5s or 8s videos at 540p, 720p or 1080p

Updated 19.6K runs

MEMO is a state-of-the-art open-weight model for audio-driven talking video generation.

Updated 825 runs

A fast, local neural text to speech system that sounds great and is optimized for the Raspberry Pi 4. Piper is used in a variety of projects.

Updated 43 runs

A speech-to-text model that uses GPT-4o to transcribe audio

Updated 1.3K runs

A speech-to-text model that uses GPT-4o mini to transcribe audio

Updated 252 runs

🧼Upscales faces in videos to look clearer and better using KEEP, Kalman-Inspired Feature Propagation for Video Face Super-Resolution🫟

Updated 76 runs

A small model alternative to o1

Updated 192 runs

Real-ESRGAN Video Upscaler

Updated 235.3K runs

Granite-Embedding-278M-Multilingual is a 278M parameter model from the Granite Embeddings suite that can be used to generate high quality text embeddings

Updated 998 runs

Photomaker V1 optimized with Lightning 8steps

Updated 487 runs

Revival of https://github.com/pollinations/stable-diffusion-audio-reactive

Updated 7 runs

The original classic DALL·E 2

Updated 184 runs

An AI system that can create realistic images and art from a description in natural language.

Updated 1.4K runs

Color match and white balance fixes for images

Updated 27.6K runs

A powerful 3D asset generation model

Updated 194.2K runs

🕹️FramePack: video diffusion that feels like image diffusion🎥

Updated 2K runs

A step towards a music generation foundation model (text2music)

Updated 6.2K runs

This is a Pruna-optimised version of the FLUX.1 [dev] model.

Updated 40.8K runs

DiT-based 13B video generation model, creating 30fps video

Updated 1.7K runs

👗ByteDance's DreamO: unified image customization model (IP, ID, Style, Try-On, etc.)🧣

Updated 795 runs

Models fine-tuned from NoobAI-XL/Illustrious-XL series.

Updated 34K runs