Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model on a small set of example images so it can recognize and generate new concepts such as specific styles, characters, or objects. Training is fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Official models

Official models are always on, maintained, and have predictable pricing.

I want to…

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Latest models

Qwen-VL-Chat but with raw ChatML prompt interface and streaming

Updated 1.1K runs

Content AI detector

Updated 39.7K runs

Huggingface-sdxl-inpainting is an SDXL model fine-tuned for image inpainting, with inputs and outputs having the same height and width.

Updated 2.7K runs

Updated 168 runs

Fuyu-8B is a multi-modal text and image transformer trained by Adept AI

Updated 4.6K runs

Separate Anything You Describe

Updated 4.8K runs

🎨AnimateDiff Prompt Travel🧭 Seamlessly Navigate and Animate Between Text-to-Image Prompts for Dynamic Visual Narratives

Updated 5.7K runs

A simple OCR Model that can easily extract text from an image.

Updated 89.9M runs

Updated 346 runs

Video object segmentation for short and long videos

Updated 106 runs

Open diffusion model for high-quality video generation

Updated 10.6K runs

Synthesizing High-Resolution Images with Few-Step Inference

Updated 1.1M runs

Batch mode for text & image embeddings

Updated 69 runs

Tuning-free Higher-Resolution Visual Generation with Diffusion Models

Updated 1.1K runs

Generate videos from text prompts with Kandinsky-2.2

Updated 7.3K runs

SAM (Segment Anything) ViT-H image encoder

Updated 358K runs

Whisper-Large-V2 + Pyannote 3.0 diarization via WhisperX

Updated 107 runs

Anime Pastel Dream Model For Splurge Art

Updated 5.8K runs

Neurogen Model for Splurge Art

Updated 4.9K runs

Dreamlike Anime 1.0 for Splurge Art

Updated 6.4K runs

Dreamlike Photoreal Model for Splurge Art

Updated 3.4K runs

Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Updated 985 runs

TheBloke/Nous-Hermes-Llama2-AWQ served with vLLM

Updated 7.4K runs

Gender recognition for audio files

Updated 4.6K runs

cog-resnet example trial

Updated 10 runs

Make a transcription of a phone call

Updated 15 runs

Trained on plants

Updated 28 runs

My own personal copy of daanelson/whisperx

Updated 313 runs

A multimodal LLM-based AI assistant, which is trained with alignment techniques. Qwen-VL-Chat supports more flexible interaction, such as multi-round question answering, and creative capabilities.

Updated 825.3K runs

Mistral-7B-v0.1 fine-tuned for chat with the OpenOrca dataset.

Updated 65.9K runs

Using a ComfyUI workflow to run SDXL text2img

Updated 450 runs

Zero-shot / open vocabulary object detection

Updated 24.4K runs

A high-performing language model trained to act as a helpful assistant

Updated 8K runs

Controlling Vision-Language Models for Universal Image Restoration

Updated 2.2K runs

✨DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Updated 134.8K runs

Updated 106 runs

Object removal, video completion and video outpainting

Updated 31.1K runs

Updated 25 runs

Instruction tuned text-to-image diffusion models as vision generalists

Updated 358 runs

📽️ Increase Framerate 🎬 ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation

Updated 52.2K runs

Qwen-14B-Chat is a Transformer-based large language model pretrained on a large volume of data, including web text, books, and code.

Updated 5.4K runs

Embedding models that have been trained using Jina AI's Linnaeus-Clean dataset.

Updated 36 runs

Updated 208 runs

Stylized Audio-Driven Single Image Talking Face Animation

Updated 19.1K runs

Updated 77 runs

Text-to-gif using SDXL, with controlnet and lora support

Updated 3.7K runs

Hotshot XL uses SDXL to generate high-quality one-second clips. Runs on an A40. Made by the greats at hotshot.co and brought to you by your friends at FullJourney. Thanks to LucaTaco for the MVP!

Updated 4.4K runs

🎨 AnimateDiff (w/ MotionLoRAs for Panning, Zooming, etc): Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

Updated 58K runs

Image restoration and face enhancement

Updated 20K runs