Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

stability-ai/stable-diffusion-3.5-medium

2.5 billion parameter image model with improved MMDiT-X architecture

Updated 39K runs

Updated 660 runs

Analyzes music to determine song structure, bpm, downbeats, and demuxes audio

Updated 642 runs

Sayak Paul's cartoonizer, deployed to replicate. Here's the model: https://huggingface.co/instruction-tuning-sd/cartoonizer

Updated 171 runs

Updated 13 runs

flux.1-lite-8B-alpha by Freepik

Updated 320 runs

Updated 42 runs

Stable Diffusion 3.5 Large - LoRA Explorer

Updated 1.9K runs

One shot portrait maker.

Updated 29.1K runs

Remove Background of video and add yours

Updated 305 runs

Updated 415 runs

Updated 78 runs

Powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text prompt

Updated 130 runs

fancyfeast/joytag

Updated 18.3K runs

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching. Voice cloning

Updated 18.9K runs

Updated 99 runs

ChatTTS is a text-to-speech model designed specifically for dialogue scenarios such as LLM assistant.

Updated 129 runs

stability-ai/stable-diffusion-3.5-large-turbo

A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, with a focus on fewer inference steps

Updated 296.8K runs

stability-ai/stable-diffusion-3.5-large

A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, thanks to Query-Key Normalization.

Updated 1.1M runs

ideogram-ai/ideogram-v2-turbo

A fast image model with state of the art inpainting, prompt comprehension and text rendering.

Updated 1.8M runs

ideogram-ai/ideogram-v2

An excellent image model with state of the art inpainting, prompt comprehension and text rendering

Updated 1.1M runs

Lip Read silent videos with AI

Updated 147 runs

Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Updated 36 runs

Depth Any Video with Scalable Synthetic Data

Updated 155 runs

Check our flamel.app

Updated 1.9K runs

Tells the weather, given the name of a city

Updated 584 runs

Efficient Visual Generation with Hybrid Autoregressive Transformer

Updated 126 runs

CogvideoX Keyframe Interpolation by Zhengcong Fei

Updated 201 runs

Ollama Nemotron 70b

Updated 8.8K runs

OpenFLUX.1 (Beta v0.1.0), is a fine tune of the FLUX.1-Schnell model with the distillation trained out of it

Updated 659 runs

8-step distilled lora for FLUX.1-dev model released by the Alimama-Creative Team

Updated 1.7K runs

ibm-granite/granite-3.0-8b-instruct

Granite-3.0-8B-Instruct is a lightweight and open-source 8B parameter model is designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

Updated 181.4K runs

ibm-granite/granite-3.0-2b-instruct

Granite-3.0-2B-Instruct is a lightweight and open-source 2B parameter model designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

Updated 420.3K runs

FLUX.1 Dev with STOIQO NewReality

Updated 1.9K runs

Finer and Faster Text-to-Image Generation via Relay Diffusion

Updated 44 runs

Image similarity metric that that compares a reference image to a set of images.

Updated 303 runs

Suite of models to evaluate the image quality of text-to-image models with respect to their input prompts.

Updated 1.9K runs

FLUX.1-Dev LoRA Training (with 2x GPUs) by Huggingface Diffusers

Updated 108 runs

F5-TTS, the new state-of-the-art in open source voice cloning

Updated 23K runs

Enhancing Video Model Post-Training through Data, Reward, and Conditional Guidance Design

Updated 598 runs

FLUX.1 Dev with STOIQO NewReality and Amateur LoRA

Updated 497 runs

NSFW Erotic Novel AI Generation -NSFW Text (Data) Generator for Detecting 'NSFW' Text: Multilingual Experience

Updated 5.7K runs

Sharp Monocular Metric Depth in Less Than a Second

Updated 1.3K runs

Updated 172 runs

Reader-LM is a series of models that convert HTML content to Markdown content

Updated 72 runs

A test of an already existing sdxl cog project to study, try, share.

Updated 1.5K runs

Split one or multiple images into four equal parts

Updated 54 runs

Text-to-Video + Image-to-Video: Pyramid Flow Autoregressive Video Generation method based on Flow Matching

Updated 8.3K runs

FLUX.1-Dev LoRA Training by Huggingface Diffusers

Updated 184 runs