Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Würstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models

Updated 71 runs

AuraFlow: Fully open-sourced flow-based text-to-image generation model

Updated 120 runs

AuraFlow: Fully open-sourced flow-based text-to-image generation model

Updated 21 runs

AuraFlow: Fully open-sourced flow-based text-to-image generation model

Updated 18 runs

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator

Updated 15 runs

Trajectory Consistency Distillation: Improved Latent Consistency Distillation by Semi-Linear Consistency Function with Trajectory Mapping

Updated 21 runs

SDXL-Lightning: Progressive Adversarial Diffusion Distillation

Updated 37 runs

Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Updated 34 runs

Improved Distribution Matching Distillation for Fast Image Synthesis

Updated 44 runs

Adversarial Diffusion Distillation

Updated 91 runs

Updated 4 runs

Phased Consistency Model

Updated 30 runs

Latent Consistency Models: Synthesizing High-Resolution Images with Few-step Inference

Updated 124 runs

Template Project for Running Fast Video Effects all on the GPU with fast GPU encoding and Decoding

Updated 16 runs

Uses DINO to detect regions and further refines them with SAM. Returns masking data as RLE encoded JSON.

Updated 215 runs

2DN XL Model (Text2Img, Img2Img and Inpainting)

Updated 230 runs

Mistoon Anime XL Model (Text2Img, Img2Img and Inpainting)

Updated 18K runs

Realism XL Model (Text2Img, Img2Img and Inpainting)

Updated 197.6K runs

Detects if a picture has anime face.

Updated 28K runs

Babes XL Model (Text2Img, Img2Img and Inpainting)

Updated 4.8K runs

The current model is used for graphics replacement processing

Updated 627.1K runs

Upload an image or video, and Video-LLaVa will give you a text description of what it "sees."

Updated 96 runs

without examination qwen2.5 32b

Updated 219 runs

FLUX.1 [dev] (LoRA) with several optimizations such as FP8 Quantization

Updated 76 runs

Clean Text from Manhwa/Manhua

Updated 8 runs

# Interior Decoration Space Scaling - Second Use Case

Updated 70 runs

a model to get images

Updated 276 runs

Updated 416 runs

This model is used to generate speech

Updated 34 runs

A F5-TTS fine-tuned for Spanish

Updated 486 runs

Updated 11 runs

Updated 24 runs

Updated 21 runs

Dreamlike Diffusion Model for Splurge Art

Updated 2.4K runs

From Sketch to Reality: Transforming Outlines into Lifelike Images

Updated 47.4K runs

baby transformer for blog post

Updated 30 runs

staging testing

Updated 255 runs

Document translation with contextual integrity.

Updated 57 runs

Align text to audio with exact word timings. All characters supported!

Updated 111.8K runs

Projection module trained to add vision capabilties to Llama 3 using SigLIP

Updated 5.5K runs

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Updated 22 runs

Apple's monocular depth estimation foundation model (Depth Pro)

Updated 1.6K runs

OmniGen: Unified Image Generation

Updated 12K runs

Create audio clips from text

Updated 8 runs

Explorador FLUX.1-Dev LoRA

Updated 85 runs

Updated 59 runs

Fine-tune StableDiffusion3.5-Large with Hugging Face Diffusers

Updated 544 runs

Updated 24 runs

Run any python code

Updated 6.4K runs

Ostris AI-Toolkit for StableDiffusion3.5-Large LoRA Training

Updated 263 runs