Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

luma/ray-flash-2-720p

Generate 5s and 9s 720p videos, faster and cheaper than Ray 2

Updated 7.7K runs

luma/ray-flash-2-540p

Generate 5s and 9s 540p videos, faster and cheaper than Ray 2

Updated 6.4K runs

Updated 462 runs

Updated 165 runs

Updated 182 runs

Updated 146 runs

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

Updated 2.6K runs

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

Updated 804 runs

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

Updated 4.3K runs

Updated 622 runs

Updated 67 runs

PuLID-FLUX-v0.9.0

Updated 103 runs

Updated 368 runs

An experimental model for testing out different failure modes

Updated 46 runs

Photomaker V1 optimized with Lightning 8steps

Updated 112 runs

Inpainting and video2video experiments with Wan 2.1

Updated 181 runs

PNG Generation Model https://hipng.com/

Updated 75 runs

Updated 54.6K runs

Updated 84 runs

Updated 10.3K runs

Updated 305 runs

"DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion"

Updated 263 runs

Updated 28 runs

Microsoft Magma: A Foundation Model for Multimodal AI Agents

Updated 17 runs

Updated 51 runs

Updated 48 runs

Updated 51 runs

Updated 145 runs

Updated 273 runs

Updated 543 runs

Updated 65 runs

Updated 32 runs

Updated 91 runs

CogView-4 model, which has 6B parameters, supports native Chinese input, and Chinese text-to-image generation.

Updated 84 runs

QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning

Updated 233 runs

Updated 6.6K runs

Updated 150 runs

easel/advanced-face-swap

Face swap one or two people into a target image

Updated 1.5K runs

Updated 65 runs

Updated 80 runs

Updated 77 runs

ibm-granite/granite-3.2-8b-instruct

Updated 257.7K runs

ibm-granite/granite-vision-3.2-2b

Granite-Vision-3.2-2B is a compact and efficient vision-language model, specifically designed for visual document understanding.

Updated 29.1K runs

Updated 125 runs

Updated 227 runs

Updated 30.4K runs