Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Generate panoramic images with text prompts

Updated 120 runs

Locality-enhanced Projector for Multimodal LLM

Updated 25 runs

A 70 billion parameter Llama tuned for coding with Python

Updated 1.1K runs

a family of multimodal small language models

Updated 69 runs

image to 3D

Updated 40 runs

Undi95's FlatDolphinMaid 8x7B Mixtral Merge, GGUF Q5_K_M quantized by TheBloke.

Updated 414.4K runs

InstantID : Zero-shot Identity-Preserving Generation in Seconds with ⚡️LCM-LoRA⚡️. Using AlbedoBase-XL v2.0 as base model.

Updated 177.9K runs

Take an image and an audio file and create a video clip

Updated 3.5K runs

amrul-hzz's fine-tuned version of vit-base-patch16-224-in21k for watermark image detection

Updated 335 runs

Proteus v0.2 Model (Text2Img, Img2Img and Inpainting)

Updated 13.6K runs

Many models: RealVisXL, Juggernaut, Proteus, DreamShaper, etc.

Updated 10.6K runs

Runs Mixtral 8x7B on a single A40 GPU

Updated 61 runs

Remix the music into another styles with MusicGen Chord

Updated 15.3K runs

(Research only) Moondream1 is a vision language model that performs on par with models twice its size

Updated 11.4K runs

Proteus v0.2 shows subtle yet significant improvements over Version 0.1. It demonstrates enhanced prompt understanding that surpasses MJ6, while also approaching its stylistic capabilities.

Updated 9.7M runs

Tiny vision language model

Updated 306 runs

Undi95's Borealis 10.7B Mistral DPO Finetune, GGUF Q5_K_M quantized by Undi95.

Updated 75 runs

InstantID : Zero-shot Identity-Preserving Generation in Seconds. Using Juggernaut-XL v8 as the base model to encourage photorealism

Updated 41.7K runs

InstantID : Zero-shot Identity-Preserving Generation in Seconds. Using Dreamshaper-XL as the base model to encourage artistic generations

Updated 9.3K runs

Generate song ideas!

Updated 584 runs

Highly practical solution for robust monocular depth estimation by training on a combination of 1.5M labeled images and 62M+ unlabeled images

Updated 11.7K runs

SigLIP proposes to replace the loss function used in CLIP by a simple pairwise sigmoid loss

Updated 143 runs

WizardCoder: Empowering Code Large Language Models with Evol-Instruct

Updated 17K runs

Consistent Diffusion Features for Consistent Video Editing

Updated 2K runs

Nebul.Redmond - Stable Diffusion SD XL Finetuned Model

Updated 16.9K runs

Create photos, paintings and avatars for anyone in any style within seconds. (Stylization version)

Updated 1.3M runs

Video Smoother: AMT All-Pairs Multi-Field Transforms for Efficient Frame Interpolation

Updated 16.5K runs

Updated 34.8K runs

Updated 85.5K runs

NeuralBeagle14-7B is (probably) the best 7B model you can find!

Updated 12.2K runs

Updated 271 runs

Source: SciPhi/Sensei-7B-V1 ✦ Quant: TheBloke/Sensei-7B-V1-AWQ ✦ Sensei is specialized in performing RAG over detailed web search results

Updated 36 runs

Source: WhiteRabbitNeo/WhiteRabbitNeo-13B-v1 ✦ TheBloke/WhiteRabbitNeo-13B-AWQ ✦ WhiteRabbitNeo is a model series that can be used for offensive and defensive cybersecurity

Updated 117 runs

PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

Updated 4.8K runs

Create photos, paintings and avatars for anyone in any style within seconds.

Updated 6.7M runs

Third party Fooocus replicate model with preset 'anime'

Updated 250.1K runs

Third party Fooocus replicate model with preset 'realistic'

Updated 652.4K runs

Third party Fooocus replicate model

Updated 1.4M runs

An Open Source text-to-speech system built by inverting Whisper

Updated 1.6K runs

Unofficial Re-Trained AnimateAnyone (Image + DWPose Video → Animated Video of Image)

Updated 861 runs

Updated 278 runs

Photorealism with RealVisXL V3.0 Turbo based on SDXL

Updated 237.1K runs

Image-Prompt Multi-view Diffusion for 3D Generation

Updated 1.5K runs

Implementation of Realistic Vision v5.1 to conjure up images of the potential baby using a single photo from each parent

Updated 2.7M runs

MAGNeT: Masked Audio Generation using a Single Non-Autoregressive Transformer

Updated 2.2K runs

ForgeSaga Landscape

Updated 110 runs

Manmaru mix v3.0

Updated 697 runs

Source: allenai/digital-socrates-13b ✦ Quant: TheBloke/digital-socrates-13B-AWQ ✦ Digital Socrates is an open-source, automatic explanation-critiquing model

Updated 18 runs

Source: Unbabel/TowerInstruct-7B-v0.1 ✦ Quant: TheBloke/TowerInstruct-7B-v0.1-AWQ ✦ This model is trained to handle several translation-related tasks, such as general machine translation, gramatical error correction, and paraphrase generation

Updated 189 runs

Improving the Stability of Diffusion Models for Content Consistent Super-Resolution

Updated 3.3K runs