Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Run FLUX.1 with lora and controlnet

Updated 619 runs

🎨 Fill in masked parts of images with FLUX.1-dev 🖌️

Updated 353.5K runs

Cubiq's ComfyUI InstantID node running `instantid_basic.json` example

Updated 1.6K runs

Ostris AI-Toolkit for Flux LoRA Training (DEPRECATED. Please use: ostris/flux-dev-lora-trainer)

Updated 56.5K runs

Create a music by an image

Updated 466 runs

simple pdf to text from a url using tesseract

Updated 1.4K runs

🎨 Fill in masked parts of images with FLUX.1-schnell 🖌️

Updated 6.3K runs

Paint me some salty seadogs, ye bilge rat!

Updated 167 runs

Idefics3-8B-Llama3, Answers questions and caption about images

Updated 2.1K runs

AnimateDiff text to video from your imagination!

Updated 235 runs

SAM 2: Segment Anything v2 (for videos)

Updated 4.1K runs

Voice conversion with soft speech units

Updated 21 runs

A wrapper model for captioning multiple images using GPT, Claude or Gemini, useful for lora training

Updated 1.2K runs

Updated 954 runs

Topic classification in tweets/texts

Updated 15 runs

FLUX.1-dev with XLabs-AI’s realism lora

Updated 856K runs

Image to image face swapping

Updated 319.3K runs

Sentiment analysis or classification in tweets/texts

Updated 20 runs

aura-sr-v2 model with a 100

Updated 563 runs

UPDATE: new upscaling algorithm for a much improved image quality. Fermat.app open-source implementation of an efficient ControlNet 1.1 tile for high-quality upscales. Increase the creativity to encourage hallucination.

Updated 618K runs

Updated 6.3K runs

FLUX.1-Dev LoRA trainer via SimpleTuner (Work in Progress)

Updated 103 runs

Embed text with Qwen2-7b-Instruct

Updated 637.8K runs

live portrait with audio

Updated 25.2K runs

ECCV2022 Quick background removal

Updated 251 runs

"MusiConGen: Rhythm and chord control for Transformer-based text-to-music generation"

Updated 118 runs

A ready to use image to image workflow of flux

Updated 53.8K runs

Updated 102.6K runs

Gemma2 2b Instruction-tuned variant by Google

Updated 16.2K runs

Gemma2 2b by Google

Updated 34K runs

AuraSR v2: Second-gen GAN-based Super-Resolution for real-world applications

Updated 9.9K runs

MT3: Multi-Task Multitrack Music Transcription

Updated 96 runs

Generate high resolution image

Updated 1.3K runs

Segment Anything 2 (SAM2) by Meta - Automatic mask generation

Updated 22.1K runs

SAM 2: Segment Anything v2 (for Images)

Updated 12.3K runs

Real-ESRGAN Upscale with AI Face Correction

Updated 516.7K runs

Change the background on an image and relight the scene.

Updated 1.5K runs

LLaMA 3.1-8B, finetuned on a synthetic OCR dataset for superior OCR correction.

Updated 37 runs

Updated 100 runs

multilingual-e5-large-instruct: A multi-language text embedding model with custom query instructions.

Updated 33.3K runs

moondream2 is a small vision language model designed to run efficiently on edge devices

Updated 384.4K runs

Generate panoramic image based on text prompts or image.

Updated 369 runs

Huggingface Diffusers: SDv1.4/1.5/2.0/2.1 finetuner

Updated 15 runs

Combines inpainting and outpainting for image editing.

Updated 265 runs

Inpaint a selected area of an image using controlnet union for SDXL.

Updated 297 runs

Outpaint an image using controlnet union for SDXL.

Updated 6.4K runs

Bilateral Reference for High-Resolution Dichotomous Image Segmentation (CAAI AIR 2024)

Updated 2.3M runs

A fully open-sourced, large flow-based text-to-image generation model

Updated 413 runs

LLM-powered applications are susceptible to prompt attacks, which are prompts intentionally designed to subvert the developer’s intended behavior of the LLM

Updated 27 runs