Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects.

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images.

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

album cover generator

Updated 983 runs

T5 model fine-tuned on a GPT-3.5-generated paraphrase corpus of 6.3 million unique sentences.

Updated 4.3K runs

Image Inpainting

Updated 4K runs

Consistent-view characters with ControlNet and Stable Diffusion, fine-tuned on Ready Player Me characters and based on OpenJourneyV4

Updated 984 runs

3B parameter base version of Stability AI's language model

Updated 431 runs

Object detector using YOLO

Updated 646 runs

datasets: Flickr8k

Updated 11.2K runs

The DeepFloyd IF model was initially released as a non-commercial, research-only model. Please make sure you read and abide by the license before using it.

Updated 2M runs

🔊 Text-Prompted Generative Audio Model

Updated 298.4K runs

Whisper transcription plus speaker diarization

Updated 28K runs

Try out the Segment Anything Model (SAM) by Meta.

Updated 69.1K runs

image captioning

Updated 1.3K runs

Generate image prompts for Midjourney. Prefix inputs with "Image: "

Updated 54K runs

Try out the Segment Anything Model (SAM) by Meta.

Updated 2.7K runs

🔊 Text-Prompted Generative Audio Model

Updated 1.1K runs

MiniGPT-4 w/ Vicuna-7B (Image Question/Captioning Use)

Updated 9.9K runs

MiniGPT-4 w/ Vicuna-13B (Image Question/Captioning Use)

Updated 52K runs

Custom model trained with DreamBooth

Updated 228 runs

Train your own custom Stable Diffusion model using a small set of images

Updated 295.8K runs

LoRA & OpenJourney V4

Updated 18.8K runs

7 billion parameter version of Stability AI's language model

Updated 140.5K runs

Extract structured data from receipt images using Donut 🍩 (Document Understanding Transformer)

Updated 2.1K runs

A Stable Diffusion model trained on pictures of my buddy Toshiro (truly the best boy there is)

Updated 109 runs

Nightly release of ControlNet 1.1

Updated 8.2K runs

openai/whisper with exposed settings for word_timestamps

Updated 1.5M runs

A language model by Google for tasks like classification, summarization, and more

Updated 150.7K runs

SD 1.5 trained with +124k MJv4 images by PromptHero

Updated 249.7K runs

Helps you with work

Updated 1.3K runs

Adds semantic labels to Segment Anything

Updated 29.9K runs

ControlNet with SD 2.1

Updated 17.2K runs

llama-7b trained on the Memory Alpha Star Trek Wiki

Updated 128 runs

Sketch2Image

Updated 212 runs

观照AI

Updated 5.1K runs

Get image quality scores

Updated 1.4K runs

The Picsart Text2Video-Zero model leverages the power of existing text-to-image synthesis methods (e.g., Stable Diffusion), making them suitable for the video domain.

Updated 2.2K runs

gpt-j-6b trained on the Memory Alpha Star Trek Wiki

Updated 436 runs

The Picsart Text2Video-Zero model leverages the power of existing text-to-image synthesis methods (e.g., Stable Diffusion), making them suitable for the video domain.

Updated 13.3K runs

flan-t5-xl trained on the Memory Alpha Star Trek Wiki

Updated 132 runs

Stable Diffusion Meets Karlo: a combination of the Karlo CLIP image embedding prior and Stable Diffusion v2.1-768.

Updated 996 runs

Sketch2Image

Updated 306 runs