Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Source: gorilla-llm/gorilla-openfunctions-v1 ✦ Quant: TheBloke/gorilla-openfunctions-v1-AWQ ✦ Extend Large Language Model (LLM) Chat Completion feature to formulate executable APIs call given natural language instructions and API context

Updated 417 runs

Updated 792 runs

Dreamshaper superfast generation

Updated 343 runs

Real-ESRGAN Video Upscaler

Updated 206.5K runs

Caption an audio

Updated 23 runs

Fooocus API based endpoint

Updated 2.9K runs

Anything V4.5 Model (Text2Img, Img2Img and Inpainting)

Updated 1M runs

DreamShaper V8 Model (Text2Img, Img2Img and Inpainting)

Updated 6.7K runs

Realistic Vision V4.0 Model (Text2Img, Img2Img and Inpainting)

Updated 34.6K runs

CogVLM is a powerful open-source visual language model (VLM)

Updated 12.4K runs

The Yi series models are large language models trained from scratch by developers at 01.AI.

Updated 320.1K runs

The Yi series models are large language models trained from scratch by developers at 01.AI.

Updated 6.4K runs

Image-to-video - SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Updated 12.9K runs

Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model

Updated 933 runs

Genshin Impact Image SD

Updated 6.1K runs

Controllable Text-to-Music Generation

Updated 8.6K runs

Multi-Controlnet + consistency-decoder + INPAINTING + realestic-vision-v5 + Prompt-Weight + Single-Controlnet

Updated 3.4K runs

controlnet-lineart-brightness-tile-inpainting + low res fix with tile

Updated 708 runs

Mockup generator (bags, t-shirts, mugs, billboard etc) using Stable Diffusion in-painting

Updated 238 runs

WhisperX model for spanish language.

Updated 44.8K runs

High-Quality Video Generation with Cascaded Latent Diffusion Models

Updated 13.9K runs

Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

Updated 60.3M runs

BAAI/bge-reranker-large Model with fp16

Updated 285 runs

BAAI/bge-reranker-base Model with fp16

Updated 8.1K runs

Create your own Realistic Voice Cloning (RVC v2) dataset using a YouTube link

Updated 8.8K runs

Performs speaker identity verification

Updated 77 runs

Answers questions about images

Updated 29.8M runs

Simple version of https://huggingface.co/teknium/OpenHermes-2-Mistral-7B

Updated 242 runs

Very fast img2img for a collaboration with AI in real time

Updated 3.5K runs

Updated 49K runs

Fast animation using a latent consistency model

Updated 30.6K runs

Fast video2video with a latent consistency model

Updated 2.4K runs

Updated 60.4K runs

Removes background

Updated 9.7K runs

Source: NousResearch/Obsidian-3B-V0.5 ✦ Worlds smallest multi-modal LLM

Updated 117 runs

Source: PocketDoc/Dans-AdventurousWinds-Mk2-7b ✦ Quant: TheBloke/Dans-AdventurousWinds-Mk2-7B-AWQ ✦ This model is proficient in crafting text-based adventure games

Updated 130 runs

Source: Intel/neural-chat-7b-v3-1 ✦ Quant: TheBloke/neural-chat-7B-v3-1-AWQ ✦ Fine-tuned model based on mistralai/Mistral-7B-v0.1

Updated 774 runs

Animate Your Personalized Text-to-Image Diffusion Models with SDXL and LCM

Updated 331 runs

Text to video diffusion model with variable length frame conditioning for infinite length video

Updated 418 runs

Dreamshaper-7 img2img with LCM LoRA for faster inference

Updated 55.1K runs

Updated 312 runs

An auto-regressive causal LM created by combining 2x finetuned Llama-2 70B into one.

Updated 237.5K runs

Text to image prompt

Updated 1.1K runs

RealvisXL-v2.0 with LCM LoRA - requires fewer steps (4 to 8 instead of the original 40 to 50)

Updated 292.6K runs

Updated 39 runs

Take a list of image URLs as frames and output a video

Updated 1.1K runs

Auto fuse a user's face onto the template image, with a similar appearance to the user

Updated 12K runs

Generate 3D assets using text descriptions

Updated 1K runs

Detects objects in an image

Updated 1.5K runs