Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects.

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images.

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Source: Arc53/docsgpt-7b-mistral ✦ Quant: TheBloke/docsgpt-7B-mistral-AWQ ✦ DocsGPT is optimized for Documentation (RAG), fine-tuned for providing answers that are based on context

Updated 75 runs

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration

Updated 3.3M runs

Better than SDXL at both prompt adherence and image quality, by dataautogpt3

Updated 131.7K runs

Automatically add captions to a video

Updated 37.9K runs

Updated 262 runs

The Replicate version of singing_voice_conversion from Amphion

Updated 560 runs

Animagine XL 2.0 is an advanced latent text-to-image diffusion model designed to create high-resolution, detailed anime images.

Updated 9.3K runs

ElasticDiffusion: Training-free Arbitrary Size Image Generation

Updated 170 runs

Super High Quality Depth Maps 🗺️: An End-to-End Tile-Based Framework 🏗️ for High-Resolution Monocular Metric Depth Estimation 🔍📏

Updated 366 runs

A unique fusion that showcases exceptional prompt adherence and semantic understanding. It seems to be a step above base SDXL and a step closer to DALLE-3 in terms of prompt comprehension.

Updated 126.1K runs

Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer

Updated 178 runs

Nous Hermes 2 - Yi-34B is a state-of-the-art Yi fine-tune, trained on GPT-4-generated synthetic data

Updated 11.4K runs

Terminus XL Otaku is a latent diffusion model that uses a zero-terminal SNR noise schedule and a velocity prediction objective at training and inference time.

Updated 40 runs

Works with inpainting, with multi-ControlNet or single-ControlNet, and with or without IP-Adapter

Updated 23.8K runs

Rethinking the Role of UNet Encoder in Diffusion Models

Updated 133 runs

Updated 215.4K runs

Terminus XL Gamma is a new state-of-the-art latent diffusion model that uses a zero-terminal SNR noise schedule and a velocity prediction objective at training and inference time.

Updated 279 runs

Source: SuperAGI/SAM ✦ Quant: TheBloke/SAM-AWQ ✦ SAM (Small Agentic Model), a 7B model that demonstrates impressive reasoning abilities despite its smaller size

Updated 77 runs

Multi-controlnet, lora loading, img2img, inpainting

Updated 210.7K runs

High-Fidelity Text-to-3D Generation via Interval Score Matching

Updated 71 runs

Try out akx/Poro-34B-gguf (Q5_K). This is the 1000B checkpoint model.

Updated 24 runs

Amphion Singing Voice Conversion: DiffWaveNetSVC

Updated 890 runs

DreamBooth safetensors model using RealVisXL

Updated 753 runs

Amazing photorealism with RealVisXL_V3.0, based on SDXL, trainable

Updated 723.9K runs

Cog implementation of mir-aidj(Taejun Kim)'s 'All-In-One Music Structure Analyzer'

Updated 18K runs

(Research only) IP-Adapter-FaceID can generate various style images conditioned on a face with only text prompts

Updated 30K runs

DreamShaper is a general purpose SD model that aims at doing everything well, photos, art, anime, manga. It's designed to go against other general purpose models and pipelines like Midjourney and DALL-E.

Updated 1.4K runs

DPO-SDXL Canny controlnet with LoRA support.

Updated 760 runs

Segment Anything MASK

Updated 1.2K runs

DreamShaper is a general purpose SD model that aims at doing everything well, photos, art, anime, manga. It's designed to match Midjourney and DALL-E.

Updated 214.3K runs

Direct Preference Optimization (DPO) is a method for aligning text-to-image diffusion models with human preferences by directly optimizing on human comparison data

Updated 2.2K runs

auto1111_ds8

Updated 61.8K runs

FacebookResearch/SeamlessM4T v2 - Massively Multilingual & Multimodal Machine Translation

Updated 796 runs

Updated 364 runs

The "Overall Best Performing Open Source 7B Model" for Coding + Generalization or Mathematical Reasoning

Updated 26.2K runs

Updated 64 runs

Source: kaist-ai/prometheus-13b-v1.0 ✦ Quant: TheBloke/prometheus-13B-v1.0-AWQ ✦ An alternative to GPT-4 when evaluating LLMs & Reward models for RLHF

Updated 54K runs

Source: OpenBuddy/openbuddy-zephyr-7b-v14.1 ✦ Quant: TheBloke/openbuddy-zephyr-7B-v14.1-AWQ ✦ Open Multilingual Chatbot

Updated 31 runs

AnimateDiff v3 + SparseCtrl: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning. Created with Shimmer.

Updated 680 runs

Blue Pencil XL v2 Model (Text2Img, Img2Img and Inpainting)

Updated 301.9K runs

SDXL image generation using ComfyUI with a LoRA trained using the DreamBooth method.

Updated 209 runs

Korean version of llava-v1.5

Updated 65 runs

Stable Diffusion x4 upscaler model

Updated 7.4K runs

Speaker diarisation

Updated 31 runs

Source: upstage/SOLAR-10.7B-Instruct-v1.0 ✦ Quant: TheBloke/SOLAR-10.7B-Instruct-v1.0-AWQ ✦ Elevating Performance with Upstage Depth UP Scaling!

Updated 4.1K runs

an autocomplete api that runs on the cpu :)

Updated 19.5K runs

Monocular depth estimation

Updated 8K runs

AI-driven audio enhancement for your audio files, powered by Resemble AI

Updated 101.9K runs

Zero-shot speech synthesizer for text-to-speech and voice conversion

Updated 4.6K runs