Explore

I want to…

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Latest models

Better than SDXL at both prompt adherence and image quality, by dataautogpt3

Updated 123.6K runs

Automatically add captions to a video

Updated 22.2K runs

Updated 259 runs

This is the Replicate version of singing_voice_conversion from Amphion

Updated 501 runs

Animagine XL 2.0 is an advanced latent text-to-image diffusion model designed to create high-resolution, detailed anime images.

Updated 7.3K runs

ElasticDiffusion: Training-free Arbitrary Size Image Generation

Updated 168 runs

Super High Quality Depth Maps 🗺️: An End-to-End Tile-Based Framework 🏗️ for High-Resolution Monocular Metric Depth Estimation 🔍📏

Updated 312 runs

A unique fusion that showcases exceptional prompt adherence and semantic understanding. It seems to be a step above base SDXL and a step closer to DALL-E 3 in terms of prompt comprehension

Updated 108.4K runs

Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer

Updated 172 runs

Nous Hermes 2 - Yi-34B is a state-of-the-art Yi fine-tune, trained on GPT-4-generated synthetic data

Updated 9.6K runs

Terminus XL Otaku is a latent diffusion model that uses a zero-terminal SNR noise schedule and a velocity prediction objective at training and inference time.

Updated 38 runs

Works with inpainting, multi-ControlNet or single-ControlNet, with or without IP-Adapter

Updated 23.4K runs

Rethinking the Role of UNet Encoder in Diffusion Models

Updated 132 runs

Terminus XL Gamma is a new state-of-the-art latent diffusion model that uses a zero-terminal SNR noise schedule and a velocity prediction objective at training and inference time.

Updated 268 runs

Fine-tune of MusicGen with tracks from my record label, Dream In Audio.

Updated 127 runs

Source: SuperAGI/SAM ✦ Quant: TheBloke/SAM-AWQ ✦ SAM (Small Agentic Model), a 7B model that demonstrates impressive reasoning abilities despite its smaller size

Updated 76 runs

RealVisXL_V3.0 fine-tuned on 300+ colorized photos from the 1850s-1940s

Updated 220 runs

Updated 129 runs

RealVisXL_V3.0, img-to-emoji

Updated 1.8K runs

RealVisXL_V3.0, fine-tuned on Apple's emojis

Updated 1.5K runs

SDXL fine-tuned on Simpsons images generated with MJv6

Updated 10.6K runs

Multi-ControlNet, LoRA loading, img2img, inpainting

Updated 202K runs

High-Fidelity Text-to-3D Generation via Interval Score Matching

Updated 70 runs

Try out akx/Poro-34B-gguf, Q5_K. This is the 1000B checkpoint model

Updated 23 runs

DeepFake AI

Updated 51.9K runs

Amphion Singing Voice Conversion: DiffWaveNetSVC

Updated 708 runs

DreamBooth safetensors model using RealVisXL

Updated 750 runs

Amazing photorealism with RealVisXL_V3.0, based on SDXL, trainable

Updated 595.5K runs

Cog implementation of mir-aidj (Taejun Kim)'s 'All-In-One Music Structure Analyzer'

Updated 4.7K runs

Ugly Sweaters: The only garment that screams "Fashion? Never heard of it."

Updated 103 runs

(Research only) IP-Adapter-FaceID can generate images in various styles conditioned on a face, using only text prompts

Updated 28K runs

Updated 2.4K runs

DreamShaper is a general-purpose SD model that aims at doing everything well: photos, art, anime, manga. It's designed to compete with other general-purpose models and pipelines like Midjourney and DALL-E.

Updated 1.4K runs

DPO-SDXL Canny ControlNet with LoRA support.

Updated 557 runs

Updated 89 runs

Segment Anything MASK

Updated 1.1K runs

Creates Santa hats like it's a holiday party at the North Pole

Updated 130 runs

DreamShaper is a general-purpose SD model that aims at doing everything well: photos, art, anime, manga. It's designed to match Midjourney and DALL-E.

Updated 180.2K runs

Direct Preference Optimization (DPO) is a method to align diffusion models to human preferences by directly optimizing on human comparison data

Updated 2.2K runs

auto1111_ds8

Updated 61.8K runs

FacebookResearch/SeamlessM4T v2 - Massively Multilingual & Multimodal Machine Translation

Updated 744 runs

Updated 362 runs

Updated 234 runs

SDXL fine-tuned on Santa Hats

Updated 613 runs

The "Overall Best Performing Open Source 7B Model" for Coding + Generalization or Mathematical Reasoning

Updated 26.2K runs

Updated 62 runs

SDXL fine-tuned on cinematic shots

Updated 324 runs

Updated 208 runs

A model trained on images of United Therapeutics CEO Dr. Martine Rothblatt

Updated 20 runs

Updated 132 runs