Explore

I want to…

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Latest models

Rethinking Inductive Biases for Surface Normal Estimation

Updated 64 runs

Updated 1.1K runs

AI Music Structure Analyzer + Stem Splitter using Demucs & Mdx-Net with Python-Audio-Separator

Updated 2.4K runs

Experimental & for non-commercial use only

Updated 6.5K runs

A fine-tuned SDXL model based on the movie Dune.

Updated 315 runs

High-quality multilingual text-to-speech library

Updated 1K runs

DUSt3R: Geometric 3D Vision Made Easy

Updated 406 runs

Sentiment Analysis with Texts

Updated 4.8K runs

A wrapper around bel-tts

Updated 1.4K runs

Turn a face into a sticker

Updated 1.2M runs

Updated 252 runs

Real-Time Open-Vocabulary Object Detection

Updated 7.2K runs

Juggernaut XL v9

Updated 1.2M runs

Surya is a document OCR toolkit that does:

Updated 4.4K runs

sdxl-lcm fine-tuned for Kids Colouring Pages

Updated 992 runs

Updated 130 runs

Turn anything into an abstract fine art masterpiece 🎨

Updated 362 runs

Generates 3D assets from images

Updated 2.6K runs

Scribbled art drawings style

Updated 770 runs

SDXL lightning multi-controlnet, img2img & inpainting

Updated 6.2K runs

dreamshaper-xl-lightning is a Stable Diffusion model that has been fine-tuned on SDXL

Updated 71.8K runs

ProteusV0.4: The Style Update

Updated 110.1K runs

Updated 165 runs

Lightweight multimodal model for visual question answering, reasoning and captioning

Updated 2.1K runs

Updated 47.4K runs

Simple video chroma keying

Updated 37 runs

Multilingual E5-small language embedding model

Updated 46 runs

Multilingual E5-large language embedding model

Updated 18 runs

Multilingual E5-large language embedding model

Updated 534 runs

Updated 2.8K runs

Tea Segmentation Demo

Updated 25 runs

Function calling LLM that surpasses the state-of-the-art in function calling capabilities

Updated 61 runs

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

Updated 99 runs

Updated 81 runs

AnimateDiff video to video

Updated 521 runs

Updated 1.1K runs

Updated 5K runs

SDXL tuned on Vsevolod Ivanov paintings

Updated 778 runs

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This is the SUPIR-v0F model and does NOT use LLaVA-13b.

Updated 9.6K runs

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This is the SUPIR-v0Q model and does NOT use LLaVA-13b.

Updated 6.8K runs

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This version uses LLaVA-13b for captioning.

Updated 142.5K runs

POC implementation of Depth-anything to produce a 3D SBS video

Updated 182 runs

E5-mistral-7b-instruct language embedding model

Updated 627 runs

Untamed

Updated 137 runs

Updated 95 runs

Merge two images together with a prompt

Updated 4.6K runs

Honeycomb NLQ Generator

Updated 180 runs

ProteusV0.4: The Style Update - enhances stylistic capabilities, similar to Midjourney's approach, rather than advancing prompt comprehension

Updated 130K runs

hello-world from cog example

Updated 34 runs

A collection of anime Stable Diffusion models with VAEs and LoRAs.

Updated 3.6K runs