Explore

I want to…

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Latest models

Implementation of the RemBG library

Updated 310 runs

BLIP3(XGen-MM) is a series of foundational Large Multimodal Models (LMMs) developed by Salesforce AI Research

Updated 379 runs

Transcribe audios using OpenAI's Whisper with stabilizing timestamps by stable-ts python package.

Updated 113 runs

Updated 15 runs

Updated 838 runs

Use a face to instantly make images. Uses SDXL Lightning checkpoints.

Updated 6.5K runs

Updated 99 runs

Cog to turn minimally-formatted plaintext into pdfs (using tex on the backend)

Updated 73 runs

Dark Sushi Mix 2.25D Model with vae-ft-mse-840000-ema (Text2Img, Img2Img and Inpainting)

Updated 49.9K runs

DeepSeek LLM, an advanced language model comprising 67 billion parameters. Trained from scratch on a vast dataset of 2 trillion tokens in both English and Chinese

Updated 52 runs

Updated 984 runs

turns text into pdf files with TeX

Updated 75 runs

A llama-3 based moderation and safeguarding language model

Updated 180.9K runs

a fine-tuned model to detect dragon in images.

Updated 28 runs

InstantID. ControlNets. More base SDXL models. And the latest ByteDance's ⚡️SDXL-Lightning !⚡️

Updated 214.8K runs

The img2img pipeline that makes an anime-style image of a person. It uses one of sd1.5 models as a base, depth-estimation as a ControleNet and IPadapter model for face consistency.

Updated 114 runs

Consistent Self-Attention for Long-Range Image and Video Generation

Updated 50K runs

Updated 722 runs

Generate video

Updated 826 runs

Optimized model

Updated 244 runs

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Updated 875 runs

Robust face restoration algorithm for old photos / AI-generated faces (adapted to work with video inputs)

Updated 374 runs

Generate anime-style image

Updated 85 runs

Updated 55 runs

Semantic Segmentation

Updated 407K runs

A SDXL Model trained from another SDXL-hiroshinagai model images

Updated 175 runs

Just some good ole beautifulsoup scrapping URL magic. (some sites don't work as they block scrapping, but still useful)

Updated 15.7K runs

Realistic Inpainting with ControlNET (M-LSD + SEG)

Updated 177.3K runs

Tango 2: Use text prompts to make sound effects

Updated 21.5K runs

🗣️ TalkNet-ASD: Detect who is speaking in a video

Updated 67 runs

Transfer a material from an image to a subject

Updated 7.1K runs

Updated 69 runs

Updated 301 runs

Creates voxels like game asset

Updated 448 runs

Updated 74 runs

Updated 160 runs

Uses 'Align your steps' for faster higher quality images

Updated 4.6K runs

llava-phi-3-mini is a LLaVA model fine-tuned from microsoft/Phi-3-mini-4k-instruct

Updated 2.7K runs

PyTorch implementation of AnimeGAN for fast photo animation

Updated 31K runs

Updated 9 runs

Updated 77 runs

Updated 260 runs

Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data

Updated 2.5K runs

Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data

Updated 34 runs

Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis

Updated 1.3K runs

llm model ,for CN

Updated 213 runs

Reliberate v3 Model (Text2Img, Img2Img and Inpainting)

Updated 965.4K runs

Deliberate V6 Model (Text2Img, Img2Img and Inpainting)

Updated 11K runs

AbsoluteReality V1.8.1 Model (Text2Img, Img2Img and Inpainting)

Updated 53.1K runs