Explore

I want to…

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Latest models

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Updated 4K runs

Turn a face into 3D, emoji, pixel art, video game, claymation or toy

Updated 11.3M runs

EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling

Updated 78 runs

Free Lunch towards Style-Preserving in Text-to-Image Generation by InstantX team, with ControlNet

Updated 506 runs

繁花 style 测试

Updated 105 runs

Free Lunch towards Style-Preserving in Text-to-Image Generation by InstantX team

Updated 1.8K runs

Updated 96.1K runs

MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens

Updated 413 runs

Newest balance-striking reranker model from BAAI. Outputs rank scores for query-doc pairs. FP16 inference enabled.

Updated 59.1K runs

Open Sora Plan Text To Video

Updated 1.6K runs

Domain Consistent Resolution Adapter for Diffusion Models: generating consistent images with resolutions outside of their trained domain

Updated 1.3K runs

sticker maker fork from fofr

Updated 444 runs

Realistic interior design with text and image inputs

Updated 119K runs

Updated 467 runs

Generate a video transitioning from one image to another using SEINE model

Updated 2.1K runs

Outputs a relevance/similarity score or a list of scores for a pair or pairs of string data. FP16 enabled.

Updated 95 runs

Updated 70 runs

Updated 34.7K runs

Updated 69 runs

LoRA + Iterative 4x Upscale ComfyUI Workflow

Updated 3.1K runs

Updated 280.9K runs

@pharmapsychotic 's CLIP-Interrogator, but 3x faster and more accurate. Specialized on SDXL.

Updated 901.2K runs

An implementation of ByteDance/SDXL-Lightning-8step

Updated 536 runs

Capture a website screenshot

Updated 450.4K runs

Vanitas style paintings

Updated 100 runs

Genetic algorithm like mixing of SDXL models

Updated 519 runs

Audio-Driven Synthesis of Photorealistic Portrait Animations

Updated 6.5K runs

Updated 22 runs

Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions

Updated 103 runs

Arc2Face: A Foundation Model of Human Faces

Updated 1.5K runs

GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation

Updated 578 runs

DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Updated 468 runs

Best Human detection and Object Detection Background removal.

Updated 6.8K runs

Tuning-free framework to achieve high appearance and temporal consistency in video editing

Updated 913 runs

AI model where $TORO meme is born. http://torocoin.top

Updated 727 runs

localfultonextractor's Erosumika 7B Mistral Merge, GGUF Q4_K_S-imat quantized by Lewdiculous.

Updated 564 runs

Generate music from a prompt or melody

Updated 2M runs

sdxs-512-0.9 can generate high-resolution images in real-time based on prompt texts, trained using score distillation and feature matching

Updated 18.8K runs

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Updated 3.9K runs

Honeycomb NLQ Generator hosted with vLLM + AWQ Quantized

Updated 100 runs

Good old controlnet + inpaint + lora

Updated 1.2K runs

A background removal model enhanced with ViTMatte.

Updated 1.8M runs

LEdits++ for image editing

Updated 663 runs

Updated 9 runs

Updated 87 runs

GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

Updated 91 runs

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Updated 13.6K runs

reverse-engineers images, faithfully reproducing prompts with precision

Updated 255 runs

Updated 13 runs

Updated 21 runs