Explore

I want to…

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Latest models

Updated 647 runs

Face Restoration

Updated 2.5K runs

A text-to-image model with greatly improved performance in image quality, typography, complex prompt understanding, and resource-efficiency

Updated 1.3M runs

MimicMotion: High-quality human motion video generation with pose-guided control

Updated 1.7K runs

Remove backgrounds from retailer product images

Updated 40 runs

Make realistic images of real people instantly (w/ ip-adapter-plus-face_sdxl_vit-h)

Updated 2.4K runs

PixArt Sigma 900M is a text-to-image generation model based on the PixArt Sigma architecture

Updated 154 runs

Updated 43.4K runs

Updated 93 runs

araby.ai one-shot video face swap

Updated 4K runs

NuminaMath is a series of language models that are trained to solve math problems using tool-integrated reasoning (TIR)

Updated 17 runs

MARS5, a fully open-source (commercially usable) voice-cloning/TTS model with breakthrough prosody and realism.

Updated 487 runs

Updated 303 runs

for backsound

Updated 79 runs

Cog wrapper for Ollama deepseek-coder-v2:236b

Updated 373 runs

Convert audio to SRT subtitles

Updated 26 runs

My Cat Xiaobai

Updated 460 runs

Cog wrapper for Ollama llama3:70b

Updated 48 runs

Cog wrapper for Ollama llama3:8b

Updated 12 runs

Input a video. Ask anything about it

Updated 1.1K runs

YOLOv10: Real-Time End-to-End Object Detection

Updated 52 runs

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Updated 201 runs

Take audio from one video and add it to a second video. Good for adding back audio to liveportrait.

Updated 87 runs

Change the fps of a video without changing its length or speed

Updated 79 runs

Portrait animation using a driving video source

Updated 50.8K runs

Efficient Portrait Animation with Stitching and Retargeting Control

Updated 1.1K runs

Kolors is a SOTA base image model for high quality image generation

Updated 1.1K runs

Updated 13 runs

Updated 83 runs

Updated 13 runs

The API automatically detects objects in an input image and returns their positional and mask information.

Updated 3.7K runs

Convert images to anime style

Updated 174.8K runs

Create music for your content

Updated 138K runs

Updated 354 runs

Mama ママ 2.0 Shinsei Galverse Anime-themed text-to-image model

Updated 1.7K runs

InternLM2.5 includes an open-source 7-billion-parameter base model and a chat model tailored for practical scenarios.

Updated 48 runs

Updated 69 runs

Create videos from illustrated input images

Updated 32.7K runs

Phi-3-Mini-4K-Instruct is a lightweight, state-of-the-art open model with 3.8B parameters, trained with the Phi-3 datasets

Updated 81.8K runs

Qwen2 57-billion-parameter language model from Alibaba Cloud, fine-tuned for chat completions

Updated 973 runs

SDXL fine-tune based on images of birds primarily from the British Library free archive

Updated 17 runs

Generate clay style images based on prompts or images

Updated 328 runs

GLM-4V is a multimodal model released by Tsinghua University that is competitive with GPT-4o and establishes a new SOTA on several benchmarks, including OCR.

Updated 50.5K runs

Convert speech in audio to text w/ `tiny`, `small`, `base`, and `large-v3` models

Updated 35 runs

Dolphin-2.9 has a variety of instruction, conversational, and coding skills. It also has initial agentic abilities and supports function calling

Updated 197 runs

Extended video synthesis model that generates 128 frames

Updated 194 runs

Image generation with inpaint strength, LoRA custom_urls, and enhancer.

Updated 240 runs

Depth estimation with faster inference speed, fewer parameters, and higher depth accuracy.

Updated 175.7K runs

Updated 33 runs

Updated 55 runs