Explore

I want to…

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures, and multi-views

Latest models

Detect and simplify the contours of a binary image

Updated 220 runs

Super-resolves a low-resolution (LR) video frame (ultra-wide) using a reference video frame (wide-angle)

Updated 14.3K runs

Homage to the Pixel: text prompt to 6 color squares

Updated 9.3K runs

[CVPR 2022] Unsupervised Image-to-Image Translation with Generative Prior

Updated 1.2K runs

bare pixray for API use

Updated 11.6K runs

Design Your Hair by Text and Reference Image

Updated 276.1K runs

A Steerable Model for Bach Chorales Generation

Updated 841 runs

One-shot (any-to-any) Voice Conversion

Updated 6.3K runs

Online demo for "Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation"

Updated 1.9K runs

A model for testing pydantic cog that yields images one word at a time.

Updated 126 runs

Image Style Transfer with Text Condition

Updated 25.6K runs

Synthesize drawings to match a text prompt

Updated 5.5K runs

A model for testing pydantic cog that generates images.

Updated 409 runs

A test model that generates Haiku (and yields output one word at a time)

Updated 104 runs

Clip-Guided Diffusion Model for Image Generation

Updated 4.5K runs

GLIDE-text2im w/ humans and experimental style prompts.

Updated 9.2K runs

Updated 565 runs

A fork of pixray/pixray for trying out Cog's new Predictor API

Updated 58 runs

Grad-CAM visualizations for Align before Fuse

Updated 3.6K runs

Updated 75 runs

GLIDE from OpenAI finetuned on roughly 30M more samples. See `laionide-v3` for the latest.

Updated 3.8K runs

Transcribes piano audio and makes it into a cool video

Updated 218 runs

Masked-attention Mask Transformer for Universal Image Segmentation

Updated 658 runs

Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement

Updated 358 runs

Updated 94 runs

Updated 147 runs

Generate Pokémon with Projected GAN

Updated 9.7K runs

Text-Guided Image Generation and Manipulation

Updated 824 runs

A Single Model for Many Visual Modalities

Updated 247 runs

Supervised Weakly from hashtAGs

Updated 294 runs

Classify numerical digits.

Updated 115 runs

Democratizing automatic music transcription

Updated 3K runs

Uses pixray to generate an image from a text prompt

Updated 148.4K runs

Training-Free Text-to-Image Generation

Updated 2.4K runs

Updated 1.7K runs

Updated 750 runs

Image generation from Wav2CLIP through VQGAN-CLIP

Updated 896 runs

Guide Style Transfer with CLIP loss

Updated 1.3K runs

Baseline models demo of the IEEE L3DAS22 Challenge

Updated 228 runs

Simple example of a Cog model that produces Markdown output

Updated 23 runs

Huan's first cog with Replicate.AI

Updated 35 runs

Deep Halftoning with Reversible Binary Pattern

Updated 364 runs

Guide a StyleGAN3 trained on pictures of mannequins with CLIP.

Updated 850 runs

Guesses your age based on a photo

Updated 421 runs

Age prediction using CLIP

Updated 528 runs

Selfie to anime

Updated 3.1K runs

Generates Pokémon sprites from a prompt

Updated 4.9K runs

Disentangled face manipulation using CLIP-based annotations

Updated 1.8K runs

Scaling-up Disentanglement for Image Translation

Updated 984 runs

Image generation with CLIP + VQGAN / PixelDraw

Updated 6.7K runs