Explore

Featured models

zsxkib / pyramid-flow

Text-to-Video + Image-to-Video: Pyramid Flow Autoregressive Video Generation method based on Flow Matching

187 runs

black-forest-labs / flux-1.1-pro

Faster, better FLUX Pro. Text-to-image model with excellent image quality, prompt adherence, and output diversity.

611.9K runs

black-forest-labs / flux-schnell

The fastest image generation model tailored for local development and personal use

68.6M runs

black-forest-labs / flux-dev

A 12 billion parameter rectified flow transformer capable of generating images from text descriptions

2.5M runs

levelsio / analog-film

Take photos in analog film style

2.6K runs

meta / meta-llama-3.1-405b-instruct

Meta's flagship 405 billion parameter language model, fine-tuned for chat completions

3.2M runs

I want to…

Generate images

Models that generate images from text prompts

Use a language model

Models that can understand and generate text

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

The FLUX.1 family of models

The FLUX.1 family of text-to-image models from Black Forest Labs

Upscale images

Upscaling models that create high-quality images from low-quality images

Get embeddings

Models that generate embeddings from inputs

Extract text from images

Optical character recognition (OCR) and text extraction

Use handy tools

Toolbelt-type models for videos and images.

Use a face to make images

Make realistic images of people instantly

Fine-tune Flux

Create a fine-tuned Flux model using your own training images.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Get structured data

Language models that support grammar-based decoding as well as jsonschema constraints.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 3 weeks, 3 days ago 469.8M runs

falcons-ai/nsfw_image_detection

Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

Updated 10 months, 2 weeks ago 15.2M runs

openai/whisper

Convert speech in audio to text

Updated 1 month, 2 weeks ago 36.1M runs

pengdaqian2020/image-tagger

image tagger

Updated 1 year, 4 months ago 37.3M runs

salesforce/blip

Generate image captions

Updated 2 years ago 103M runs

zf-kbot/sd-inpaint

Fill in masked parts of images with Stable Diffusion

Updated 1 month ago 2.8M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years ago 15.4M runs

The LaMa (Large Mask Inpainting) model is an advanced image inpainting system designed to address the challenges of handling large missing areas, complex geometric structures, and high-resolution images.

Updated 1 year, 3 months ago 230.7K runs

Latest models

nightmareai/cogvideo

Text-to-video generation

Updated 2 years, 2 months ago 32.7K runs

sanzgiri/cartoonify_video

Cartoonifies a video

Updated 2 years, 2 months ago 13.7K runs

sanzgiri/cartoonify

Cartoonifies an image

Updated 2 years, 2 months ago 4.2K runs

nicholascelestin/real-esrgan-nitroviper

DO NOT USE - Broken - Only Public For API Usage & Debugging

Updated 2 years, 2 months ago 5.4K runs

nightmareai/latent-sr

Upscale images with the latent diffusion superresolution model

Updated 2 years, 2 months ago 114.1K runs

mehdidc/feed_forward_vqgan_clip

Feed forward VQGAN-CLIP model

Updated 2 years, 3 months ago 130.3K runs

laion-ai/deep-image-diffusion-prior

Generate an image using text by visualizing CLIP features.

Updated 2 years, 3 months ago 1.1K runs

evilstreak/clipdraw-interactive

Morphs vector paths towards a text prompt

Updated 2 years, 3 months ago 183.9K runs

laion-ai/puck

Generate retro videogame art using text.

Updated 2 years, 3 months ago 4.9K runs

wyhsirius/lia

Learning to Animate Images via Latent Space Navigation

Updated 2 years, 3 months ago 19.1K runs

fenglinglwb/large-hole-image-inpainting

MAT: Mask-Aware Transformer for Large Hole Image Inpainting

Updated 2 years, 3 months ago 16.7K runs

renyurui/controllable-person-synthesis

Human pose manipulation for fashion

Updated 2 years, 3 months ago 3.4K runs

davidgillsjo/srw-net

Semantic Room Wireframe

Updated 2 years, 3 months ago 2.5K runs

nightmareai/arf-svox2

Artistic Radiance Fields - Transfer the style of an image to a 3D scene (NeRF)

Updated 2 years, 3 months ago 16K runs

storymy/take-off-eyeglasses

Remove eyeglasses and shadows from photo

Updated 2 years, 3 months ago 32.5K runs

nicholascelestin/latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Updated 2 years, 3 months ago 5.6K runs

winycg/anchor_net

Localizing Semantic Patches for Accelerating Image Classification

Updated 2 years, 3 months ago 220 runs

wzx0826/lbnet

Lightweight Bimodal Network for Single-Image Super-Resolution via Symmetric CNN and Recursive Transformer

Updated 2 years, 3 months ago 7K runs

afiaka87/ldm-autoedit

Updated 2 years, 3 months ago 1.5K runs

nicholascelestin/glid-3

Generate images quickly with GLID-3 (non-xl)

Updated 2 years, 3 months ago 3.9K runs

nicholascelestin/dalle-mega

Made public only for API calls. Use min-dalle instead-- it's superior.

Updated 2 years, 3 months ago 12.1K runs

borisdayma/dalle-mini

Generate images from a text prompt

Updated 2 years, 3 months ago 58.3K runs

vis-opt-group/sci

Low-Light Image Enhancement

Updated 2 years, 4 months ago 11.2K runs

elleo/uk-petition-generator

Generate petitions suitable for sending to the UK government

Updated 2 years, 4 months ago 108 runs

j-min/clip-caption-reward

Fine-grained Image Captioning with CLIP Reward

Updated 2 years, 4 months ago 296K runs

pixray/text2image-future

pixray text2image (future branch)

Updated 2 years, 4 months ago 24.9K runs

cjwbw/face-align-cog

face alignment using stylegan-encoding

Updated 2 years, 4 months ago 4.3K runs

mchong6/gans-n-roses

Convert image or video of your face to anime

Updated 2 years, 4 months ago 4.7K runs

evilstreak/clipdraw

Generate art from text prompts. Based on kvfrans/clipdraw.

Updated 2 years, 4 months ago 893 runs

sujaykhandekar/object-removal

Removes specified objects from image

Updated 2 years, 4 months ago 15.3K runs

javirk/object-removal-partial-convolutions

Removes specified objects from image

Updated 2 years, 4 months ago 842 runs

andreasjansson/codegen

An open-source model for program synthesis. Competitive with OpenAI Codex.

Updated 2 years, 4 months ago 1.1K runs

pixray/text2image

Uses pixray to generate an image from text prompt

Updated 2 years, 4 months ago 1.4M runs

yael-vinker/clipasso

Draws an abstract sketch of an object

Updated 2 years, 4 months ago 9K runs

bfirsh/resnet

Classifies images with ResNet-50

Updated 2 years, 4 months ago 178 runs

replicate/resnet

Classifies images with ResNet-50

Updated 2 years, 4 months ago 8.7K runs

yoyo-nb/thin-plate-spline-motion-model

Thin-Plate Spline Motion Model for Image Animation

Updated 2 years, 4 months ago 573.9K runs

jack000/glid-3-xl

A 1.4B parameter text2im model from CompVis, finetuned on CLIP text embeds and curated data.

Updated 2 years, 4 months ago 45.5K runs

phamquiluan/facial-expression-recognition

Facial Expression Recognition using Residual Masking Network

Updated 2 years, 4 months ago 14.7K runs

nkolkin13/neuralneighborstyletransfer

Transfer the texture/style of one image onto another

Updated 2 years, 4 months ago 7.6K runs

elazarg/nakdimon

A simple Hebrew Diacritizer

Updated 2 years, 5 months ago 125 runs

microsoft/kid

Updated 2 years, 5 months ago 268 runs

bencevans/megadetector

Detect Animals, Vehicles and Humans in Camera Trap Imagery

Updated 2 years, 5 months ago 557 runs

yxuansu/magic

Plugging Visual Controls in Text Generation

Updated 2 years, 5 months ago 1.4K runs

zeke/cog-markdown-example

Updated 2 years, 5 months ago 15 runs

cszn/scunet

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis

Updated 2 years, 5 months ago 23.2K runs

andreasjansson/music-inpainting-bert

Music inpainting of melody and chords

Updated 2 years, 5 months ago 8.5K runs

megvii-research/nafnet

Nonlinear Activation Free Network for Image Restoration

Updated 2 years, 5 months ago 1.3M runs

retrocirce/zero_shot_audio_source_separation

Zero shot Sound separation by arbitrary query samples

Updated 2 years, 5 months ago 40.1K runs

google-research/maxim

Multi-Axis MLP for Image Processing

Updated 2 years, 5 months ago 464.1K runs