Explore

Featured models

zsxkib / pyramid-flow

Text-to-video and image-to-video generation with Pyramid Flow, an autoregressive video generation method based on flow matching

60 runs

black-forest-labs / flux-1.1-pro

Faster, better FLUX Pro. Text-to-image model with excellent image quality, prompt adherence, and output diversity.

601K runs

black-forest-labs / flux-schnell

The fastest image generation model tailored for local development and personal use

68.4M runs

black-forest-labs / flux-dev

A 12 billion parameter rectified flow transformer capable of generating images from text descriptions

2.5M runs

levelsio / analog-film

Take photos in analog film style

2.5K runs

meta / meta-llama-3.1-405b-instruct

Meta's flagship 405 billion parameter language model, fine-tuned for chat completions

3.2M runs

I want to…

Generate images

Models that generate images from text prompts

Use a language model

Models that can understand and generate text

Caption images

Models that generate text from images

Edit images

Tools for manipulating images.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

The FLUX.1 family of models

The FLUX.1 family of text-to-image models from Black Forest Labs

Upscale images

Upscaling models that create high-quality images from low-quality images

Get embeddings

Models that generate embeddings from inputs

Extract text from images

Optical character recognition (OCR) and text extraction

Transcribe speech

Models that convert speech to text

Chat with images

Ask language models about images

Use handy tools

Toolbelt-type models for videos and images.

Use a face to make images

Make realistic images of people instantly

Generate music

Models to generate and modify music

Generate videos

Models that create and edit videos

Fine-tune Flux

Create a fine-tuned Flux model using your own training images.

Generate speech

Convert text to speech

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Get structured data

Language models that support grammar-based decoding as well as JSON Schema constraints.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 3 weeks, 3 days ago 469.4M runs

openai/whisper

Convert speech in audio to text

Updated 1 month, 2 weeks ago 36M runs

falcons-ai/nsfw_image_detection

Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

Updated 10 months, 2 weeks ago 15.1M runs

salesforce/blip

Generate image captions

Updated 2 years ago 103M runs

pengdaqian2020/image-tagger

image tagger

Updated 1 year, 4 months ago 37.3M runs

zf-kbot/sd-inpaint

Fill in masked parts of images with Stable Diffusion

Updated 1 month ago 2.8M runs

stability-ai/sdxl

A text-to-image generative AI model that creates beautiful images

Updated 4 months, 2 weeks ago 66.9M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 1 year, 7 months ago 61.6M runs

Latest models

yoadtew/zero-shot-image-to-text

image to text generation

Updated 2 years ago 6.6K runs

longguangwang/arbsr

Scale-Arbitrary Super-Resolution

Updated 2 years ago 21.2K runs

jiupinjia/stylized-neural-painting-oil

Image to oil painting generation

Updated 2 years ago 6.2K runs

ouhenio/stylegan3-clip

stylegan3 + clip

Updated 2 years ago 6.9K runs

yuanxunlu/livespeechportraits

Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation

Updated 2 years ago 9.7K runs

wonjongg/stylecarigan

Caricature Generation via StyleGAN Feature Map Modulation

Updated 2 years ago 5.3K runs

raoumer/srrescgan

Intelligent image scaling to 4x resolution

Updated 2 years ago 40.6K runs

huage001/adaattn

Arbitrary Neural Style Transfer

Updated 2 years ago 220.3K runs

codeslake/ifan-defocus-deblur

Removes defocus blur in an image

Updated 2 years ago 117.3K runs

meta/ic_gan

Instance-Conditioned GAN

Updated 2 years ago 26.7K runs

cjwbw/whisper-downloadable-subtitles

Adds downloadable subtitles to openai/whisper

Updated 2 years ago 2.1K runs

jingyunliang/hcflow-sr

Image Super-Resolution

Updated 2 years ago 221.8K runs

xinntao/esrgan

Image 4x super-resolution

Updated 2 years ago 75.9K runs

kyrick/prompt-parrot

Prompt Parrot generates text2image prompts from finetuned distilgpt2

Updated 2 years ago 246K runs

google-research/frame-interpolation

Frame Interpolation for Large Scene Motion

Updated 2 years ago 263.5K runs

harmonai/dance-diffusion

Tools to train a generative model on arbitrary audio samples

Updated 2 years ago 5.2K runs

tengfei-wang/hfgi

High-Fidelity GAN Inversion for Image Attribute Editing

Updated 2 years ago 22.1K runs

eladrich/pixel2style2pixel

a StyleGAN Encoder for Image-to-Image Translation

Updated 2 years ago 31.8K runs

yuval-alaluf/restyle_encoder

ReStyle: A Residual-Based StyleGAN Encoder via Iterative Refinement

Updated 2 years ago 89.2K runs

mchong6/jojogan

JoJoGAN: One Shot Face Stylization

Updated 2 years ago 394.3K runs

rinongal/stylegan-nada

StyleGAN-NADA: CLIP-Guided Domain Adaptation of Image Generators

Updated 2 years ago 94.2K runs

cjwbw/rudalle-sr

Real-ESRGAN super-resolution model from ruDALL-E

Updated 2 years ago 481.3K runs

meta/detic

Detects any class given class names

Updated 2 years ago 27.3K runs

jingyunliang/swinir

Image Restoration Using Swin Transformer

Updated 2 years ago 5.8M runs

salesforce/blip

Generate image captions

Updated 2 years ago 103M runs

yangxy/gpen

Blind Face Restoration in the Wild

Updated 2 years ago 161K runs

yuval-alaluf/sam

Only a Matter of Style: Age Transformation Using a Style-Based Regression Model

Updated 2 years ago 986.2K runs

rmokady/clip_prefix_caption

Simple image captioning model using CLIP and GPT-2

Updated 2 years ago 1.7M runs

onion-liu/blendgan

Arbitrary Stylized Face Generation

Updated 2 years ago 7.6K runs

microsoft/bringing-old-photos-back-to-life

Bringing Old Photos Back to Life

Updated 2 years ago 906.8K runs

tommoore515/pix2pix_tf_albedo2pbrmaps

pix2pix model for predicting pbr texture maps from an albedo texture

Updated 2 years ago 3.7K runs

m1guelpf/whisper-subtitles

Generate subtitles from an audio file, using OpenAI's Whisper model.

Updated 2 years ago 69.3K runs

cjwbw/stable-diffusion-high-resolution

Detailed, higher-resolution images from Stable Diffusion

Updated 2 years ago 72.9K runs

2feet6inches/cog-hyped-bot

Generates Japanese hip-hop-style lyrics from any phrase

Updated 2 years ago 131 runs

tommoore515/material_stable_diffusion

Stable diffusion fork for generating tileable outputs

Updated 2 years ago 387.7K runs

cjwbw/clip-vit-large-patch14

openai/clip-vit-large-patch14 with Transformers

Updated 2 years ago 5.8M runs

cjwbw/sd-textual-inversion-ugly-sonic

stable-diffusion-textual-inversion fine-tuned on Ugly Sonic

Updated 2 years ago 2K runs

ariel415el/gpdm

Generating Natural Images with Direct Patch Distribution Matching

Updated 2 years ago 5.8K runs

2feet6inches/cog-rinna-japanese-gpt2

AI generates long-form text from any Japanese topic

Updated 2 years ago 472 runs

2feet6inches/cog-prompt-parrot

Enhance Stable Diffusion prompt.

Updated 2 years ago 36.1K runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years ago 15.4M runs

rinnakk/japanese-stable-diffusion

Japanese-specific latent text-to-image diffusion model

Updated 2 years ago 2.4K runs

cjwbw/sd-textual-inversion-spyro-dragon

stable-diffusion-textual-inversion fine-tuned on the Spyro the Dragon style

Updated 2 years ago 477 runs

cjwbw/sd-textual-inversion

Stable Diffusion Textual Inversion

Updated 2 years ago 481 runs

sczhou/codeformer

Robust face restoration algorithm for old photos / AI-generated faces

Updated 2 years, 1 month ago 35.5M runs

nateraw/stable-diffusion-videos

Generate videos by interpolating the latent space of Stable Diffusion

Updated 2 years, 1 month ago 58.3K runs

xinntao/realesrgan

Practical Image Restoration Algorithms for General/Anime Images

Updated 2 years, 1 month ago 6.6M runs

afiaka87/sd-aesthetic-guidance

Use stable diffusion and aesthetic CLIP embeddings to guide boring outputs to be more aesthetically pleasing.

Updated 2 years, 1 month ago 4.3K runs

deforum/deforum_stable_diffusion

Animating prompts with stable diffusion

Updated 2 years, 1 month ago 253.8K runs

xpixelgroup/hat

Activating More Pixels in Image Super-Resolution Transformer

Updated 2 years, 1 month ago 25K runs