Explore
Featured models
black-forest-labs / flux-1.1-pro
Faster, better FLUX Pro. Text-to-image model with excellent image quality, prompt adherence, and output diversity.
black-forest-labs / flux-schnell
The fastest image generation model tailored for local development and personal use
black-forest-labs / flux-dev
A 12 billion parameter rectified flow transformer capable of generating images from text descriptions
okaris / omni-zero-couples
Omni-Zero Couples: A diffusion pipeline for zero-shot stylized couples portrait creation.
levelsio / counter-strike
Take pics in the style of Counter-Strike 1.6's custom map fy_resort
meta / meta-llama-3.1-405b-instruct
Meta's flagship 405 billion parameter language model, fine-tuned for chat completions
I want to…
Generate images
Models that generate images from text prompts
Use a language model
Models that can understand and generate text
Caption images
Models that generate text from images
Edit images
Tools for manipulating images.
Restore images
Models that improve or restore images by deblurring, colorization, and removing noise
The FLUX.1 family of models
The FLUX.1 family of text-to-image models from Black Forest Labs
Upscale images
Upscaling models that create high-quality images from low-quality images
Get embeddings
Models that generate embeddings from inputs
Extract text from images
Optical character recognition (OCR) and text extraction
Transcribe speech
Models that convert speech to text
Chat with images
Ask language models about images
Use handy tools
Toolbelt-type models for videos and images.
Use a face to make images
Make realistic images of people instantly
Generate music
Models to generate and modify music
Generate videos
Models that create and edit videos
Fine-tune Flux
Create a fine-tuned Flux model using your own training images.
Generate speech
Convert text to speech
Make 3D stuff
Models that generate 3D objects, scenes, radiance fields, textures and multi-views.
Get structured data
Language models that support grammar-based decoding as well as jsonschema constraints.
Popular models
SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps
Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification
A text-to-image generative AI model that creates beautiful images
Latest models
Inspired by the vibrant and imaginative style of Ukrainian folk artist Maria Prymachenko, this AI model specializes in creating whimsical and colorful artworks that reflect the essence of traditional folklore and nature themes.
The Mixtral-8x7B Large Language Model (LLM) is a pretrained generative Sparse Mixture of Experts.
MusicGen stereo fine-tuned on Pansori Epic Chant, a Korean folk music with the text token “Korean traditional folk music, pansori”
Convert scanned or electronic documents to markdown, very very very fast
An attempt to render Teenage Mutant Ninja Turtles: Mutant Mayhem-like images
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
A ~7B parameter language model from Deepseek for SOTA repository level code completion
Source: Pclanglais/MonadGPT ✦ Quant: TheBloke/MonadGPT-AWQ ✦ What would have happened if ChatGPT was invented in the 17th century?
Playground v2 is a diffusion-based text-to-image generative model trained from scratch by the research team at Playground
Source: fblgit/una-cybertron-7b-v2-bf16 ✦ Quant: TheBloke/una-cybertron-7B-v2-AWQ ✦ A 7B MistralAI based model, best on it's series. Trained on SFT, DPO and UNA (Unified Neural Alignment) on multiple datasets
Translate audio while keeping the original style, pronunciation and tone of your original audio.
Convert your videos to DensePose and use it with MagicAnimate
API for enhanced word-level timestamp accuracy using OpenAI's Whisper model
Add a watermark to your videos using the power of Replicate brought to you from your friends at FullJourney.AI
Counterfeit XL v2 Model (Text2Img, Img2Img and Inpainting)
Juggernaut XL v7 Model (Text2Img, Img2Img and Inpainting)
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Source: chargoddard/loyal-piano-m7 ✦ Quant: TheBloke/loyal-piano-m7-AWQ ✦ Intended to be a roleplay-focused model with some smarts and good long-context recall