Explore
Featured models
black-forest-labs / flux-1.1-pro
Faster, better FLUX Pro. Text-to-image model with excellent image quality, prompt adherence, and output diversity.
black-forest-labs / flux-schnell
The fastest image generation model tailored for local development and personal use
black-forest-labs / flux-dev
A 12 billion parameter rectified flow transformer capable of generating images from text descriptions
okaris / omni-zero-couples
Omni-Zero Couples: A diffusion pipeline for zero-shot stylized couples portrait creation.
levelsio / counter-strike
Take pics in the style of Counter-Strike 1.6's custom map fy_resort
meta / meta-llama-3.1-405b-instruct
Meta's flagship 405 billion parameter language model, fine-tuned for chat completions
I want to…
Generate images
Models that generate images from text prompts
Use a language model
Models that can understand and generate text
Caption images
Models that generate text from images
Edit images
Tools for manipulating images.
Restore images
Models that improve or restore images by deblurring, colorization, and removing noise
Upscale images
Upscaling models that create high-quality images from low-quality images
The FLUX.1 family of models
The FLUX.1 family of text-to-image models from Black Forest Labs
Get embeddings
Models that generate embeddings from inputs
Extract text from images
Optical character recognition (OCR) and text extraction
Transcribe speech
Models that convert speech to text
Chat with images
Ask language models about images
Use handy tools
Toolbelt-type models for videos and images.
Use a face to make images
Make realistic images of people instantly
Generate music
Models to generate and modify music
Generate videos
Models that create and edit videos
Generate speech
Convert text to speech
Fine-tune Flux
Create a fine-tuned Flux model using your own training images.
Make 3D stuff
Models that generate 3D objects, scenes, radiance fields, textures and multi-views.
Get structured data
Language models that support grammar-based decoding as well as jsonschema constraints.
Popular models
SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps
Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification
A text-to-image generative AI model that creates beautiful images
Practical face restoration algorithm for *old photos* or *AI-generated faces*
Latest models
A watercolor style painting model that does impressionism well and lends itself to anime.
Stylized sketch anime model that has a bit of a watercolor undertone to it
A style that creates a paint wash, great for anime.
A style model, ligne claire esque with eastern influence
A very blocky and bold cartoon style with some anime elements. You should use daiton style to trigger the image generation.
A model that makes a soft and quiet world. Can use "daiton" as a trigger but it isn't needed.
Real-Time High Quality Lip Synchronization with Latent Space Inpainting
MiniCPM LLama3-V 2.5, a new SOTA open-source VLM that surpasses GPT-4V-1106 and Phi-128k on a number of benchmarks.
An improved outpainting model that supports LoRA urls. This model uses PatchMatch to improve the mask quality.
A fast high quality SD 1.5 model, Realistic Vision V6.0 B1 Hyper
High quality 6-step lightning model, Jib Mix Realistic XL v10 Lightning
Fast and high quality lightning model, epiCRealismXL-Lightning Hades
Llama-3-8B finetuned with ReFT to hyperfocus on New Jersey, the Garden State, the best state, the only state!
An example using Garden State Llama to ReFT on the Golden Gate bridge.
🎙️Hololive text-to-speech and voice-to-voice (Japanese🇯🇵 + English🇬🇧)
epiCRealism v7-Final Destination. Top Realism Model on Civitai
blue_pencil-XL meets ANIMAGINE XL 3.0 / ANIMAGINE XL 3.1, The top ranked model on Civitai
A PhotoBooth style transfer workflow that utilizes IPadapter Style, Canny, OpenPose, RemoveBackground, HumanSegmentation, Cloth Segmentation for initial input, and concludes with the application of DeepFake techniques.
AI Photorealistic Image Ultra-Resolution, Restoration and Upscale!
SDXL LoRA finetuned on spectrograms of Beethoven songs
Transfer empty room into fabulous interior design
COMPP FS24 - A fine tuned model of MusicGen for continuation of music files in the style of System of a Down.
viⓍTTS vixTTS là mô hình tạo sinh giọng nói cho phép bạn sao chép giọng nói sang các ngôn ngữ khác nhau chỉ bằng cách sử dụng một đoạn âm thanh nhanh dài 6 giây
Given image of an face, the it generates full images with given prompt