Explore
Featured models
deeterbleater / flux-mff
This LoRA was created using snapshots of Midwest Furfest 2023, I call it Multiverse Furfest 202X. Trigger word is MFF, adding "fursuit" and "convention" to your prompts seems to help.
black-forest-labs / flux-1.1-pro
Faster, better FLUX Pro. Text-to-image model with excellent image quality, prompt adherence, and output diversity.
black-forest-labs / flux-schnell
The fastest image generation model tailored for local development and personal use
black-forest-labs / flux-dev
A 12 billion parameter rectified flow transformer capable of generating images from text descriptions
levelsio / analog-film
Take photos in analog film style
meta / meta-llama-3.1-405b-instruct
Meta's flagship 405 billion parameter language model, fine-tuned for chat completions
I want to…
Generate images
Models that generate images from text prompts
Use a language model
Models that can understand and generate text
Caption images
Models that generate text from images
Edit images
Tools for manipulating images.
Restore images
Models that improve or restore images by deblurring, colorization, and removing noise
The FLUX.1 family of models
The FLUX.1 family of text-to-image models from Black Forest Labs
Upscale images
Upscaling models that create high-quality images from low-quality images
Get embeddings
Models that generate embeddings from inputs
Extract text from images
Optical character recognition (OCR) and text extraction
Transcribe speech
Models that convert speech to text
Chat with images
Ask language models about images
Use handy tools
Toolbelt-type models for videos and images.
Use a face to make images
Make realistic images of people instantly
Generate music
Models to generate and modify music
Generate videos
Models that create and edit videos
Fine-tune Flux
Create a fine-tuned Flux model using your own training images.
Generate speech
Convert text to speech
Make 3D stuff
Models that generate 3D objects, scenes, radiance fields, textures and multi-views.
Get structured data
Language models that support grammar-based decoding as well as jsonschema constraints.
Popular models
SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps
Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification
multilingual-e5-large: A multi-language text embedding model
Practical face restoration algorithm for *old photos* or *AI-generated faces*
Latest models
Source: NousResearch/Obsidian-3B-V0.5 ✦ Worlds smallest multi-modal LLM
Source: PocketDoc/Dans-AdventurousWinds-Mk2-7b ✦ Quant: TheBloke/Dans-AdventurousWinds-Mk2-7B-AWQ ✦ This model is proficient in crafting text-based adventure games
Source: Intel/neural-chat-7b-v3-1 ✦ Quant: TheBloke/neural-chat-7B-v3-1-AWQ ✦ Fine-tuned model based on mistralai/Mistral-7B-v0.1
Animate Your Personalized Text-to-Image Diffusion Models with SDXL and LCM
Text to video diffusion model with variable length frame conditioning for infinite length video
Dreamshaper-7 img2img with LCM LoRA for faster inference
An auto-regressive causal LM created by combining 2x finetuned Llama-2 70B into one.
RealvisXL-v2.0 with LCM LoRA - requires fewer steps (4 to 8 instead of the original 40 to 50)
Take a list of image URLs as frames and output a video
Auto fuse a user's face onto the template image, with a similar appearance to the user
Create song covers with any RVC v2 trained AI voice from audio files.
A smaller cuter, but lower quality version of my SDXL Hiroshi Nagai model
A combination of ip_adapter SDv1.5 and mediapipe-face to inpaint a face
The Yi series models are large language models trained from scratch by developers at 01.AI.
The Yi series models are large language models trained from scratch by developers at 01.AI.
The Yi series models are large language models trained from scratch by developers at 01.AI.
MusicGen Stereo Medium fine-tuned on ambient with the text token "breathe"
Custom improvements like a custom callback to enhance the inference | It's a WIP and it may causes some wrong outputs
MusicGen Stereo Medium fine-tuned on industrial techno with the text token "construction hymn"
MusicGen Stereo Medium model trained on the tracks of NewJeans with the text token "NewJeans"
MusicGen Stereo Melody model fine-tuned on the tracks of NewJeans with the text token "NewJeans"
An extremely fast all-in-one model to use LCM with SDXL, ControlNet and custom LoRA url's!
Create variations of an uploaded image. Please see README for more details
Source: meta-llama/Llama-2-7b-chat-hf ✦ Quant: TheBloke/Llama-2-7B-Chat-AWQ ✦ Intended for assistant-like chat
Source: meta-math/MetaMath-Mistral-7B ✦ Quant: TheBloke/MetaMath-Mistral-7B-AWQ ✦ Bootstrap Your Own Mathematical Questions for Large Language Models
Source: Severian/ANIMA-Phi-Neptune-Mistral-7B ✦ Quant: TheBloke/ANIMA-Phi-Neptune-Mistral-7B-AWQ ✦ Biomimicry Enhanced LLM
Animate Your Personalized Text-to-Image Diffusion Models (Long boot times!)