Explore
Featured models
minimax / video-01
Generate 6s videos with prompts or images. (Also known as Hailuo)
black-forest-labs / flux-fill-pro
Professional inpainting and outpainting model with state-of-the-art performance. Edit or extend images with natural, seamless results.
black-forest-labs / flux-1.1-pro-ultra
FLUX1.1 [pro] in ultra and raw modes. Images are up to 4 megapixels. Use raw mode for realism.
black-forest-labs / flux-redux-dev
Open-weight image variation model. Create new versions while preserving key elements of your original.
recraft-ai / recraft-v3
Recraft V3 (code-named red_panda) is a text-to-image model that can generate long text and images in a wide range of styles. As of today, it is SOTA in image generation, according to the Text-to-Image Benchmark by Artificial Analysis.
davisbrown / flux-half-illustration
Flux LoRA that creates half-photo, half-illustrated elements. Use "in the style of TOK" to trigger generation.
I want to…
Generate images
Models that generate images from text prompts
Use a language model
Models that can understand and generate text
Upscale images
Upscaling models that create high-quality images from low-quality images
Caption images
Models that generate text from images
The FLUX family of models
The FLUX family of text-to-image models from Black Forest Labs
Restore images
Models that improve or restore images by deblurring, colorization, and removing noise
Get embeddings
Models that generate embeddings from inputs
Extract text from images
Optical character recognition (OCR) and text extraction
Transcribe speech
Models that convert speech to text
Use handy tools
Toolbelt-type models for videos and images.
Chat with images
Ask language models about images
Edit images
Tools for manipulating images.
Use a face to make images
Make realistic images of people instantly
Flux fine-tunes
Browse the diverse range of fine-tunes the community has custom-trained on Replicate
Generate music
Models to generate and modify music
Generate videos
Models that create and edit videos
Generate speech
Convert text to speech
Make 3D stuff
Models that generate 3D objects, scenes, radiance fields, textures and multi-views.
Get structured data
Language models that support grammar-based decoding as well as JSON Schema constraints.
Popular models
SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps
A simple OCR model that can easily extract text from an image.
Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification
Return CLIP features for the clip-vit-large-patch14 model
A text-to-image generative AI model that creates beautiful images
Latest models
An SDXL model trained on images from another SDXL-hiroshinagai model
Just some good ole BeautifulSoup URL-scraping magic. (Some sites don't work because they block scraping, but it's still useful.)
llava-phi-3-mini is a LLaVA model fine-tuned from microsoft/Phi-3-mini-4k-instruct
PyTorch implementation of AnimeGAN for fast photo animation
Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis
AbsoluteReality V1.8.1 Model (Text2Img, Img2Img and Inpainting)
Phi-3-Mini-128K-Instruct is a 3.8 billion-parameter, lightweight, state-of-the-art open model trained using the Phi-3 datasets
Newest reranker model from BAAI (https://huggingface.co/BAAI/bge-reranker-v2-m3). FP16 inference enabled. Normalize parameter available.
Generate a video that morphs between subjects, with an optional style
An efficient, intelligent, and truly open-source language model
Make stickers with AI. Generates graphics with transparent backgrounds.
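yuan2.0-2b-mars is the March 2024 version of the Yuan 2.0-2B model. Yuan 2.0 is a new-generation foundation large language model released by Inspur Information. All three models are open source: Yuan 2.0-102B, Yuan 2.0-51B, and Yuan 2.0-2B. Scripts for pretraining, fine-tuning, and inference serving are also provided so that developers can build on them. Building on Yuan 1.0, Yuan 2.0 uses more diverse, high-quality pretraining data and instruction fine-tuning datasets to give the model stronger understanding of semantics, mathematics, reasoning, code, and knowledge.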
Idefics2 is an open multimodal model that accepts arbitrary sequences of image and text inputs and produces text outputs
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
FlashFace: Human Image Personalization with High-fidelity Identity Preservation
Text-to-image model trained on LAION HighRes and fine-tuned on internal datasets
snowflake-arctic-embed is a suite of text embedding models that focuses on creating high-quality retrieval models optimized for performance
Input your name, and this model will print the most handsome man
Base version of Llama 3, a 70 billion parameter language model from Meta.
A 70 billion parameter language model from Meta, fine-tuned for chat completions
An 8 billion parameter language model from Meta, fine-tuned for chat completions
Base version of Llama 3, an 8 billion parameter language model from Meta.