Explore
Featured models

kwaivgi / kling-v2.0
Generate 5s and 10s videos in 720p resolution

topazlabs / video-upscale
Video Upscaling from Topaz Labs

fofr / color-matcher
Color match and white balance fixes for images

meta / llama-4-maverick-instruct
A 17 billion parameter model with 128 experts

nvidia / sana-sprint-1.6b
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

black-forest-labs / flux-1.1-pro
Faster, better FLUX Pro. Text-to-image model with excellent image quality, prompt adherence, and output diversity.

wavespeedai / wan-2.1-i2v-480p
Accelerated inference for Wan 2.1 14B image to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.

easel / advanced-face-swap
Face swap one or two people into a target image

anthropic / claude-3.7-sonnet
The most intelligent Claude model and the first hybrid reasoning model on the market (claude-3-7-sonnet-20250219)

Fine-tune FLUX
Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)
I want to…
Make videos with Wan2.1
Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.
Generate images
Models that generate images from text prompts
Generate videos
Models that create and edit videos
Caption images
Models that generate text from images
Transcribe speech
Models that convert speech to text
Upscale images
Upscaling models that create high-quality images from low-quality images
Restore images
Models that improve or restore images by deblurring, colorization, and removing noise
Use a face to make images
Make realistic images of people instantly
Edit images
Tools for manipulating images.
Caption videos
Models that generate text from videos
Generate text
Models that can understand and generate text
Use official models
Official models are always on, maintained, and have predictable pricing.
Enhance videos
Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.
Generate speech
Convert text to speech
Remove backgrounds
Models that remove backgrounds from images and videos
Use handy tools
Toolbelt-type models for videos and images.
Detect objects
Models that detect or segment objects in images and videos.
Generate music
Models to generate and modify music
Sing with voices
Voice-to-voice cloning and musical prosody
Make 3D stuff
Models that generate 3D objects, scenes, radiance fields, textures and multi-views.
Chat with images
Ask language models about images
Extract text from images
Optical character recognition (OCR) and text extraction
Get embeddings
Models that generate embeddings from inputs
Use the FLUX family of models
The FLUX family of text-to-image models from Black Forest Labs
Use FLUX fine-tunes
Browse the diverse range of fine-tunes the community has custom-trained on Replicate
Control image generation
Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.
Popular models
SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps
Generate CLIP (clip-vit-large-patch14) text & image embeddings
Return CLIP features for the clip-vit-large-patch14 model
Latest models
generate pixel art sprite sheets from four different angles with Stable-diffusion
Stable diffusion fork for generating tileable outputs using v1.5 model
Disco Diffusion style on Stable Diffusion via Dreambooth
Animation Studio on Stable Diffusion via Dreambooth
Stable Diffusion fine-tuned of the Codex Borgia, a 16th century Meso-American manuscript.
fine-tuned Stable Diffusion model trained on the game art from Elden Ring
Use Runway's Stable-diffusion inpainting model to create an infinite loop video
Animate Stable Diffusion by interpolating between two prompts
Stable Diffusion fined-tuned on frames from Monkey Island 1 and 2
Prompt-to-prompt image editing with cross-attention control
3 Million Runs! AI Photorealistic Image Super-Resolution and Restoration
Stable diffusion, but with more powerful in-painting & out-painting capabilities
Stable Diffusion with Aesthetic Gradients
Method for generating bizarre looking videos from a series of language descriptions of the video. From the Bot Intelligence Group at CMU: Peter Schaldenbrand, Zhixuan Liu, & Jean Oh
Highly Accurate Dichotomous Image Segmentation (ECCV 2022)
Generate image from text by guiding a denoising diffusion model. Inference is somewhat slow.
Generate images from text quickly. See https://replicate.com/afiaka87/laionide-v2 for a new checkpoint.
The predecessor to DALLE-2, GLIDE (filtered) with faster PRK/PLMS sampling.
mediapipe facial landmark detection demo by Marlene Mhangami
high-resolution piano transcription system: detects piano notes from audio
Emotional conditioned music generation using transformer-based model.