Explore

I want to…

Use official models

Official models are always on, maintained, and have predictable pricing.

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Clean Text from Manhwa/Manhua

Updated 7 runs

Interior Decoration Space Scaling - Second Use Case

Updated 65 runs

A model to get images.

Updated 275 runs

This model generates speech.

Updated 28 runs

An F5-TTS model fine-tuned for Spanish.

Updated 327 runs

Dreamlike Diffusion Model for Splurge Art

Updated 2.2K runs

From Sketch to Reality: Transforming Outlines into Lifelike Images

Updated 44.3K runs

baby transformer for blog post

Updated 26 runs

staging testing

Updated 243 runs

Document translation with contextual integrity.

Updated 56 runs

Align text to audio with exact word timings. All characters supported!

Updated 70.8K runs

Projection module trained to add vision capabilities to Llama 3 using SigLIP.

Updated 5.5K runs

recraft-ai/recraft-v3-svg

Recraft V3 SVG (code-named red_panda) is a text-to-image model that can generate high-quality SVG images, including logotypes and icons. The model supports a wide range of styles.

Updated 61.9K runs

recraft-ai/recraft-v3

Recraft V3 (code-named red_panda) is a text-to-image model that can generate long texts and images in a wide range of styles. As of today, it is SOTA in image generation, according to the Text-to-Image Benchmark by Artificial Analysis.

Updated 1.2M runs

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Updated 21 runs

Apple's monocular depth estimation foundation model (Depth Pro)

Updated 1.5K runs

OmniGen: Unified Image Generation

Updated 8.2K runs

Create audio clips from text

Updated 7 runs

Explorador FLUX.1-Dev LoRA

Updated 67 runs

Fine-tune StableDiffusion3.5-Large with Hugging Face Diffusers

Updated 441 runs

Run any python code

Updated 5.2K runs

Ostris AI-Toolkit for StableDiffusion3.5-Large LoRA Training

Updated 204 runs

stability-ai/stable-diffusion-3.5-medium

A 2.5-billion-parameter image model with an improved MMDiT-X architecture.

Updated 17.5K runs

Analyzes music to determine song structure, BPM, and downbeats, and demuxes audio.

Updated 578 runs

Sayak Paul's cartoonizer, deployed to Replicate. Here's the model: https://huggingface.co/instruction-tuning-sd/cartoonizer

Updated 121 runs

flux.1-lite-8B-alpha by Freepik

Updated 311 runs

Stable Diffusion 3.5 Large - LoRA Explorer

Updated 1.6K runs

One shot portrait maker.

Updated 22.6K runs

Remove the background of a video and add your own.

Updated 247 runs

Powerful text-to-video model that generates high-quality videos of up to 6 seconds at 15 FPS and 720p resolution from a simple text prompt.

Updated 118 runs

fancyfeast/joytag

Updated 15.1K runs

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching. Voice cloning

Updated 17.2K runs

ChatTTS is a text-to-speech model designed specifically for dialogue scenarios such as LLM assistants.

Updated 122 runs

stability-ai/stable-diffusion-3.5-large-turbo

A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, with a focus on fewer inference steps.

Updated 76K runs

stability-ai/stable-diffusion-3.5-large

A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, thanks to Query-Key Normalization.

Updated 492.2K runs

ideogram-ai/ideogram-v2-turbo

A fast image model with state-of-the-art inpainting, prompt comprehension, and text rendering.

Updated 790.8K runs

ideogram-ai/ideogram-v2

An excellent image model with state-of-the-art inpainting, prompt comprehension, and text rendering.

Updated 482.6K runs