Explore

I want to…

Use official models

Official models are always on, maintained, and have predictable pricing.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Açıklama Testi

Updated 7 runs

Updated 858 runs

Fast Mochi by Hao AI Lab

Updated 46 runs

minimax/music-01

Quickly generate up to 1 minute of music with lyrics and vocals in the style of a reference track

Updated 50.4K runs

Lipsync model using MuseTalk

Updated 1.8K runs

Updated 7 runs

Ollama Llama 3.2 Vision 90B

Updated 2K runs

comfy with flux model,

Updated 145.2K runs

Ollama Llama 3.2 Vision 11B

Updated 1.7K runs

Ollama QwQ 32B

Updated 35 runs

Clothing segmentation tool that generates masks from outfit images, separating them into top and bottom pieces with automatic background removal and edge refinement.

Updated 38 runs

Ollama Llama 3.3 70B

Updated 172 runs

Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis

Updated 119 runs

Apollo 7B - An Exploration of Video Understanding in Large Multimodal Models

Updated 1.4K runs

minimax/video-01-live

An image-to-video (I2V) model specifically trained for Live2D and general animation use cases

Updated 58.5K runs

Apollo 3B - An Exploration of Video Understanding in Large Multimodal Models

Updated 87 runs

Video Background Removal

Updated 1.6K runs

Prepare arXiv papers for processing by Large Language Models (LLMs) by converting them into a single, expanded LaTeX file.

Updated 10 runs

Arbitrary-steps Image Super-resolution via Diffusion Inversion

Updated 1.3K runs

Video Preprocessing tool for captioning multiple videos using GPT, Claude or Gemini

Updated 82 runs

Simple tool to split apart a video into snippets

Updated 77 runs

Create ads for marketing, social media with your own company logo on any object you want.

Updated 149 runs

luma/ray

Fast, high quality text-to-video and image-to-video (Also known as Dream Machine)

Updated 12.7K runs

recraft-ai/recraft-20b

Affordable and fast images

Updated 58.6K runs

recraft-ai/recraft-20b-svg

Affordable and fast vector images

Updated 8.5K runs

Add sound to video. An advanced AI model that synthesizes high-quality audio from video content, enabling seamless video-to-audio transformation

Updated 118.6K runs

Fast FLUX DEV -> Flux Controlnet Canny, Controlnet Depth , Controlnet Line Art, Controlnet Upscaler - You can use just one controlnet or All - LORAs: HyperFlex LoRA , Add Details LoRA , Realism LoRA

Updated 85.5K runs

a-r-r-o-w/cogvideox-factory for Mochi-1 LoRA Training

Updated 27 runs

Make realistic images of real people instantly

Updated 793.1K runs

A state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from text descriptions

Updated 1.3K runs

MEMO is a state-of-the-art open-weight model for audio-driven talking video generation.

Updated 403 runs

Updated 312 runs

Hunyuan Video 8bit model API for video generation

Updated 232 runs

High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training

Updated 139 runs

A fast image model with wide artistic range and resolutions up to 4096x4096

Updated 89.2K runs

Moondream 0.5B, the world's smallest vision language model

Updated 45 runs

The Qwen2.5-Coder-32B-Instruct is a state-of-the-art, open-source large language model (LLM). It is specifically designed for coding tasks and is part of the Qwen2.5-Coder series, featuring 32 billion parameters.

Updated 45 runs

luma/photon-flash

Accelerated variant of Photon prioritizing speed while maintaining quality

Updated 39.4K runs

luma/photon

High-quality image generation model optimized for creative professional workflows and ultra-high fidelity outputs

Updated 188.6K runs

The Llama 3.2 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction-tuned generative models in 1B and 3B sizes (text in/text out).

Updated 14 runs

Hey, this is a fork of flux pulid to support multiple ids, use with a depth map and define bounding boxes for each face

Updated 2K runs

haiper-ai/haiper-video-2

Generate 4s and 6s videos from a prompt or image

Updated 6.1K runs

Updated 384 runs

black-forest-labs/flux-dev-lora

A version of flux-dev, a text to image model, that supports fast fine-tuned lora inference

Updated 254.9K runs

flux.1-dev: hyper-sd 8 steps + instanx ip adataper + pulid + depth controlnet

Updated 210 runs

A version of mochi-1 (a text to video model) that supports fine-tuned lora inference

Updated 96 runs

black-forest-labs/flux-schnell-lora

The fastest image generation model tailored for fine-tuned use

Updated 429.6K runs

Let Vision Language Models Reason Step-by-Step

Updated 22 runs

Mochi 1 preview is an open video generation model with high-fidelity motion and strong prompt adherence in preliminary evaluation

Updated 1.7K runs

SVDQuant Optimized Flux.Schnell

Updated 30 runs