Explore

I want to…

Use official models

Official models are always on, maintained, and have predictable pricing.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Video Smoother: AMT All-Pairs Multi-Field Transforms for Efficient Frame Interpolation

Updated 16.4K runs

Updated 34.8K runs

Updated 85.4K runs

NeuralBeagle14-7B is (probably) the best 7B model you can find!

Updated 12.2K runs

Updated 269 runs

Source: SciPhi/Sensei-7B-V1 ✦ Quant: TheBloke/Sensei-7B-V1-AWQ ✦ Sensei is specialized in performing RAG over detailed web search results

Updated 34 runs

Source: WhiteRabbitNeo/WhiteRabbitNeo-13B-v1 ✦ TheBloke/WhiteRabbitNeo-13B-AWQ ✦ WhiteRabbitNeo is a model series that can be used for offensive and defensive cybersecurity

Updated 115 runs

PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

Updated 4.7K runs

Create photos, paintings and avatars for anyone in any style within seconds.

Updated 5.6M runs

Third party Fooocus replicate model with preset 'anime'

Updated 217K runs

Third party Fooocus replicate model with preset 'realistic'

Updated 562.2K runs

Third party Fooocus replicate model

Updated 1.3M runs

An Open Source text-to-speech system built by inverting Whisper

Updated 1.6K runs

Unofficial Re-Trained AnimateAnyone (Image + DWPose Video → Animated Video of Image)

Updated 838 runs

Updated 255 runs

Photorealism with RealVisXL V3.0 Turbo based on SDXL

Updated 153.6K runs

Image-Prompt Multi-view Diffusion for 3D Generation

Updated 1.4K runs

Implementation of Realistic Vision v5.1 to conjure up images of the potential baby using a single photo from each parent

Updated 1.7M runs

MAGNeT: Masked Audio Generation using a Single Non-Autoregressive Transformer

Updated 1.9K runs

ForgeSaga Landscape

Updated 109 runs

Manmaru mix v3.0

Updated 696 runs

Source: allenai/digital-socrates-13b ✦ Quant: TheBloke/digital-socrates-13B-AWQ ✦ Digital Socrates is an open-source, automatic explanation-critiquing model

Updated 17 runs

Source: Unbabel/TowerInstruct-7B-v0.1 ✦ Quant: TheBloke/TowerInstruct-7B-v0.1-AWQ ✦ This model is trained to handle several translation-related tasks, such as general machine translation, gramatical error correction, and paraphrase generation

Updated 188 runs

Improving the Stability of Diffusion Models for Content Consistent Super-Resolution

Updated 3.3K runs

ProteusV0.1 uses OpenDalleV1.1 as a base and further refines prompt adherence and stylistic capabilities to a measurable degree

Updated 6.7K runs

Great image quality, good old SDXL with a new and improved Tile refiner.

Updated 788 runs

Audio-based Lip Synchronization for Talking Head Video

Updated 28K runs

SDVN10-Anime

Updated 330 runs

Improved background remover 2.0 - GroundingDino + SAM + Inpainting SDXL + Controlnet Canny

Updated 205 runs

Automatic Speech Recognition with Word-level Timestamps & Diarization

Updated 3.6K runs

Towards Photo-Realistic Image Colorization via Dual Decoders

Updated 204K runs

Source: Neuronovo/neuronovo-7B-v0.3 ✦ Quant: TheBloke/neuronovo-7B-v0.3-AWQ ✦ Neuronovo/neuronovo-7B-v0.3 model represents an advanced and fine-tuned version of a large language model, initially based on CultriX/MistralTrix-v1.

Updated 37 runs

Yuan2.0 is a new generation LLM developed by IEIT System, enhanced the model's understanding of semantics, mathematics, reasoning, code, knowledge, and other aspects.

Updated 384 runs

SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution

Updated 62.8K runs

Pheme generates a variety of conversational voices in 16 kHz for phone-call applications

Updated 467 runs

90s anime

Updated 21.3K runs

multilingual-e5-large: A multi-language text embedding model

Updated 15.3M runs

multilingual-e5-base: A multi-language text embedding model

Updated 8 runs

multilingual-e5-small: A multi-language text embedding model

Updated 11 runs

Amused is a lightweight text to image model based off of the muse architecture. Amused is particularly useful in applications that require a lightweight and fast model such as generating many images quickly at once.

Updated 195 runs

Super-fast, 0.6s per image. LCM with img2img, large batching and canny controlnet

Updated 1.3M runs

Cheaper model SwinIR: Image Restoration Using Swin Transformer (analogue of the popular model: jingyunliang/swinir)

Updated 843 runs

A SOTA Nous Research finetune of 200k Yi-34B fine tuned on the Capybara dataset.

Updated 1.9K runs

Video toolkit – convert, make GIFs, extract audio

Updated 4.8K runs

Make your video talk anything

Updated 1.3K runs

Diffusion Models for Image Morphing

Updated 1.1K runs

A 34 billion parameter Llama tuned for coding and conversation

Updated 152.3K runs

A 7 billion parameter Llama tuned for coding and conversation

Updated 63.1K runs

(Academic and Non-commercial use only) Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization

Updated 39.8K runs

Lob RealVis XL

Updated 60.3K runs