Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

ProteusV0.1 uses OpenDalleV1.1 as a base and further refines prompt adherence and stylistic capabilities to a measurable degree

Updated 6.7K runs

Great image quality, good old SDXL with a new and improved Tile refiner.

Updated 807 runs

Audio-based Lip Synchronization for Talking Head Video

Updated 29.5K runs

SDVN10-Anime

Updated 337 runs

Improved background remover 2.0 - GroundingDino + SAM + Inpainting SDXL + Controlnet Canny

Updated 205 runs

Automatic Speech Recognition with Word-level Timestamps & Diarization

Updated 4K runs

Towards Photo-Realistic Image Colorization via Dual Decoders

Updated 274.3K runs

Source: Neuronovo/neuronovo-7B-v0.3 ✦ Quant: TheBloke/neuronovo-7B-v0.3-AWQ ✦ Neuronovo/neuronovo-7B-v0.3 model represents an advanced and fine-tuned version of a large language model, initially based on CultriX/MistralTrix-v1.

Updated 41 runs

Yuan2.0 is a new generation LLM developed by IEIT System, enhanced the model's understanding of semantics, mathematics, reasoning, code, knowledge, and other aspects.

Updated 387 runs

SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution

Updated 72.4K runs

Pheme generates a variety of conversational voices in 16 kHz for phone-call applications

Updated 515 runs

90s anime

Updated 21.3K runs

multilingual-e5-large: A multi-language text embedding model

Updated 18.7M runs

multilingual-e5-base: A multi-language text embedding model

Updated 10 runs

multilingual-e5-small: A multi-language text embedding model

Updated 14 runs

Amused is a lightweight text to image model based off of the muse architecture. Amused is particularly useful in applications that require a lightweight and fast model such as generating many images quickly at once.

Updated 196 runs

Super-fast, 0.6s per image. LCM with img2img, large batching and canny controlnet

Updated 1.4M runs

Cheaper model SwinIR: Image Restoration Using Swin Transformer (analogue of the popular model: jingyunliang/swinir)

Updated 851 runs

A SOTA Nous Research finetune of 200k Yi-34B fine tuned on the Capybara dataset.

Updated 1.9K runs

Video toolkit – convert, make GIFs, extract audio

Updated 7.5K runs

Diffusion Models for Image Morphing

Updated 1.1K runs

A 34 billion parameter Llama tuned for coding and conversation

Updated 154.7K runs

A 7 billion parameter Llama tuned for coding and conversation

Updated 65.2K runs

(Academic and Non-commercial use only) Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization

Updated 40.5K runs

Lob RealVis XL

Updated 60.6K runs

whisper-large-v3, incredibly fast, with video transcription

Updated 123.2K runs

RESEARCH/NON-COMMERCIAL USE ONLY: diffusion-based audio-driven expressive talking head generation

Updated 1K runs

SDXL LoRA finetuned on diamond watches

Updated 53 runs

SDXL LoRA finetuned on Vermeer paintings

Updated 89 runs

SDXL using DeepCache

Updated 3.8K runs

Chest X ray

Updated 2.6K runs

Anydoor: zero-shot object-level image customization

Updated 2K runs

RealVisXl V3 with multi-controlnet, lora loading, img2img, inpainting

Updated 1.5M runs

Source: pipizhao/Pandalyst-7B-V1.2 ✦ Quant: TheBloke/Pandalyst-7B-v1.2-AWQ ✦ Pandalyst: A large language model for mastering data analysis using pandas

Updated 20 runs

Honeycomb NLQ Generator

Updated 40 runs

RESEARCH/NON-COMMERCIAL USE ONLY: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models

Updated 126.9K runs

fastai lesson 1 - bird or forest

Updated 229 runs

This is the chat model finetuned on top of TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T

Updated 575 runs

🗣️ Nvidia + Suno.ai's speech-to-text conversion with high accuracy and efficiency 📝

Updated 1.8K runs

SDXL LoRA finetuned on Basquiat Paintings

Updated 167 runs

Nougat: Neural Optical Understanding for Academic Documents

Updated 232 runs

Versatile Audio Super-resolution at Scale which upsamples audio files to 48khz. Longer audio input is possible with this model

Updated 2.4K runs

Detecting Twenty-thousand Classes using Image-level Supervision

Updated 275 runs

Source: TinyLlama/TinyLlama-1.1B-Chat-v1.0 ✦ Quant: TheBloke/TinyLlama-1.1B-Chat-v1.0-AWQ ✦ The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Updated 108 runs

SDXL LoRA I trained on chihuahua images

Updated 25 runs

PyTSMod is an open-source library for Time-Scale Modification(eg. time-stretching) algorithms, by Sangeon Yong at MAC Lab, KAIST.

Updated 174 runs

Zero-shot classifier which classifies text into categories of your choosing. Returns a dictionary of the most likely class and all class likelihoods.

Updated 3.6K runs

Nous Hermes 2 - SOLAR 10.7B is the flagship Nous Research model on the SOLAR 10.7B base model..

Updated 70.5K runs

Nous Hermes 2 - SOLAR 10.7B is the flagship Nous Research model on the SOLAR 10.7B base model.

Updated 9K runs