Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Optical character recognition to turn images of latex equations into latex format.

Updated 838 runs

Create illusions with img2img and masking support

Updated 51.5K runs

Updated 45 runs

MAEST is a family of Transformer models based on PASST and focused on music analysis applications. The MAEST models are also available for inference in the Essentia library and for inference and training in the official repository.

Updated 1.8K runs

Modify images using line art

Updated 55.6K runs

Source: Nexusflow/NexusRaven-13B ✦ Quant: TheBloke/NexusRaven-13B-AWQ ✦ Surpassing the state-of-the-art in open-source function calling LLMs

Updated 54 runs

Source: haoranxu/ALMA-7B ✦ Quant: TheBloke/ALMA-7B-AWQ ✦ ALMA (Advanced Language Model-based trAnslator) is an LLM-based translation model

Updated 94 runs

in_painting

Updated 63 runs

Grounding Multimodal Large Language Models to the World

Updated 1.9K runs

An English, monolingual embedding model supporting 8192 sequence length (33M version)

Updated 33 runs

Segmind Stable Diffusion Model (SSD-1B) img2img

Updated 4.5K runs

Distilled version of Whisper

Updated 275 runs

Identifies NSFW images

Updated 493 runs

openclip analyzes pictures to generate description information

Updated 89 runs

Image Paint Style

Updated 134 runs

T4 GPU, negative embeddings, img2img, inpainting, safety checker, KarrasDPM, pruned fp16 safetensor

Updated 1.8K runs

T4 GPU, negative embeddings, img2img, inpainting, safety checker, KarrasDPM, pruned fp16 safetensor

Updated 3.5K runs

meta/llama-2-7b-chat

A 7 billion parameter language model from Meta, fine tuned for chat completions

Updated 18M runs

SDXL v1.0 - A text-to-image generative AI model that creates beautiful images

Updated 478.8K runs

Implementation of SDXL RealVisXL_V1.0 img2img

Updated 3.4K runs

Generate music restricted to chord sequences and tempo

Updated 2.7K runs

Inference SD 1.5 with cog including several models.

Updated 1.8K runs

CarAI: Evaluate Car Damages

Updated 91 runs

Tuning-Free Longer Video Diffusion via Noise Rescheduling

Updated 14.9K runs

Zephyr-7B-beta, an LLM trained to act as a helpful assistant.

Updated 5.7K runs

Monster Labs' Controlnet QR Code Monster v2 For SD-1.5 on top of AnimateDiff Prompt Travel (Motion Module SD 1.5 v2)

Updated 10.2K runs

FILM: Frame Interpolation for Large Motion, In ECCV 2022.

Updated 48.1K runs

Mistral-7B-v0.1 fine tuned for chat with the Dolphin dataset (an open-source implementation of Microsoft's Orca)

Updated 36.2K runs

Mistral-7B-v0.1 fine tuned for chat with the Dolphin dataset (an open-source implementation of Microsoft's Orca)

Updated 13.4K runs

👻

Updated 298 runs

test api

Updated 306 runs

Transform your image or QR code like never before

Updated 1.8K runs

A 40 billion parameter language model trained to follow human instructions.

Updated 41.4K runs

Video Object Segmentation, combined with SAM and ProPainter

Updated 309 runs

An English, monolingual embedding model supporting 8192 sequence length (137M version)

Updated 50 runs

Modify images using canny edges

Updated 105.4K runs

Modify images using sketches

Updated 18.4K runs

Modify images using human pose

Updated 61.7K runs

Modify images using depth maps

Updated 620.2K runs

Source: HuggingFaceH4/zephyr-7b-beta ✦ Quant: TheBloke/zephyr-7B-beta-AWQ ✦ Zephyr is a series of language models that are trained to act as helpful assistants. Zephyr-7B-β is the second model in the series

Updated 188.8K runs

This is a cog implementation of "openbuddy-llemma-34b" 4-bit quantization model.

Updated 273 runs

Mask prompting based on Grounding DINO & Segment Anything | Integral cog of doiwear.it

Updated 688.9K runs

A 6B parameter open bilingual chat LLM | 开源双语对话语言模型

Updated 15.3K runs

A 6B parameter open bilingual chat LLM (optimized for 8k+ context) | 开源双语对话语言模型

Updated 328 runs

SDXL 1.0 + Wrong LoRA weights + Better VAE | WIP

Updated 703 runs

Openbuddy finetuned mistral-7b in GPTQ quantization in 4bits by TheBloke

Updated 68 runs

Updated 10 runs

A 8k sequence length text embedding set trained by Jina AI

Updated 107 runs

Updated 230 runs

MistralLiteA is a fine-tuned Mistral-7B-v0.1 language model, with enhanced capabilities of processing long context (up to 32K tokens)

Updated 655 runs