Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects.

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise.

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.
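As a rough illustration of what an edge-based control image is, the sketch below computes a binary edge map from a grayscale image using the Sobel operator. It is only an assumption-laden example: the function name `sobel_edges`, the `threshold` parameter, and the tiny test image are hypothetical, and the models listed here may preprocess their inputs differently (for example with Canny or a learned line-art detector).

```python
# Minimal sketch: build a binary edge map with the Sobel operator.
# Pure Python, no dependencies. Names and threshold are illustrative assumptions.

def sobel_edges(image, threshold=128):
    """Return a binary edge map (0 or 255) the same size as `image`.

    `image` is a list of rows of grayscale values in 0..255.
    Border pixels stay 0 because the 3x3 kernel does not fit there.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    gx_kernel = ((-1, 0, 1), (-2, 0, 2), (-1, 0, 1))   # horizontal gradient
    gy_kernel = ((-1, -2, -1), (0, 0, 0), (1, 2, 1))   # vertical gradient
    edges = [[0] * width for _ in range(height)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = gy = 0
            for dy in range(3):
                for dx in range(3):
                    pixel = image[y + dy - 1][x + dx - 1]
                    gx += gx_kernel[dy][dx] * pixel
                    gy += gy_kernel[dy][dx] * pixel
            magnitude = (gx * gx + gy * gy) ** 0.5
            edges[y][x] = 255 if magnitude >= threshold else 0
    return edges


# Hypothetical 5x5 image: dark on the left, bright on the right.
image = [[0, 0, 255, 255, 255] for _ in range(5)]
edge_map = sobel_edges(image)
```

An edge map like `edge_map` marks where brightness changes sharply, which is the kind of structural hint an edge-conditioned model takes alongside the text prompt.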

Latest models

controlnet 1.1 lineart x realistic-vision-v2.0 (updated to v5)

Updated 5M runs

BakLLaVA-1 is a Mistral 7B base augmented with the LLaVA 1.5 architecture

Updated 39.1K runs

Updated 585 runs

Source: teknium/Mistral-Trismegistus-7B ✦ Quant: TheBloke/Mistral-Trismegistus-7B-AWQ ✦ Mistral Trismegistus is a model made for people interested in the esoteric, occult, and spiritual

Updated 598 runs

Source: bavest/fin-llama-33b ✦ Quant: TheBloke/fin-llama-33B-AWQ ✦ Efficient Finetuning of Quantized LLMs for Finance

Updated 308 runs

😊 Hotshot-XL is an AI text-to-GIF model trained to work alongside Stable Diffusion XL

Updated 623.8K runs

Source: migtissera/Synthia-13B-v1.2 ✦ Quant: TheBloke/Synthia-13B-v1.2-AWQ ✦ SynthIA (Synthetic Intelligent Agent) is a Llama-2-13B model trained on Orca-style datasets

Updated 590 runs

Source: ajibawa-2023/carl-llama-2-13b ✦ Quant: TheBloke/Carl-Llama-2-13B-AWQ ✦ Carl: A Therapist AI

Updated 546 runs

Detect everything with language!

Updated 6.3M runs

Music Generator

Updated 525 runs

Marks roads in an image

Updated 13 runs

Qwen-VL-Chat but with raw ChatML prompt interface and streaming

Updated 1.1K runs

Content AI detector

Updated 39.6K runs

Huggingface-sdxl-inpainting is an SDXL model fine-tuned for inpainting, with inputs and outputs having the same height and width.

Updated 2.6K runs

Updated 167 runs

Fuyu-8B is a multi-modal text and image transformer trained by Adept AI

Updated 4.6K runs

Separate Anything You Describe

Updated 4.4K runs

🎨AnimateDiff Prompt Travel🧭 Seamlessly Navigate and Animate Between Text-to-Image Prompts for Dynamic Visual Narratives

Updated 5.7K runs

A simple OCR model that can easily extract text from an image.

Updated 89.9M runs

Updated 346 runs

Video object segmentation for short and long videos

Updated 41 runs

Open diffusion model for high-quality video generation

Updated 10.5K runs

Synthesizing High-Resolution Images with Few-Step Inference

Updated 1.1M runs

Batch mode for text & image embeddings

Updated 69 runs

Tuning-free Higher-Resolution Visual Generation with Diffusion Models

Updated 1.1K runs

Generate videos from text prompts with Kandinsky-2.2

Updated 7.3K runs

SAM (Segment Anything) ViT-H image encoder

Updated 357.9K runs

Whisper-Large-V2 + Pyannote 3.0 diarization via WhisperX

Updated 105 runs

Anime Pastel Dream Model For Splurge Art

Updated 5.7K runs

Neurogen Model for Splurge Art

Updated 4.8K runs

Dreamlike Anime 1.0 for Splurge Art

Updated 6.2K runs

Dreamlike Photoreal Model for Splurge Art

Updated 3.3K runs

Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Updated 985 runs

TheBloke/Nous-Hermes-Llama2-AWQ served with vLLM

Updated 7.3K runs

Gender recognition for audio files

Updated 4.3K runs

cog-resnet example trial

Updated 10 runs

Make a transcription of a phone call

Updated 14 runs

Trained on plants

Updated 28 runs

My own personal copy of daanelson/whisperx

Updated 312 runs

A multimodal LLM-based AI assistant, which is trained with alignment techniques. Qwen-VL-Chat supports more flexible interaction, such as multi-round question answering, and creative capabilities.

Updated 800K runs

Mistral-7B-v0.1 fine-tuned for chat with the OpenOrca dataset.

Updated 65.9K runs

Using a ComfyUI workflow to run SDXL text2img

Updated 449 runs

Zero-shot / open vocabulary object detection

Updated 23.8K runs

A high-performing language model trained to act as a helpful assistant

Updated 8K runs

Controlling Vision-Language Models for Universal Image Restoration

Updated 2.2K runs

✨DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Updated 133.4K runs

Updated 106 runs

Object removal, video completion and video outpainting

Updated 4.4K runs