zsxkib/aura-sr

AuraSR: GAN-based Super-Resolution for real-world

177 runs
Public

zsxkib/qwen2-7b-instruct

Qwen 2: A 7 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

16 runs
Public

zsxkib/qwen2-1.5b-instruct

Qwen 2: A 1.5 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

68 runs
Public

zsxkib/qwen2-0.5b-instruct

Qwen 2: A 0.5 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

127 runs
Public

zsxkib/llm-prototype-model

2 runs
Public

zsxkib/sd3-controlnet

✨Stable Diffusion 3 w/ ⚡InstantX's Canny, Pose, and Tile ControlNets🖼️

357 runs
Public

zsxkib/v-express

🫦 Realistic facial expression manipulation (lip-syncing) using audio or video

381 runs
Public

zsxkib/hololive-style-bert-vits2

🎙️Hololive text-to-speech and voice-to-voice (Japanese🇯🇵 + English🇬🇧)

123 runs
Public

zsxkib/instant-id

Make realistic images of real people instantly

485.8K runs
Public

zsxkib/wd-image-tagger

Image tagger fine-tuned on WaifuDiffusion w/ (SwinV2, SwinV2, ConvNext, and ViT)

34 runs
Public

zsxkib/ic-light

✍️✨Prompts to auto-magically relights your images

27K runs
Public

zsxkib/ic-light-background

🖼️✨Background images + prompts to auto-magically relights your images (+normal maps🗺️)

2.5K runs
Public

zsxkib/pulid

📖 PuLID: Pure and Lightning ID Customization via Contrastive Alignment

98.7K runs
Public

zsxkib/blip-3

Blip 3 / XGen-MM, Answers questions about images ({blip3,xgen-mm}-phi3-mini-base-r-v1)

415 runs
Public

zsxkib/talknet-asd

🗣️ TalkNet-ASD: Detect who is speaking in a video

66 runs
Public

zsxkib/flash-face

FlashFace: Human Image Personalization with High-fidelity Identity Preservation

2.5K runs
Public

zsxkib/prototype-model

A test model (instantid)

652 runs
Public

zsxkib/animate-diff-scene-assembler

Dkamacho’s Scene Assembler

274 runs
Public

zsxkib/yolo-world

Real-Time Open-Vocabulary Object Detection

2.6K runs
Public

zsxkib/uform-gen

🖼️ Super fast 1.5B Image Captioning/VQA Multimodal LLM (Image-to-Text) 🖋️

1.7K runs
Public

zsxkib/patch-fusion

Super High Quality Depth Maps 🗺️: An End-to-End Tile-Based Framework 🏗️ for High-Resolution Monocular Metric Depth Estimation 🔍📏

288 runs
Public

zsxkib/tortoise-then-rvc

278 runs
Public

zsxkib/create-rvc-dataset

Create your own Realistic Voice Cloning (RVC v2) dataset using a YouTube link

4.2K runs
Public

zsxkib/realistic-voice-cloning

Create song covers with any RVC v2 trained AI voice from audio files.

238.5K runs
Public

zsxkib/stable-diffusion-safety-checker

Identifies NSFW images

296 runs
Public

zsxkib/animatediff-illusions

Monster Labs' Controlnet QR Code Monster v2 For SD-1.5 on top of AnimateDiff Prompt Travel (Motion Module SD 1.5 v2)

8.6K runs
Public

zsxkib/film-frame-interpolation-for-large-motion

FILM: Frame Interpolation for Large Motion, In ECCV 2022.

33.5K runs
Public

zsxkib/prototype-model2

6 runs
Public

zsxkib/animatediff-prompt-travel

🎨AnimateDiff Prompt Travel🧭 Seamlessly Navigate and Animate Between Text-to-Image Prompts for Dynamic Visual Narratives

5.4K runs
Public

zsxkib/diffbir

✨DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

125.5K runs
Public

zsxkib/st-mfnet

📽️ Increase Framerate 🎬 ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation

38.1K runs
Public

zsxkib/animate-diff

🎨 AnimateDiff (w/ MotionLoRAs for Panning, Zooming, etc): Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

40.9K runs
Public

zsxkib/draggan

🐲 DragGAN 🐉 - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold"

560 runs
Public

zsxkib/lil-flan-bias-logits-warper

Logit Warping via Biases for Google's FLAN-T5-small

43 runs
Public

zsxkib/clip-age-predictor

Age prediction using CLIP - Patched version of `https://replicate.com/andreasjansson/clip-age-predictor` that works with the new version of cog!

137.5K runs
Public

zsxkib/emotion2color

Transform your text into a beautiful two-tone color gradient that represents your emotions.

346 runs
Public

zsxkib/hello-world

A "Hello World" model for me to get to grips with `cog` and Replicate

41 runs
Public

zsxkib/illuminati-diffusion

🧿 Illuminati Diffusion w/ Textual Inversion Embeddings 🧬

3.5K runs
Public

zsxkib/animate-diff-prompt-walking

0 runs
Public

zsxkib/qwen2-57b-a14b-instruct

0 runs
Public

zsxkib/open-sora

0 runs
Public

zsxkib/moore-animateanyone

Unofficial Re-Trained AnimateAnyone (Image + DWPose Video → Animated Video of Image)

768 runs
Public

zsxkib/qwen2-0.5b-instruct-gptq-int8

0 runs
Public

zsxkib/qwen2-72b-instruct

1 run
Public

zsxkib/test

0 runs
Public

zsxkib/aya-101

📚 Aya, an LLM by Cohere capable of understanding and generating content in 101 languages 🗣️

300 runs
Public

zsxkib/trocr-base-handwritten

🖋️➡️📱Converts handwritten text images into digital text

263 runs
Public