goodguy1963 / hidream-l1-full-img2img

IMG2IMG for HiDream FULL AND DEV - does creative variations (wait for HiDream-E1 for even better results)

  • Public
  • 144 runs
  • Weights

HiDream Img2Img ComfyUI Workflow with settings that worked for me

This workflow enables advanced image-to-image generation using the HiDream model suite and Florence-2 prompt generator, designed for use with ComfyUI and Replicate.

Overview

  • Image-to-image generation with HiDream diffusion model
  • Florence-2 for prompt generation and captioning
  • VAE encoding/decoding and advanced CLIP-based text encoding
  • Negative prompt support for artifact reduction
  • LOW VRAM MODE

Required Models & Credits

Diffusion Model

  • hidream_i1_full_fp16.safetensors
    Place in: ComfyUI/models/diffusion_models
    Download
    Thanks to HiDream.ai for the model!

For low VRAM user - GPU with less than 24GB VRAM:

Text Encoders

Place all in: ComfyUI/models/text_encoders - clip_g_hidream.safetensors
Download - clip_l_hidream.safetensors
Download - llama_3.1_8b_instruct_fp8_scaled.safetensors
Download - t5xxl_fp8_e4m3fn_scaled.safetensors
Download

VAE

  • ae.safetensors
    Place in: ComfyUI/models/vae
    Download

Florence-2 Prompt Generator (NO need to download - will be downloaded automatacally at runtime)

Usage

  1. Download all required models and place them in the correct directories as listed above.
  2. Drag the workflow image in ComfyUI
  3. Use the workflow to generate new images from your input images and prompts.

For low VRAM user - GPU with less than 24GB VRAM:

Workflow Diagram

See the full workflow structure here:
WORKFLOW-HIDREAM-IMG2IMG.png

Acknowledgements

  • HiDream.ai for the diffusion model and encoders
  • Microsoft for Florence-2
  • MiaoshouAI for the Florence-2 prompt generator implementation
  • ComfyUI team for the UI and workflow engine

Thank you to all model creators and contributors!