Readme
Transform AI-generated images into photorealistic masterpieces with advanced enhancement and upscaling
Advanced AI image enhancement that eliminates artifacts, adds natural depth, improves details, and upscales with photorealistic quality for stunning, professional results.
📋 Overview
This specialized workflow prioritizes quality over speed, using an intensive multi-stage process to transform AI-generated images. Rather than quick fixes, it employs a thorough sequence of professional enhancement techniques that combine multiple state-of-the-art Stable Diffusion models with powerful computer vision to:
- ✨ Fix AI artifacts and improve photorealism through deliberate multi-stage processing
- 🔍 Add natural depth with realistic depth-of-field effects and subtle background blur
- 🖼️ Preserve composition while enhancing details and correcting common AI flaws
- 📈 Upscale by 2x or 4x with AI-powered detail preservation and enhancement
- 📦 Batch process multiple AI images via ZIP or TAR.GZ archives
🚀 Features
- High-Quality Focus: Optimized for maximum image quality, not processing speed
- Multi-Stage Processing Pipeline: Uses a sophisticated sequence of model applications rather than a single-pass approach
- AI Image Enhancement: Specifically tuned to improve AI-generated images for more photorealistic results
- Depth-Aware Processing: Uses Depth Anything V2 to add realistic depth effects often missing in AI art
- Intelligent Depth Blur: Applies light, natural-looking blur based on depth map for more photographic results
- Content Safety: Automatically detects and blurs NSFW content for appropriate usage
- Flexible Upscaling: Choose between 2x or 4x upscaling depending on your needs
- Archive Support: Process multiple images at once via ZIP or TAR.GZ archives
- High-Resolution Output: Upscales images while maintaining quality
- JPEG Output: All processed images are saved in high-quality JPEG format
📥 Input/Output Options
- Single Image: Process one image at a time (JPEG, PNG, WebP, BMP supported)
- Batch Processing: Upload ZIP or TAR.GZ archives containing multiple images
- Output Format: All images are output as high-quality JPEGs for consistency
🖌️ Models Used
This workflow combines several powerful models to achieve stunning results:
🎨 Stable Diffusion Models
- cyberillustrious v3.5 by Cyberdelia - Realistic detail enhancement (Creator of the Illustrious-XL-v1.0: OnomaAI based on SDXL by [StabilityAI] https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0 )
- epicrealismNatural v4.0 by epinikion - Good hair
- perfectdeliberate v5 by Desync - Good skin
🔍 ControlNets & Special Models
- Depth Anything V2 - Advanced depth map generation for realistic focal effects
- OpenPoseXL2 - Pose understanding for better human subjects
- diffusers_xl_depth_full - Depth-based detail control and subtle background blurring
🔧 Enhancement Tools
- HandFineTuning_XL by Hustmox - Hand detail improvement
- 4xRealWebPhoto_v4 by Phips - High-quality photo-realistic upscaling
- IP-Adapter SDXL Plus - Style and composition guidance
💡 Usage Guide
- Upload a single image or an archive containing multiple images
- Wait for the enhancement process to complete
- Download your enhanced image(s)
For archives, all images inside will be processed with the same settings and returned in a similarly formatted archive with the same directory structure.
⚠️ Important Limitations
- Image Ratio: This tool only works properly with 1:1 (square) ratio images. Other aspect ratios will be center-cropped to square format. All images will be processed at 1536x1536 resolution regardless of input size.
- Output Resolution: Final images will be either 3072x3072 (2x upscale) or 6144x6144 (4x upscale) depending on your selection.
- Text Processing: While the tool can process text in images, results may not always be perfect.
- Depth Blur: The depth-based blur effect can be disabled if preferred.
- Processing Time Limit: ⚠️ WARNING - Replicate automatically stops processes after 30 minutes. Very large images or batch operations may time out.
🙏 Acknowledgements
This workflow builds on the incredible work of many talented developers and model creators:
Model Creators
- Cyberdelia for cyberillustrious v3.5
- epinikion for epicrealismNatural v4.0
- Desync for perfectdeliberate v5
- Hustmox for HandFineTuning_XL
- Phips for 4xRealWebPhoto_v4 upscaler
- tencent-ailab for IP-Adapter (with IPAdapter Plus implementation from cubiq)
- Depth Anything team for Depth Anything V2 depth detection model
Tools & Frameworks
- ComfyUI team for the incredible workflow engine
- Depth Anything V2 for vision-language architectures
- pysssss for the String Function and WD14 Tagger nodes
- ComfyUI Pro Post Processing team for depth-map blur effects and focal depth control
⭐ Thanks. ⭐