Zero Shot Low Light Image Enhancement using Vision Language Models and Semantic Diffusion

Kashinath Remeshkumar; Abhijith R R Abhijith; Dan Philip Bobby; Kevin Varghese Theveril; Hema H H Hema

Authors

Kashinath Remeshkumar
Sree Buddha College Of Engineering Pattoor

Abhijith R R Abhijith
Sree Buddha College Of Engineering Pattoor

Dan Philip Bobby
Sree Buddha College Of Engineering Pattoor

Kevin Varghese Theveril

Hema H H Hema

Abstract

Capturing clear images in low-light conditions remains a significant challenge across surveillance, mobile photography, and diagnostic imaging. Traditional enhancement methods require extensive paired datasets or risk introducing visual artifacts. This paper presents a zero-shot low-light image enhancement framework combining vision-language models (CLIP) with latent diffusion models (Stable Diffusion) to enhance images without task-specific training. CLIP extracts semantic embeddings to guide the enhancement process, while the diffusion model performs iterative denoising to restore brightness and detail. By constraining enhancement through semantic similarity, our method preserves scene content while improving visibility. The system achieves competitive PSNR (15.556 dB) and SSIM (0.729) scores on standard benchmarks without requiring paired training data, demonstrating practical applicability for real-world deployment scenarios including embedded and mobile platforms.
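The two quantities the abstract relies on, a semantic similarity constraint and the PSNR score, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how semantic similarity is computed, so cosine similarity between embedding vectors is an assumption here (it is the measure commonly used with CLIP embeddings), and the function names `psnr` and `cosine_similarity` are hypothetical. Pixel values are assumed to be normalized to [0, 1], and plain Python lists stand in for real images and CLIP outputs.

```python
import math
from typing import Sequence


def psnr(reference: Sequence[float], enhanced: Sequence[float],
         max_value: float = 1.0) -> float:
    """Peak signal-to-noise ratio in dB between two equally sized pixel sequences.

    Assumes pixel intensities lie in [0, max_value].
    """
    if len(reference) != len(enhanced) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and the same length")
    # Mean squared error over all pixels.
    mse = sum((r - e) ** 2 for r, e in zip(reference, enhanced)) / len(reference)
    if mse == 0.0:
        return math.inf  # identical images have unbounded PSNR
    return 10.0 * math.log10(max_value ** 2 / mse)


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two embedding vectors.

    Assumption: stands in for the paper's semantic similarity measure,
    which the abstract does not specify.
    """
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    # Toy 4-pixel "images": the enhanced output is uniformly off by 0.1.
    reference = [0.0, 0.0, 0.0, 0.0]
    enhanced = [0.1, 0.1, 0.1, 0.1]
    print(f"PSNR: {psnr(reference, enhanced):.2f} dB")

    # Toy 3-dimensional vectors standing in for CLIP image embeddings.
    emb_input = [0.2, 0.5, 0.1]
    emb_output = [0.25, 0.45, 0.12]
    print(f"cosine similarity: {cosine_similarity(emb_input, emb_output):.4f}")
```

In a pipeline of the kind the abstract describes, a similarity score like this would be computed between the embedding of the low-light input and that of the enhanced output, so that a drop in similarity signals that scene content has drifted. PSNR, by contrast, needs a reference image and is used only for evaluation.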

Keywords:

low-light enhancement, zero-shot learning, diffusion models, vision-language models

Published

29-05-2026

Issue

Vol. 6 No. 1 (2026): IJERA

Section

Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

How to Cite

[1]

K. Remeshkumar, A. R. R Abhijith, D. Philip Bobby, K. Varghese Theveril, and H. H. H Hema, “Zero Shot Low Light Image Enhancement using Vision Language Models and Semantic Diffusion”, IJERA, vol. 6, no. 1, pp. 77–83, May 2026, Accessed: Jul. 30, 2026. [Online]. Available: https://ijera.in/index.php/IJERA/article/view/380

Similar Articles

The Carbon footprint of Machine Learning Models

Wildlife Detection And Recognition Using YOLO V8

GestureMate: An AI-Driven System for Real-Time Malayalam Sign Language and Speech Translation

Advanced Sensor-Based Landslide and Earthquake Detection and Alert System Utilizing Machine Learning and Computer Vision Technologies

A Machine Learning Approach to Fake News Detection

InsightAI: Bridging Natural Language and Data Analytics

Deep Learning Techniques for Image Steganography: A Comprehensive Review

A Review of Machine Learning and Deep Learning Approaches for Offensive Text Detection

Securing AI: Understanding and Defending Against Adversarial Attacks in Deep Learning Systems