Zero Shot Low Light Image Enhancement using Vision Language Models and Semantic Diffusion

Kashinath Remeshkumar; Abhijith R R Abhijith; Dan Philip Bobby; Kevin Varghese Theveril; Hema H H Hema

Zero Shot Low Light Image Enhancement using Vision Language Models and Semantic Diffusion

Authors

Kashinath Remeshkumar

Sree Buddha College Of Engineering Pattoor

Author
Abhijith R R Abhijith

Sree Buddha College Of Engineering Pattoor

Author
Dan Philip Bobby

Sree Buddha College Of Engineering Pattoor

Author
Kevin Varghese Theveril

Author
Hema H H Hema

Author

Abstract

Capturing clear images in low-light conditions remains a significant challenge across surveillance, mobile photography, and diagnostic imaging. Traditional enhancement methods require extensive paired datasets or risk introducing visual artifacts. This paper presents a zero-shot low-light image enhancement framework combining vision-language models (CLIP) with latent diffusion models (Stable Diffusion) to enhance images without task-specific training. CLIP extracts semantic embeddings to guide the enhancement process, while the diffusion model performs iterative denoising to restore brightness and detail. By constraining enhancement through semantic similarity, our method preserves scene content while improving visibility. The system achieves competitive PSNR (15.556 dB) and SSIM (0.729) scores on standard benchmarks without requiring paired training data, demonstrating practical applicability for real-world deployment scenarios including embedded and mobile platforms.

Keywords:

low-light enhancement, zero-shot learning, diffusion models, vision-language models

Downloads 57

Full Text (PDF)

Published

29-05-2026

Issue

Vol. 6 No. 1 (2026): IJERA

Section

Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

How to Cite

[1]

K. Remeshkumar, A. R. R Abhijith, D. Philip Bobby, K. Varghese Theveril, and H. H. H Hema, “Zero Shot Low Light Image Enhancement using Vision Language Models and Semantic Diffusion”, IJERA, vol. 6, no. 1, pp. 77–83, May 2026, Accessed: Jul. 28, 2026. [Online]. Available: https://ijera.in/index.php/IJERA/article/view/380

Download Citation

Indexed By

Zero Shot Low Light Image Enhancement using Vision Language Models and Semantic Diffusion

Authors

Kashinath Remeshkumar

Abhijith R R Abhijith

Dan Philip Bobby

Kevin Varghese Theveril

Hema H H Hema

Abstract

Keywords:

Published

Issue

Section

License

How to Cite

Similar Articles

Similar Articles

A Review of AI-Powered Tools to Help People With Visual Impairments

CARDAMOM PLANT DISEASE DETECTION USING ROBOT

Detection of Diabetic Retinopathy and Glaucoma using Deep Learning

Custom Cart – Virtual try-on in e-commerce platforms using generative AI

Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset

A Review on Prompt Engineering in Agriculture

A Literature Review On Machine Learning-Based Phishing Detection Systems

A Review on Deep Learning and IoT-Based Road Surface Damage Detection

Literature Survey on AURA: Augmented Reality Glasses for Enhancing Accessibility of Visually and Hearing Impaired Users

Personality Profiling Using CV Analysis