Zero Shot Low Light Image Enhancement using Vision Language Models and Semantic Diffusion

Kashinath Remeshkumar; Abhijith R R Abhijith; Dan Philip Bobby; Kevin Varghese Theveril; Hema H H Hema

Zero Shot Low Light Image Enhancement using Vision Language Models and Semantic Diffusion

Authors

Kashinath Remeshkumar

Sree Buddha College Of Engineering Pattoor

Author
Abhijith R R Abhijith

Sree Buddha College Of Engineering Pattoor

Author
Dan Philip Bobby

Sree Buddha College Of Engineering Pattoor

Author
Kevin Varghese Theveril

Author
Hema H H Hema

Author

Abstract

Capturing clear images in low-light conditions remains a significant challenge across surveillance, mobile photography, and diagnostic imaging. Traditional enhancement methods require extensive paired datasets or risk introducing visual artifacts. This paper presents a zero-shot low-light image enhancement framework combining vision-language models (CLIP) with latent diffusion models (Stable Diffusion) to enhance images without task-specific training. CLIP extracts semantic embeddings to guide the enhancement process, while the diffusion model performs iterative denoising to restore brightness and detail. By constraining enhancement through semantic similarity, our method preserves scene content while improving visibility. The system achieves competitive PSNR (15.556 dB) and SSIM (0.729) scores on standard benchmarks without requiring paired training data, demonstrating practical applicability for real-world deployment scenarios including embedded and mobile platforms.

Keywords:

low-light enhancement, zero-shot learning, diffusion models, vision-language models

Downloads 57

Full Text (PDF)

Published

29-05-2026

Issue

Vol. 6 No. 1 (2026): IJERA

Section

Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

How to Cite

[1]

K. Remeshkumar, A. R. R Abhijith, D. Philip Bobby, K. Varghese Theveril, and H. H. H Hema, “Zero Shot Low Light Image Enhancement using Vision Language Models and Semantic Diffusion”, IJERA, vol. 6, no. 1, pp. 77–83, May 2026, Accessed: Jul. 29, 2026. [Online]. Available: https://ijera.in/index.php/IJERA/article/view/380

Download Citation

Indexed By

Zero Shot Low Light Image Enhancement using Vision Language Models and Semantic Diffusion

Authors

Kashinath Remeshkumar

Abhijith R R Abhijith

Dan Philip Bobby

Kevin Varghese Theveril

Hema H H Hema

Abstract

Keywords:

Published

Issue

Section

License

How to Cite

Similar Articles

Similar Articles

DESIGNING OF A VOICE – BASED PROGRAMMING IDE FOR SOURCE CODE GENERATION

Footage Analysis Toolkit: A System for Semantic Video Retrieval and Structured Forensic Analysis

SkinGuard: An EfficientNet Model for Skin Cancer and M-pox Detection

StockGenie: AI-Driven Stock Market Assistant and Forecasting System

A Review of Methodologies for Detecting Missing and Wanted People Using Machine Learning and Video Surveillance

A Concise Review On E-Commerce Website For Visually Impaired

DeepScan : A Deepfake Video Detection System

FEATURE EXTRACTION AND CLASSIFICATION OF CERTIFICATES USING OCR

Chatbot-Enabled Symptom Assessment: Revolutionizing Disease Diagnosis and Patient Care

Footage Analysis Toolkit: A System for Semantic Video Retrieval and Structured Forensic Analysis