Zero Shot Low Light Image Enhancement using Vision Language Models and Semantic Diffusion
Abstract
Capturing clear images in low-light conditions remains a significant challenge across surveillance, mobile photography, and diagnostic imaging. Traditional enhancement methods require extensive paired datasets or risk introducing visual artifacts. This paper presents a zero-shot low-light image enhancement framework combining vision-language models (CLIP) with latent diffusion models (Stable Diffusion) to enhance images without task-specific training. CLIP extracts semantic embeddings to guide the enhancement process, while the diffusion model performs iterative denoising to restore brightness and detail. By constraining enhancement through semantic similarity, our method preserves scene content while improving visibility. The system achieves competitive PSNR (15.556 dB) and SSIM (0.729) scores on standard benchmarks without requiring paired training data, demonstrating practical applicability for real-world deployment scenarios including embedded and mobile platforms.
Keywords:
low-light enhancement, zero-shot learning, diffusion models, vision-language modelsPublished
Issue
Section
License
Copyright (c) 2026 International Journal on Emerging Research Areas

This work is licensed under a Creative Commons Attribution 4.0 International License.
All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
How to Cite
Similar Articles
- Dr Anil A R, Amit Sankar Arun, Anandhu Anilkumar, Anandu S Sivan, Anoop Manoharan, DESIGNING OF A VOICE – BASED PROGRAMMING IDE FOR SOURCE CODE GENERATION , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Adithya Raj, Jibin Gigi, Lidiya Reju, Manu Emmanuel, Smitha Jacob, Footage Analysis Toolkit: A System for Semantic Video Retrieval and Structured Forensic Analysis , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
- Joel Judish, Samrudh Salas, Farhaan Zuhair, Muhammed Zakkariya M, Juby Mathew, SkinGuard: An EfficientNet Model for Skin Cancer and M-pox Detection , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- Felix Jobi, Nagaraj Menon K S, Revathy Biju, Shraya S Santhosh, StockGenie: AI-Driven Stock Market Assistant and Forecasting System , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
- Nighila Ashok, Adithya Ajith, Aparna Shaju, Arjuna Chandran V V, Fahmi Fathima T S, DeepScan : A Deepfake Video Detection System , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- George P Kurias, Gokul Krishna AU, Jifith Joseph, Sharunmon R, Linsa Mathew, A Review of Methodologies for Detecting Missing and Wanted People Using Machine Learning and Video Surveillance , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- V Naveen, S Rekha, A Concise Review On E-Commerce Website For Visually Impaired , International Journal on Emerging Research Areas: Vol. 3 No. 1 (2023): IJERA
- Vinayak Prakash, Tresa Mariya Denny, Vivek Subash Nair, Sonal Varghese, Tom Kurian, FEATURE EXTRACTION AND CLASSIFICATION OF CERTIFICATES USING OCR , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- Krishnendu B, Sreelakshmi A, Sumayya Maheen, Zameel Hassan, Honey Joseph, Chatbot-Enabled Symptom Assessment: Revolutionizing Disease Diagnosis and Patient Care , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Adithya Raj, Jibin Gigi, Lidiya Reju, Manu Emmanuel, Smitha Jacob, Footage Analysis Toolkit: A System for Semantic Video Retrieval and Structured Forensic Analysis , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
You may also start an advanced similarity search for this article.
