Zero Shot Low Light Image Enhancement using Vision Language Models and Semantic Diffusion
Abstract
Capturing clear images in low-light conditions remains a significant challenge across surveillance, mobile photography, and diagnostic imaging. Traditional enhancement methods require extensive paired datasets or risk introducing visual artifacts. This paper presents a zero-shot low-light image enhancement framework combining vision-language models (CLIP) with latent diffusion models (Stable Diffusion) to enhance images without task-specific training. CLIP extracts semantic embeddings to guide the enhancement process, while the diffusion model performs iterative denoising to restore brightness and detail. By constraining enhancement through semantic similarity, our method preserves scene content while improving visibility. The system achieves competitive PSNR (15.556 dB) and SSIM (0.729) scores on standard benchmarks without requiring paired training data, demonstrating practical applicability for real-world deployment scenarios including embedded and mobile platforms.
Keywords:
low-light enhancement, zero-shot learning, diffusion models, vision-language modelsPublished
Issue
Section
License
Copyright (c) 2026 International Journal on Emerging Research Areas

This work is licensed under a Creative Commons Attribution 4.0 International License.
All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
How to Cite
Similar Articles
- Aniruddha Das, Avisikta Modak, The Carbon footprint of Machine Learning Models , International Journal on Emerging Research Areas: Vol. 3 No. 1 (2023): IJERA
- Shiney Thomas, Elsa George, Alphonsa Francis, Anna Job, Ann Maria James, Wildlife Detection And Recognition Using YOLO V8 , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- Jeswin Sabu, Kevin Biju Kulangara, Prapanch J, Stephin Mathew, Vimal Babu P, GestureMate: An AI-Driven System for Real-Time Malayalam Sign Language and Speech Translation , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
- Betzy Babu Thoppil, Anugrah Premachandran, Annapoorna M, Ashwin Mathew Zachariah, Bala Susan Jacob, Advanced Sensor-Based Landslide and Earthquake Detection and Alert System Utilizing Machine Learning and Computer Vision Technologies , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- Yamini C.K, Ajin krishna K U, Akhil Thilak, Amith Raj P R, Aromal A S, Alex joy, Jishnu Babu T, Jeswin jaison, VIDEO MOMENT RETRIEVAL SYSTEM , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- Ansamol Varghese, Anoushkha Tresa, Athira John, Ignatious Ealias Roy, M S Gautham Sankar, A Machine Learning Approach to Fake News Detection , International Journal on Emerging Research Areas: Vol. 3 No. 1 (2023): IJERA
- R Karthika, Maria Toms, S R Aadrash, P U Prabath, InsightAI: Bridging Natural Language and Data Analytics , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Jannies Varghese, Hariprasad Prasanth, Blessy Mariam Babu, Chris Joseph, Bini M Issac, Deep Learning Techniques for Image Steganography: A Comprehensive Review , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
- Mekha Jose, Jocelyn Anthony, Jose V Joseph, Joshwa Thomas, Sharon Baby Thomas, A Review of Machine Learning and Deep Learning Approaches for Offensive Text Detection , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- Nikita Niteen , Juby Mathew, Securing AI: Understanding and Defending Against Adversarial Attacks in Deep Learning Systems , International Journal on Emerging Research Areas: Vol. 3 No. 2 (2023): IJERA
You may also start an advanced similarity search for this article.
