Lip Reading and Reconstruction using ML
Abstract
Lip reading is a technique of comprehension of speech through visual interpretation of lip movements. Although lip reading is most often used by people who are deaf or hard of hearing, most people with normal hearing process some voice information from the sight of the moving mouth. In addition, understanding the language cues of lip readings can enhance the clarity of conversation in noisy environments. This paper proposes a model that identifies the impact of intermodal self monitoring for speech reconstruction (video-audio) by taking advantage of the natural occurrence of audio and visual streams in videos. The model that has an autoregressive encoder-decoder with an attention architecture, to map directly the sequences of silent facial movements to mel-scale spectrograms for speech reconstruction, which requires no human annotation.
Keywords:
lip reading, self supervised pre-training, speech recognition, speech reconstructionPublished
Issue
Section
License
Copyright (c) 2023 International Journal on Emerging Research Areas

This work is licensed under a Creative Commons Attribution 4.0 International License.
All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
How to Cite
Similar Articles
- Fabeela Ali Rawther , Abhinay A K, Anagha Tess B, Alan Joseph , Adham Saheer, Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- Nikita Niteen , Simy Mary Kurian, Exploring Explainable AI, Security and Beyond : A Comprehensive Review , International Journal on Emerging Research Areas: Vol. 3 No. 2 (2023): IJERA
- V Amarjith, Anaswara Anil, Anju Viswam, KM Aravind, Multilingual Hardcoded Subtitle Extractor , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Nikita Niteen , Juby Mathew, Securing AI: Understanding and Defending Against Adversarial Attacks in Deep Learning Systems , International Journal on Emerging Research Areas: Vol. 3 No. 2 (2023): IJERA
- Aashish Tom Raju, Aneesh Varghese John, Ashish Shabu, Bibin Babu, Anishamol Abraham, Vision-Based Surveillance for Malpractice Detection: An Analysis of Pose Estimation and Object Detection , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
- Manju Susan Thomas, Juby Mathew, The Integration of Trustworthy AI Values: A Comprehensive Model for Governance, Risk, and Compliance in Audit Architecture Framework context , International Journal on Emerging Research Areas: Vol. 3 No. 2 (2023): IJERA
- Ann Mary Babu, Anto K Thomas, Aswin Sebastian, Beffin K Lalu, Dr Jacob John, Assistive Technology For Deaf And Dumb , International Journal on Emerging Research Areas: Vol. 3 No. 1 (2023): IJERA
- Adith Ajay, Automatic Fall Detection And Alert System For Home Safety , International Journal on Emerging Research Areas: Vol. 3 No. 1 (2023): IJERA
- Lakshmi Nandana, Mariyam Emamudeen, Nikitha Mary Varghese, Susan Andrews, Manoj T Joy, FaceVue: A Review For Dynamic Advertising And Cost Management System , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Dona S Plavelil, A Devanandha, Haritha H Kurupp, Jissin k Jose, DETECTION OF ALZHEIMER’S DISEASE AND ASSISTANCE , International Journal on Emerging Research Areas: Vol. 3 No. 1 (2023): IJERA
You may also start an advanced similarity search for this article.
