Lip Reading and Reconstruction using ML
Abstract
Lip reading is a technique of comprehension of speech through visual interpretation of lip movements. Although lip reading is most often used by people who are deaf or hard of hearing, most people with normal hearing process some voice information from the sight of the moving mouth. In addition, understanding the language cues of lip readings can enhance the clarity of conversation in noisy environments. This paper proposes a model that identifies the impact of intermodal self monitoring for speech reconstruction (video-audio) by taking advantage of the natural occurrence of audio and visual streams in videos. The model that has an autoregressive encoder-decoder with an attention architecture, to map directly the sequences of silent facial movements to mel-scale spectrograms for speech reconstruction, which requires no human annotation.
Keywords:
lip reading, self supervised pre-training, speech recognition, speech reconstructionPublished
Issue
Section
License
Copyright (c) 2023 International Journal on Emerging Research Areas

This work is licensed under a Creative Commons Attribution 4.0 International License.
All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
How to Cite
Similar Articles
- Alfred Joe Devasia, Nandhana Kunjumon, Rahul R Krishna, Safna M S, Thomas Joseph, TalkTrace: Secure Automated Transcription and Summary Generation , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
- Anitta K Mathew, Hanna Sarah Sabu, Annu Alphonse Jojo, Helan Poulose, Lia Maria Rajan, A Review of AI-Powered Tools to Help People With Visual Impairments , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
- Kevin Roy, Lino Shaji, Riya G Johnson, Tince Tomy, Jane George, INTELLIGENT BUDDY , International Journal on Emerging Research Areas: Vol. 3 No. 1 (2023): IJERA
- Vinayak Prakash, Tresa Mariya Denny, Vivek Subash Nair, Sonal Varghese, Tom Kurian, FEATURE EXTRACTION AND CLASSIFICATION OF CERTIFICATES USING OCR , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- Alfred Santhosh, Franklin V Jose, K Rohit, Anderson Abraham, Literature Survey on AURA: Augmented Reality Glasses for Enhancing Accessibility of Visually and Hearing Impaired Users , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
- Cymil Sara Eashow, Fathima Ishana K.M, Eva Mary Regi, Ken Jacob Zachariah, Kesiya Rachel John, Juby Mathew, Assistive Technologies for the Visually Impaired: A Comprehensive Survey , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
- FATHIMA P.S, ANU ROSE JOY, ANSPIN TITUS, ANSU MARIUM SHIBU, ASNA AZEEZ, INDIAN SIGN LANGUAGE RECOGNITION USING YOLOV5 , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Albin , Aarunya Retheep, Adona Shibu, Athul P Shibu, Lis Jose, LanguaGuide -Your personalized AI companion for mastering languages, anytime, anywhere. , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- Ria Mathews, AI Based Stress and Mental Health Monitoring System Using Chatbot, Speech and Facial Analysis , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
- Aleena Joseph, Diya Paramesh G, Elza Mary Thomas, Gayathri V, Anu V Kottath, A Review on Comparison of VGG-16 and DenseNet algorithms for analysing brain tumor in MRI image , International Journal on Emerging Research Areas: Vol. 3 No. 1 (2023): IJERA
You may also start an advanced similarity search for this article.
