Lip Reading and Reconstruction using ML
Abstract
Lip reading is the technique of understanding speech by visually interpreting lip movements. Although lip reading is most often used by people who are deaf or hard of hearing, most people with normal hearing also process some speech information from the sight of the moving mouth. In addition, the visual cues used in lip reading can improve the clarity of conversation in noisy environments. This paper proposes a model that examines the impact of cross-modal self-supervision on speech reconstruction (video-to-audio) by taking advantage of the natural co-occurrence of audio and visual streams in videos. The model uses an autoregressive encoder-decoder architecture with attention to map sequences of silent facial movements directly to mel-scale spectrograms for speech reconstruction, and it requires no human annotation.
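As a rough illustration of the autoregressive attention decoding described above, the following minimal sketch emits one spectrogram frame at a time, each conditioned on the previous frame and on an attention-weighted summary of the video features. It is not the paper's implementation: the function names (`attend`, `decode`), the `mix` parameter, the toy feature values, and the simple blending step that stands in for a learned decoder network are all assumptions made for illustration, and the video features and output frames are assumed to share one dimension.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, keys):
    """Scaled dot-product attention of one query over a list of key vectors.

    Returns the context vector (weighted sum of keys) and the weights.
    """
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    dim = len(keys[0])
    context = [sum(w * key[i] for w, key in zip(weights, keys)) for i in range(dim)]
    return context, weights


def decode(video_feats, n_frames, mix=0.5):
    """Autoregressively emit n_frames toy 'mel' frames.

    Assumption: a fixed blend of the previous frame and the attention
    context stands in for the learned decoder of a real model.
    """
    dim = len(video_feats[0])
    prev = [0.0] * dim  # all-zero start frame
    frames = []
    for _ in range(n_frames):
        context, _ = attend(prev, video_feats)
        frame = [mix * p + (1.0 - mix) * c for p, c in zip(prev, context)]
        frames.append(frame)
        prev = frame  # feed the output back in: the autoregressive step
    return frames


# Toy encoder output: three video time steps with two features each (made up).
video_feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
frames = decode(video_feats, n_frames=4)
print(len(frames), len(frames[0]))
```

In a trained system the blend would be replaced by learned layers and the frames would be mel-spectrogram columns; the loop structure, where each output becomes the next query, is the part this sketch is meant to show.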
Keywords:
lip reading, self-supervised pre-training, speech recognition, speech reconstruction
License
Copyright (c) 2023 International Journal on Emerging Research Areas

This work is licensed under a Creative Commons Attribution 4.0 International License.
All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
