Lip Reading and Reconstruction using ML
Abstract
Lip reading is a technique of comprehension of speech through visual interpretation of lip movements. Although lip reading is most often used by people who are deaf or hard of hearing, most people with normal hearing process some voice information from the sight of the moving mouth. In addition, understanding the language cues of lip readings can enhance the clarity of conversation in noisy environments. This paper proposes a model that identifies the impact of intermodal self monitoring for speech reconstruction (video-audio) by taking advantage of the natural occurrence of audio and visual streams in videos. The model that has an autoregressive encoder-decoder with an attention architecture, to map directly the sequences of silent facial movements to mel-scale spectrograms for speech reconstruction, which requires no human annotation.
Keywords:
lip reading, self supervised pre-training, speech recognition, speech reconstructionPublished
Issue
Section
License
Copyright (c) 2023 International Journal on Emerging Research Areas

This work is licensed under a Creative Commons Attribution 4.0 International License.
All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
How to Cite
Similar Articles
- Tom Kurian, Ektha P S, Chethana Raj T, Diona Joseph, Annu Mary Abraham, Intelligent Disease Prediction in Hydroponic Systems Using Machine Learning , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Leo Jose, Navin Shibu George, Raju, Safa Haroon, Bini M Issac, Wearable Technology for Driver Monitoring and Health Management: A Comprehensive Survey , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Mrs.Resmipriya M G, Aakarsh P, Abel VJ, Deepak Denny David, Francis Tom, Wise Care: A Comprehensive Mobile Application with Conversational Chatbot and Medical Assistance , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Lis Jose, Albin John Wilson, Akshay Sebastian, Alisha Ann Subash, Agnes James, SafeRoute-A Comprehensive Travel Solution , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- Dr.Amal M R, Allen Joseph, Jishnu suresh, Abhijith selvam, Aravind A S, AI Based Multi Robot Fire Suppression System , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Kaveri S, Pooja Satheesh, Kesiya Susan John, Reubel K Wilson, Dr. Jacob John, Predictive Maintenance of Machines Using IoT and Machine Learning , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- Ashish George, Fida Fathima N, Aswin Kumar A, Nishok Perumal A , Lini Ickappan, GITSHUB - A COMPREHENSIVE PLATFORM FOR ACADEMIC NETWORKING, MENTORSHIP, AND CAREER DEVELOPMENT , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- Honey Joseph, Aaron M Vinod, Abin Mathew varghese, Aby Alex, Aleena Sain, Crop Yield Prediction Using ML , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Goutham P Raj, Gregan George, Hadii Hasan, John Ashwin Delmon, V Pradeeba, COMPREHENSIVE VEHICLE SERVICES & E-COMMERCE PLATFORM WITH PRICE PREDICTION USING ML , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- Febin Cheriyan, Deni Tom Jacob, Joanna Daniel, Haby S Mathews, Honey Joseph, Pneumonia Detection From Chest X-Rays Using Deep Learning : A Comprehensive Review , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
You may also start an advanced similarity search for this article.
