Lip Reading and Reconstruction using ML
Abstract
Lip reading is a technique of comprehension of speech through visual interpretation of lip movements. Although lip reading is most often used by people who are deaf or hard of hearing, most people with normal hearing process some voice information from the sight of the moving mouth. In addition, understanding the language cues of lip readings can enhance the clarity of conversation in noisy environments. This paper proposes a model that identifies the impact of intermodal self monitoring for speech reconstruction (video-audio) by taking advantage of the natural occurrence of audio and visual streams in videos. The model that has an autoregressive encoder-decoder with an attention architecture, to map directly the sequences of silent facial movements to mel-scale spectrograms for speech reconstruction, which requires no human annotation.
Keywords:
lip reading, self supervised pre-training, speech recognition, speech reconstructionPublished
Issue
Section
License
Copyright (c) 2023 International Journal on Emerging Research Areas

This work is licensed under a Creative Commons Attribution 4.0 International License.
All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
How to Cite
Similar Articles
- Arya Raj S, R Gopika Krishnan, Drishya Das, Rohith R, Jocelyn Ann Joseph, Personality Profiling Using CV Analysis , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Yamini C.K, Ajin krishna K U, Akhil Thilak, Amith Raj P R, Aromal A S, Alex joy, Jishnu Babu T, Jeswin jaison, VIDEO MOMENT RETRIEVAL SYSTEM , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- Lakshmy Suresh K , Joanna Danniel, Mariya Binoy, R Neenu, BookVerse: A Platform for Book Reviews and Resale , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- NITHYA M V, ADIL SIYAD K.M, AFINSHA P.B, GAUTHAM T.S, ABHIJITH K.P, SALIH SUDHEER, ARJUN SANKAR R.S, C.S ADHITHYAN, JEWELLERY SHOPPING WITH FACIAL RECOGNITION , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- Parvathy S Pillai, Pooja Rajeev, Sania Regi, Parvathy S Nair, Dr. Therese Yamuna Mahesh, Agi Joseph George, SMART TROLLEY: A MORE ENHANCED SHOPPING EXPERIENCE , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- Dr. Indu John, Gauri Santhosh, Jesna Susan Reji, Abdul Musawir, Glady Prince, Detection of Autism Spectrum Disorder in Toddlers using Machine Learning , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Maria Sajeeve, Karthik Vinod, Kausalya Sumesh, Joby Jose, Minu Cherian, KALO:AI-Powered Precision in Nutrition Tracking , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- Ansamol Varghese, Anandhu Anoj, Angel Thomas, Deepta K Sunny, Emil Thomas, TrueNews-AI Powered Detection of Manipulated Text and Images , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- Mishal Rose Thankachan, Joshua John Sajit, Merwin Maria Antony, Richa Maria Biju, Richa Maria Biju, Bini M Issac, Pixelyse : ViT- VAE for Document Forgery Detection , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- Tintu Alphonsa Thomas, Nandana Rajagopal, Neethu Liz Shaji, Silby Elza Simon, P Sree Parvathy, Survey on Video Summarization using Extracted Audio , International Journal on Emerging Research Areas: Vol. 3 No. 1 (2023): IJERA
You may also start an advanced similarity search for this article.
