Lip Reading and Reconstruction using ML
Abstract
Lip reading is a technique of comprehension of speech through visual interpretation of lip movements. Although lip reading is most often used by people who are deaf or hard of hearing, most people with normal hearing process some voice information from the sight of the moving mouth. In addition, understanding the language cues of lip readings can enhance the clarity of conversation in noisy environments. This paper proposes a model that identifies the impact of intermodal self monitoring for speech reconstruction (video-audio) by taking advantage of the natural occurrence of audio and visual streams in videos. The model that has an autoregressive encoder-decoder with an attention architecture, to map directly the sequences of silent facial movements to mel-scale spectrograms for speech reconstruction, which requires no human annotation.
Keywords:
lip reading, self supervised pre-training, speech recognition, speech reconstructionPublished
Issue
Section
License
Copyright (c) 2023 International Journal on Emerging Research Areas

This work is licensed under a Creative Commons Attribution 4.0 International License.
All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
How to Cite
Similar Articles
- Dr Anil A R, Amit Sankar Arun, Anandhu Anilkumar, Anandu S Sivan, Anoop Manoharan, DESIGNING OF A VOICE – BASED PROGRAMMING IDE FOR SOURCE CODE GENERATION , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Mekha Jose, Avin Joshy, Abishek R Paleri, Athul Mohan, Ali Jasim R M, A Review on Contribution and Influence of Artificial Intelligence in Road Safety and Optimal Routing , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- Alan K George, Arpita Mary Mathew, Asin Mary Jacob, Elizabeth Antony, Shiney Thomas, Classification of Lung Cancer Subtypes Using Deep Learning Model , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- V Naveen, S Rekha, A Concise Review On E-Commerce Website For Visually Impaired , International Journal on Emerging Research Areas: Vol. 3 No. 1 (2023): IJERA
- P Sathya Narayan, Safad Ismail, Developing an Empathetic Interaction Model for Elderly in Pandemics , International Journal on Emerging Research Areas: Vol. 3 No. 1 (2023): IJERA
- Adona Shibu, Aarunya Retheep, Albin Joseph, Ali Jasim, Adona Shibu , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Linsa Mathew, Brain Tumor Detection , International Journal on Emerging Research Areas: Vol. 3 No. 1 (2023): IJERA
- Jefrin Siby Mathew, Joyal Joseph, Roshik George, Tinu Rose Thottungal , Honey Joseph, Multiple Disease Detection using Machine Learning , International Journal on Emerging Research Areas: Vol. 3 No. 1 (2023): IJERA
- Fabeela Ali Rawther, Abhinay A K, Anagha Tess B, Alan Joseph, Adham Saheer, Survey of Machine Learning and Deep Learning Approaches for Automated Hate Speech Detection and Sentiment Analysis in Multilingual Contexts , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- Arun Robin, Tijo Thomas Titus, Ms. Minu Cherian, Improved Handwritten Digit Recognition Using Deep Learning Technique , International Journal on Emerging Research Areas: Vol. 3 No. 2 (2023): IJERA
You may also start an advanced similarity search for this article.