Lip Reading and Reconstruction using ML

Lida K Kuriakose; Misha Rose Joseph; Namitha R; Sheezan Niby; Tanver Ahmad Lone

Authors

Lida K Kuriakose
Amal Jyothi College of Engineering

Misha Rose Joseph
Amal Jyothi College of Engineering

R Namitha
Amal Jyothi College of Engineering

Sheezan Niby
Amal Jyothi College of Engineering

Tanver Ahmad Lone
Amal Jyothi College of Engineering

Abstract

Lip reading is the comprehension of speech through visual interpretation of lip movements. Although it is most often used by people who are deaf or hard of hearing, most people with normal hearing also pick up some speech information from the sight of a moving mouth. In addition, the visual cues that lip reading provides can make conversation easier to follow in noisy environments. This paper proposes a model that examines the effect of cross-modal (video-to-audio) self-supervision on speech reconstruction by exploiting the natural co-occurrence of audio and visual streams in videos. The model uses an autoregressive encoder-decoder architecture with attention to map sequences of silent facial movements directly to mel-scale spectrograms for speech reconstruction, and it requires no human annotation.
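The decoding idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the weights are random and untrained, the dimensions are arbitrary, and the names `attend`, `decode_mel`, `w_query`, and `w_out` are hypothetical. It only shows how an autoregressive decoder could attend over encoded video frames and emit one mel-spectrogram frame at a time, feeding each frame back in as input for the next step.

```python
import math
import random

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def matvec(matrix, vec):
    """Multiply a matrix (stored as a list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vec)) for row in matrix]

def attend(query, encoder_states):
    """Scaled dot-product attention of one query over all encoder states."""
    scale = math.sqrt(len(query))
    scores = [sum(q * h for q, h in zip(query, state)) / scale
              for state in encoder_states]
    weights = softmax(scores)
    dim = len(encoder_states[0])
    context = [sum(w * state[d] for w, state in zip(weights, encoder_states))
               for d in range(dim)]
    return context, weights

def decode_mel(encoder_states, n_frames, n_mels, seed=0):
    """Autoregressively emit n_frames mel frames from encoded video frames."""
    rng = random.Random(seed)
    hidden = len(encoder_states[0])
    # Assumption: random projections stand in for learned parameters.
    w_query = [[rng.uniform(-0.1, 0.1) for _ in range(n_mels)]
               for _ in range(hidden)]
    w_out = [[rng.uniform(-0.1, 0.1) for _ in range(hidden + n_mels)]
             for _ in range(n_mels)]
    prev = [0.0] * n_mels  # all-zero start frame
    frames, alignments = [], []
    for _ in range(n_frames):
        query = matvec(w_query, prev)         # previous frame -> attention query
        context, weights = attend(query, encoder_states)
        prev = matvec(w_out, context + prev)  # next frame from context + previous frame
        frames.append(prev)
        alignments.append(weights)
    return frames, alignments

# Stand-in visual features for 5 video frames (hypothetical values, 8 dims each).
encoder_states = [[math.sin(0.3 * t + d) for d in range(8)] for t in range(5)]
frames, alignments = decode_mel(encoder_states, n_frames=4, n_mels=6)
print(len(frames), len(frames[0]))
```

In a trained system the projections would be learned by regressing the predicted frames against mel-spectrograms computed from each video's own audio track, which is what lets training proceed without human annotation.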

Keywords:

lip reading, self-supervised pre-training, speech recognition, speech reconstruction

Published

16-07-2025

Issue

Vol. 3 No. 1 (2023): IJERA

Section

Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

How to Cite

[1]

L. K Kuriakose, M. Rose Joseph, N. R, S. Niby, and T. Ahmad Lone, “Lip Reading and Reconstruction using ML”, IJERA, vol. 3, no. 1, Jul. 2025, Accessed: Jul. 01, 2026. [Online]. Available: https://ijera.in/index.php/IJERA/article/view/101
