Lip Reading and Reconstruction using ML

Authors

  • Lida K Kuriakose
    Amal Jyothi College of Engineering
  • Misha Rose Joseph
    Amal Jyothi College of Engineering
  • R Namitha
    Amal Jyothi College of Engineering
  • Sheezan Niby
    Amal Jyothi College of Engineering
  • Tanver Ahmad Lone
    Amal Jyothi College of Engineering

Abstract

Lip reading is the comprehension of speech through visual interpretation of lip movements. Although it is most often used by people who are deaf or hard of hearing, most people with normal hearing also process some speech information from the sight of a moving mouth. In addition, the visual cues provided by lip reading can make conversation easier to follow in noisy environments. This paper proposes a model that examines the effect of cross-modal self-supervision on video-to-audio speech reconstruction by taking advantage of the natural co-occurrence of audio and visual streams in videos. The model uses an autoregressive encoder-decoder architecture with attention to map sequences of silent facial movements directly to mel-scale spectrograms for speech reconstruction, and it requires no human annotation.
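The decoding scheme named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the single-projection encoder, the random untrained weights, the dimensions, and the function names (attend, linear, decode) are all assumptions chosen for illustration. It only shows the general pattern of an autoregressive decoder that attends over encoded video frames and emits one mel-spectrogram frame per step, each step conditioned on the previous output.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, memory):
    """Scaled dot-product attention over the encoder states in `memory`."""
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, state)) / scale for state in memory]
    weights = softmax(scores)
    dim = len(memory[0])
    context = [sum(w * state[d] for w, state in zip(weights, memory)) for d in range(dim)]
    return context, weights


def linear(vec, weight):
    """Multiply a vector by a weight matrix stored as a list of output rows."""
    return [sum(w * v for w, v in zip(row, vec)) for row in weight]


def random_matrix(rows, cols, rng):
    """Small random weights; a trained model would learn these instead."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def decode(video_features, n_mels, n_steps, seed=0):
    """Autoregressively produce n_steps mel frames from per-frame video features."""
    rng = random.Random(seed)
    dim = len(video_features[0])
    # Encoder stand-in (assumption): one linear projection plus tanh per video frame.
    w_enc = random_matrix(dim, dim, rng)
    memory = [[math.tanh(x) for x in linear(frame, w_enc)] for frame in video_features]
    w_query = random_matrix(dim, n_mels, rng)  # previous mel frame -> attention query
    w_out = random_matrix(n_mels, dim, rng)    # attention context -> next mel frame
    prev = [0.0] * n_mels                      # all-zero start frame
    mel = []
    for _ in range(n_steps):
        query = linear(prev, w_query)
        context, _ = attend(query, memory)
        prev = linear(context, w_out)          # the output feeds the next step
        mel.append(prev)
    return mel


# Five silent video frames with 4 hypothetical features each.
features = [[0.1 * i + 0.01 * j for j in range(4)] for i in range(5)]
spectrogram = decode(features, n_mels=8, n_steps=6)
print(len(spectrogram), len(spectrogram[0]))
```

In a self-supervised setting of the kind the abstract describes, the training target for each step would be the mel-spectrogram of the audio track that accompanies the same video, so no manual labels are needed.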

Keywords:

lip reading, self-supervised pre-training, speech recognition, speech reconstruction

Published

16-07-2025

Issue

Section

Articles

How to Cite

[1]
L. K Kuriakose, M. Rose Joseph, N. R, S. Niby, and T. Ahmad Lone, “Lip Reading and Reconstruction using ML”, IJERA, vol. 3, no. 1, Jul. 2025, Accessed: Aug. 13, 2025. [Online]. Available: https://ijera.in/index.php/IJERA/article/view/101
