StamFree: A Gamified AI System for Speech Disfluency Detection and Therapy in Children
Abstract
Abstract—Speech disfluency, commonly referred to as stam-
mering, is a multifaceted communication disorder that pre-
dominantly affects children between 6 to 12, with substantial
consequences for social confidence, academic achievement, and
psychosocial development. Although traditional speech therapy
demonstrates efficacy, it frequently necessitates intensive clinical
supervision and repetitive exercises, which may be stressful and
monotonous for pediatric patients, resulting in low adherence.
This study presents StamFree, an innovative, child-focused gam-
ified therapy system utilizing advanced artificial intelligence.
In contrast to earlier systems that utilize basic signal process-
ing, StamFree employs WavLM, a state-of-the-art self-supervised
deep learning model, to analyze speech directly from raw audio
waveforms. This architecture enables robust multi-class classifica-
tion of disfluencies, accurately distinguishing between repetitions,
prolongations, and blocks. The system incorporates a novel
Stress-Based Progression Strategy, which organizes phonemes
into hierarchical tiers according to articulatory stress levels:
low, medium, and high. By integrating this progression with
an adaptive unlocking mechanism, StamFree ensures that users
achieve proficiency with lower-stress sounds prior to advancing,
thereby minimizing cognitive overload. Interactive mini-games
that reinforce breathing control and pacing further contribute
to a low-anxiety, engaging therapeutic environment, promoting
sustained practice beyond the clinical context.
Keywords:
Speech disfluency, Stammering,, Gamified therapy,, WavLM,, Child-centric, Multi-class Speech AnalysisPublished
Issue
Section
License
Copyright (c) 2026 International Journal on Emerging Research Areas

This work is licensed under a Creative Commons Attribution 4.0 International License.
All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
How to Cite
Similar Articles
- Jeswin Sabu, Kevin Biju Kulangara, Prapanch J, Stephin Mathew, Vimal Babu P, GestureMate: An AI-Driven System for Real-Time Malayalam Sign Language and Speech Translation , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
- Adithya Raj, Jibin Gigi, Lidiya Reju, Manu Emmanuel, Smitha Jacob, Footage Analysis Toolkit: A System for Semantic Video Retrieval and Structured Forensic Analysis , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
- Adithya Raj, Jibin Gigi, Lidiya Reju, Manu Emmanuel, Smitha Jacob, Footage Analysis Toolkit: A System for Semantic Video Retrieval and Structured Forensic Analysis , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
- Lida K Kuriakose, Misha Rose Joseph, R Namitha, Sheezan Niby, Tanver Ahmad Lone, Lip Reading and Reconstruction using ML , International Journal on Emerging Research Areas: Vol. 3 No. 1 (2023): IJERA
- Alfred Joe Devasia, Nandhana Kunjumon, Rahul R Krishna, Safna M S, Thomas Joseph, TalkTrace: Secure Automated Transcription and Summary Generation , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
- Arya Raj S, R Gopika Krishnan, Drishya Das, Rohith R, Jocelyn Ann Joseph, Personality Profiling Using CV Analysis , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Joel Jones, Jaick T Kurian, Jesvin Jelson Thachil, Drishya K. V., Aswin Nandakumar, A Comprehensive Review of Graph-Based Forensic Timeline Reconstruction: Analysis of the Timelance Framework , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
- Betzy Babu Thoppil, Anugrah Premachandran, Annapoorna M, Ashwin Mathew Zachariah, Bala Susan Jacob, Advanced Sensor-Based Landslide and Earthquake Detection and Alert System Utilizing Machine Learning and Computer Vision Technologies , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- Thejuskrishnan, Amal, Vyshnav M, Narayanan K, Saira Shamsudheen K S, SPEAK: An AI-Based Assistive Video Communication System for Speech and Sign Language Translation , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
- ASHNA SHAJI, ABHIRAMI P, AKSA K THOMAS, AMINA R SHAJI, GEEVA GEORGE, Assessing Inland Waterway Service Quality Using SERVQUAL and IPA Analysis , International Journal on Emerging Research Areas: Vol. 6 No. 1 (2026): IJERA
You may also start an advanced similarity search for this article.
