logo

AUDIONYX: REAL-TIME DETECTION OF AUDIO DEEPFAKES IN PHONE CALLS

Authors

  • Emmanuel J Jose

    Author
  • Fidha Fathima N S

    Author
  • Gautham Babu

    Author
  • Liya Latheef

    Author
  • Shanthi N.M

    Author

Abstract

The explosion of AI-assisted voice synthesis technologies has made audio deepfake–based fraud a greater risk, especially within telecommunication domains. These synthetic voices are one of the leading impersonation methods, attacks and scams with potentially grave security hazards. Detecting real-time deepfakes is challenging due to bandwidth limitations, codec compression, and background noise that obscure distinguishing artifacts. This paper presents Audionyx, a real-time deepfake detection framework for telephony applications. It uses a lightweight custom Convolutional Neural Network (CNN) trained on Melspectrogram abstractions to strike an optimal balance between accuracy in detection and computational efficiency. A sliding window segmentation strategy and probabilistic aggregation mechanism ensure stable and reliable detection across continuous audio streams. Experimental evaluation demonstrates excellent detection performance and low latency, testing the ability of the system to be deployed in real time. The proposed approach is a robust and scalable method for reducing fraud through voice and for improving security against impersonation attacks.

Keywords:

Audio deepfakes, real-time detection, Telephony channels, CNN-Transformer, Mel spectrogram, voice fraud detection
Views 0
Downloads 0

Published

29-05-2026

Issue

Section

Articles

How to Cite

[1]
E. J. Jose, F. F. N S, G. Babu, L. Latheef, and S. N.M, “AUDIONYX: REAL-TIME DETECTION OF AUDIO DEEPFAKES IN PHONE CALLS”, IJERA, vol. 6, no. 1, pp. 318–323, May 2026, Accessed: May 29, 2026. [Online]. Available: https://ijera.in/index.php/IJERA/article/view/360

Similar Articles

11-20 of 211

You may also start an advanced similarity search for this article.