Pixelyse: ViT-VAE for Document Forgery Detection

Authors

  • Mishal Rose Thankachan

    Author
  • Joshua John Sajit

    Author
  • Merwin Maria Antony

    Author
  • Richa Maria Biju

    Author
  • Bini M Issac

    Author

Abstract

Ensuring the authenticity of documents is more important than ever, as forgery techniques continue to evolve. Traditional methods, which rely on predefined rules and handcrafted features, often struggle to adapt to new types of fraud. To address this, we propose a Vision Transformer-based Variational Autoencoder (ViT-VAE) designed to enhance document authentication. By combining the Vision Transformer’s ability to capture intricate details with the Variational Autoencoder’s capability to model genuine document patterns, our approach effectively detects anomalies based on reconstruction errors. This fusion of self-attention mechanisms and probabilistic modeling improves accuracy and adaptability in identifying forged elements. Our experiments on diverse datasets show that ViT-VAE outperforms conventional machine learning and deep learning methods, offering a more reliable solution for document security. These findings open the door for further advancements in fraud detection and verification technologies, strengthening trust in digital and physical documentation.
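
The reconstruction-error idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the mean-plus-k-standard-deviations threshold rule, and the stand-in `reconstruct` callable (a placeholder for a trained ViT-VAE) are all assumptions made for illustration. The sketch scores each image patch by the mean squared error between the patch and its reconstruction, then flags patches whose error is unusually high compared with errors measured on genuine documents.

```python
from statistics import mean, stdev


def reconstruction_error(original, reconstructed):
    """Mean squared error between two equally sized flat pixel sequences."""
    if len(original) != len(reconstructed):
        raise ValueError("inputs must have the same length")
    return mean((o - r) ** 2 for o, r in zip(original, reconstructed))


def fit_threshold(genuine_errors, k=3.0):
    """Assumed rule: threshold = mean + k * std of errors on genuine documents."""
    return mean(genuine_errors) + k * stdev(genuine_errors)


def flag_patches(patches, reconstruct, threshold):
    """Return indices of patches whose reconstruction error exceeds the threshold."""
    return [
        i
        for i, patch in enumerate(patches)
        if reconstruction_error(patch, reconstruct(patch)) > threshold
    ]


# Hypothetical stand-in for a trained model: it only "knows" mid-gray content,
# so patches that deviate strongly from 0.5 reconstruct poorly.
def toy_reconstruct(patch):
    return [0.5] * len(patch)


# Illustrative error values, not measured results.
threshold = fit_threshold([0.010, 0.012, 0.011, 0.009, 0.013])
patches = [
    [0.5, 0.6, 0.4, 0.5],  # close to what the stand-in model reproduces
    [0.9, 0.1, 0.9, 0.1],  # far from it, so the error is large
]
suspicious = flag_patches(patches, toy_reconstruct, threshold)
print(suspicious)
```

In a real pipeline the `reconstruct` callable would wrap the trained model's encode-decode pass, and the threshold would be calibrated on a held-out set of genuine documents rather than on the made-up values used here.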

Keywords:

Deep learning, Vision Transformer-based Variational Autoencoder, fraud detection, forgery detection

Published

21-04-2026

Section

Articles

How to Cite

[1]
M. R. Thankachan, J. John Sajit, M. M. Antony, R. M. Biju, and B. M. Issac, “Pixelyse: ViT-VAE for Document Forgery Detection”, IJERA, vol. 5, no. 1, pp. 308–313, Apr. 2026, Accessed: Apr. 21, 2026. [Online]. Available: https://ijera.in/index.php/IJERA/article/view/282
