A Comprehensive Review on Deep Learning Approaches for Classifying Real and AI generated Images
Article Sidebar
Main Article Content
Due to the realm of the blooming generative models, like GANs, VAE and diffusion-based environment, the trustworthiness of authentication in visual media has been vastly challenged. AI-generated images appearing quite photorealistic go beyond human perceptual boundaries often overlying concerns for disinformation, digital tampering, forensic evidence authentication, and or the security of biometric purposes. This review organizes throughout the chronology of deep learning advancement to segregate real from synthetic images by focusing not much on its generalization snags, extraction of fake image or artifacts from sustainability in the aspect of datasets. Also this review provides systematic analysis on publicly available datasets and those essential research directions in the future being necessary to become a robust detection mechanism for fake images.
Downloads
References
S. Mohammadjafari, “Improved 3D α GAN for Generating Connected Volumes,” arXiv preprint, 2022.
S. Sabnam, “Application of Generative Adversarial Networks in Image, Text-to-Image and Medical Imaging,” International Journal of Pattern Recognition and Artificial Intelligence, 2024.
D. Ruan, “Improvement of Generative Adversarial Network and Its Application to Bearing Fault Data Augmentation,” MDPI, 2023.
Z. Wang, T. Pang, C. Du, M. Lin, W. Liu, and S. Yan, “Better Diffusion Models Further Improve Adversarial Training,” arXiv preprint, 2023.
R. Huang, J. Han, G. Lu, X. Liang, Y. Zeng, W. Zhang, and H. Xu, “DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability,” arXiv preprint, 2023.
A. Hatamizadeh, J. Song, G. Liu, J. Kautz, and A. Vahdat, “DiffiT: Diffusion Vision Transformers for Image Generation,” arXiv preprint, 2023.
S. Azizi, S. Kornblith, C. Saharia, M. Norouzi, and D. Fleet, “Synthetic Data from Diffusion Models Improves ImageNet Classification,” arXiv preprint, 2023.
K. Tian, Y. Jiang, Z. Yuan, B. Peng, and L. Wang, “Visual AutoRegressive Modeling: Scalable Image Generation via Next-Scale Prediction,” NeurIPS, 2024.
X. Tang, et al., “Image Generation Method Based on Improved Diffusion Models,” SPIE Conference on Computational Imaging, 2025.
Q. Yu, et al., “Randomized Autoregressive Visual Generation,” ICCV, 2025.
T. Li, et al., “Autoregressive Image Generation Without Vector Quantization via Diffusion Loss,” NeurIPS, 2024.
A. Kingma and M. Welling, “Auto-Encoding Variational Bayes,” ICLR, 2014.
K. Lipianina Honcharenko, M. Telka, and N. Melnyk, “Comparison of ResNet, EfficientNet, and Xception architectures for deepfake video detection,” CEUR Workshop Proc., vol. 3899, 2024.
B. Yasser, J. Hani, S. M. Elgayar, and O. Abdelhameed, “Deepfake Detection Using EfficientNet B4 and XceptionNet,” ICICIS / ResearchGate, 2024.
H. Lin, W. Luo, K. Wei, and M. Liu, “Improved Xception with Dual Attention Mechanism and Feature Fusion for Face Forgery Detection,” arXiv preprint, 2021.
A. Qadir et al., “An Efficient Deepfake Video Detection Using Pre trained ResNet CNN,” Journal / Elsevier, 2024.
V. D., J. S., G. J., and S. S., “Hybrid Deep Learning Approach for Deepfake Detection Using ResNet50 and EfficientNet B0,” IROIIP Journal, 2025.
D. Wodajo and S. Atnafu, “Deepfake Video Detection Using Convolutional Vision Transformer,” arXiv preprint, 2021.
Y.-J. Heo, et al., “Deepfake Detection Scheme Based on Vision Transformer and Distillation,” DeepAI, 2021.
A. Al Jallad, et al., “DFDT: An End-to-End DeepFake Detection Framework Using Vision Transformer,” Applied Sciences, vol. 12, no. 6, pp.2953, 2022.
P. M. Thuan, B. T. Lam, and P. D. Trung, “DSViT: An Enhanced Transformer Model for Deepfake Detection,” Journal of Science and Technology on Information Security, vol. 2, no. 22, 2024.
D. Nguyen, M. Astrid, E. Ghorbel, and D. Aouada, “FakeFormer: Efficient Vulnerability Driven Transformers for Generalisable Deepfake Detection,” arXiv preprint, 2024.
L. Zhao, M. Zhang, H. Ding, and X. Cui, “MFF Net: Deepfake Detection Network Based on Multi Feature Fusion,” Entropy, vol. 23, no. 12, p. 1692, 2021. MDPI+1
“A Spatial-Frequency Aware Multi-Scale Fusion Network for Real-Time Deepfake Detection,” Fraunhofer, 2024. deepfake-demo.aisec.fraunhofer.de
“Two Stream Xception Structure Based on Feature Fusion for DeepFake Detection,” Int. J. Computational Intelligence Systems, vol.16, article 134, 2023. SpringerLink
“Multi-scale Deepfake Detection Method with Fusion of Spatial Features,” ECICE06 Journal, 2023. ECICE06
X. Qiu, X. Miao, F. Wan, H. Duan, T. Shah, V. Ojhab, Y. Long, and R. Ranjan, “D2Fusion: Dual-domain Fusion with Feature Superposition for Deepfake Detection,” arXiv preprint, Mar. 2025.

This work is licensed under a Creative Commons Attribution 4.0 International License.
All articles published in our journal are licensed under CC-BY 4.0, which permits authors to retain copyright of their work. This license allows for unrestricted use, sharing, and reproduction of the articles, provided that proper credit is given to the original authors and the source.