INTERNATIONAL JOURNAL OF LATEST TECHNOLOGY IN ENGINEERING,
MANAGEMENT & APPLIED SCIENCE (IJLTEMAS)
ISSN 2278-2540 | DOI: 10.51583/IJLTEMAS | Volume XV, Issue V, May 2026
immediate, interpretable answer. The confidence score ensures they are not just handed a verdict but can see
how certain the system is, helping them decide whether to dig deeper.
No detector is a silver bullet, and the limitations of the current system have been identified and documented—
particularly with very short clips and the newest generation of neural vocoders. Future work will focus on closing
those gaps through richer features and expanded training data. But even at its current stage, the system makes a
meaningful contribution to accessible deepfake audio detection: bringing robust, accessible deepfake audio
detection out of the research lab and into the hands of people who need it.
REFERENCES
1. T. Kinnunen, M. Sahidullah, H. Delgado, M. Todisco, N. Evans, J. Yamagishi, and K. A. Lee, "The
ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection," in Proc.
Interspeech, Stockholm, Sweden, 2017, pp. 2–6.
2. A. Nautsch, X. Wang, N. Evans, T. Kinnunen, V. Vestman, M. Todisco, H. Delgado, M. Sahidullah, J.
Yamagishi, and K. A. Lee, "ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized,
Converted and Replayed Speech," IEEE Transactions on Biometrics, Behavior, and Identity Science, vol.
3, no. 2, pp. 252–265, 2021.
3. M. Sahidullah, T. Kinnunen, and C. Hanilci, "A Comparison of Features for Synthetic Speech Detection,"
in Proc. Interspeech, Dresden, Germany, 2015, pp. 2087–2091.
4. G. Lavrentyeva, S. Novoselov, A. Malinin, A. Kozlov, O. Kudashev, and V. Shchemelinin, "STC
Antispoofing Systems for the ASVspoof 2019 Challenge," in Proc. Interspeech, Graz, Austria, 2019, pp.
1033–1037.
5. H. Zhang, M. Tan, and X. Zhang, "Fake Speech Detection Using Residual Network with Transformer
Encoder," in Proc. ACM Workshop on Information Hiding and Multimedia Security, 2021, pp. 13–22.
6. R. Reimao and V. Tzerpos, "FoR: A Dataset for Synthetic Speech Detection," in Proc. International
Conference on Speech Technology and Human-Computer Dialogue (SpeD), 2019, pp. 1–8.
7. B. Li, L. Wang, T. Xu, and X. Li, "An Efficient Model for Real-Time Fake Voice Detection," Scientific
Reports, vol. 13, no. 1, p. 7867, 2023.
8. M. Alzantot, B. Balaji, and M. Srivastava, "Did You Hear That? Adversarial Examples Against Automatic
Speech Recognition," arXiv preprint arXiv:1801.00554, 2019.
9. M. Todisco, X. Wang, V. Vestman, M. Sahidullah, H. Delgado, A. Nautsch, J. Yamagishi, N. Evans, T.
Kinnunen, and K. A. Lee, "ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection," in
Proc. Interspeech, Graz, Austria, 2019, pp. 1008–1012.
10. H. Tak, J. Patino, M. Todisco, A. Nautsch, N. Evans, and J. Yamagishi, "End-to-End Anti-Spoofing with
RawNet2," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Toronto, Canada, 2021, pp. 6369–6373.