A Machine Learning Models for Classifying Fake and Real News Articles
Article Sidebar
Main Article Content
Abstract — The era where misinformation spreads rapidly across digital platforms, ability to distinguish between authentic and fabricated news has become a critical societal challenge. This project presents a machine learning-based approach to fake and real news detection using natural language processing techniques. Utilizing a labelled dataset comprising 6,335 news articles, the model analyzes both the title and content of each entry to accurately classify them as either “FAKE” or “REAL.” Pre-processing steps, including tokenization, vectorization, and noise removal, was applied to enhance text clarity. Multiple machine learning algorithms were evaluated, with performance measured through accuracy, precision, recall, and F1-score. The results underscore the efficacy of supervised learning techniques in automating the verification of news content, offering a scalable solution to combat the proliferation of misinformation in online media.
Downloads
References
Jain, A., & Upadhyay, A. (2020). Fake News Detection using Machine Learning Algorithms. International Journal of Engineering Research & Technology (IJERT), 9(05), 623–627.
Ahmed, H., Traore, I., & Saad, S. (2018). Detecting opinion spams and fake news using text classification. Security and Privacy, 1(1), e9. https://doi.org/10.1002/spy2.9
Kaggle. (2020). Fake and Real News Dataset. https://www.kaggle.com/clmentbisaillon/fake-and-real-news-dataset
Mihalcea, R., & Strapparava, C. (2009). The lie detector: Explorations in the automatic recognition of deceptive language. In Proceedings of the ACL-IJCNLP 2009 Conference (pp. 309–312).
Shu, K., Sliva, A., Wang, S., Tang, J., & Liu, H. (2017). Fake News Detection on Social Media: A Data Mining Perspective. ACM SIGKDD Explorations Newsletter, 19(1), 22–36.
Wang, W. Y. (2017). "Liar, Liar Pants on Fire": A New Benchmark Dataset for Fake News Detection. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL) (pp. 422–426).
Zhou, X., & Zafarani, R. (2018). Fake News: A Survey of Research, Detection Methods, and Opportunities. arXiv preprint arXiv:1812.00315.
Ajao, O., Bhowmik, D., & Zargari, S. (2018). Fake News Identification on Twitter with Hybrid CNN and RNN Models. In Proceedings of the 9th International Conference on social media and Society (pp. 279–287). ACM.
Chakraborty, A. (2020). Fake News Detection using Sentiment Analysis and GloVe Embeddings.
Ajao, O., Bhowmik, D., & Zargari, S. (2018). Fake News Identification on Twitter with Hybrid CNN and RNN Models. In Proceedings of the 9th International Conference on social media and Society(pp.279–287).

This work is licensed under a Creative Commons Attribution 4.0 International License.
All articles published in our journal are licensed under CC-BY 4.0, which permits authors to retain copyright of their work. This license allows for unrestricted use, sharing, and reproduction of the articles, provided that proper credit is given to the original authors and the source.