Towards Accurate Student Performance Prediction: An Assessment of Machine Learning Models and Metrics
Article Sidebar
Main Article Content
Predicting students’ academic outcomes has become a central focus within Educational Data Mining (EDM) to support early academic interventions, minimize dropout risks, and promote personalized learning strategies. The availability of diverse educational data—ranging from demographic details and previous grades to behavioral engagement and digital learning activity—has encouraged the adoption of machine learning (ML) approaches for this purpose. This analysis synthesizes major ML techniques applied to student performance prediction, including regression models, decision trees, random forests, support vector machines, k-nearest neighbors, naïve Bayes classifiers, artificial neural networks, gradient boosting methods such as XGBoost and LightGBM, and deep learning models like CNNs and RNNs. Common evaluation measures—such as accuracy, precision, recall, F1-score, ROC-AUC, MAE, MSE, RMSE, and R²—are also examined. Despite their potential, challenges persist, including inconsistent data quality, complex feature selection, privacy and ethical concerns, limited interpretability, poor generalization across institutions, and the need for frequent model updates. Future research should emphasize explainable AI (XAI), temporal modeling techniques, integration of behavioral and psychological indicators, and transfer learning to enhance scalability and adaptability. Overall, this study highlights the promise of ML-driven systems in improving educational outcomes when combined with ethical, interpretable, and context-aware design principles.
Downloads
References
Kumar, S., Kumar, H., Kumar, G., Singh, SP., Bijalwan, A., Diwakar, M., 2024. A methodical exploration of imaging modalities from dataset to detection through machine learning paradigms in prominent lung disease diagnosis: a review. BMC Medical Imaging, 24(1), 30. https://link.springer.com/content/pdf/10.1186/s12880-024-01192-w.pdf
Haziq, R., Chwen, JC., 2022. Educational Data Mining for Student Performance Prediction: A Systematic Literature Review (2015-2021). International Journal of Emerging Technologies in Learning, 17(05), 147-179. DOI: 10.3991/ijet.v17i05.27685
Alnasyan, B., Basheri, M., Alassafi, M., 2024. The power of Deep Learning techniques for predicting student performance in Virtual Learning Environments: A systematic literature review. Computers and Education: Artificial Intelligence, 100231. https://www.sciencedirect.com/science/article/pii/S2666920X24000328
Pelima, LR., Sukmana, Y., Rosmansyah, Y., 2024. Predicting university student graduation using academic performance and machine learning: a systematic literature review. IEEE Access, 12, 2345123465. https://ieeexplore.ieee.org/iel7/6287639/6514899/10418878.pdf
Yağcı, M., 2022. Educational data mining: prediction of students' academic performance using machine learning algorithms. Smart Learning Environments, 9(11).
Athanasios, A., John, A., Markos, K., Alkiviadis, T., 2025. Predicting Student Performance and Enhancing Learning Outcomes: A Data-Driven Approach Using Educational Data Mining Techniques. Computers, 14(3), 83. https://doi.org/10.3390/computers14030083)
Yaosheng, L., Kimberly, FC., 2025. Performance prediction using educational data mining techniques: a comparative study. Discover Education, 4(12), 112.
Zhu, Y., Ma, Y., Fan, F., Huang, J., Wu, K., Wang, G., 2024. Towards accurate infrared small target detection via edge-aware gated transformer. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing. https://ieeexplore.ieee.org/iel7/4609443/4609444/10496183.pdf
John-Bosco, D., Icon, Mekala, MS., Ulric, SA., John, I., Eyad, E., 2025. Information & Communications Technology in Education Predicting student next-term performance in degree programs using AI-based approach: a case study from Ghana, Cogent Education, 12(1). https://doi.org/10.1080/2331186X.2025.2481000)
Wu, S., Cao, Y., Cui, J., Li, R., Qian, H., Jiang, B., Zhang, W., 2024. A comprehensive exploration of personalized learning in smart education: From student modeling to personalized recommendations. arXiv preprint arXiv:2402.01666. https://arxiv.org/pdf/2402.01666
Natarajan, SK., Shanmurthy, P., Arockiam, D., Balusamy, B., Selvarajan, S., 2024. Optimized machine learning model for air quality index prediction in major cities in India. Scientific reports, 14(1), 6795. https://www.nature.com/articles/s41598-024-54807-1.pdf
Boutahri, Y., Tilioua, A., 2024. Machine learning-based predictive model for thermal comfort and energy optimization in smart buildings. Results in Engineering, 22, 102148. https://www.sciencedirect.com/science/article/pii/S259012302400402X
Bressane, A., Zwirn, D., Essiptchouk, A., Saraiva, ACV., de Campos Carvalho, FL., Formiga, JKS., Negri, RG., 2024. Understanding the role of study strategies and learning disabilities on student academic performance to enhance educational approaches: A proposal using artificial intelligence.
Computers and Education: Artificial Intelligence, 6, 100196. https://www.sciencedirect.com/science/article/pii/S2666920X23000759
Ahmed, QO., 2024. Machine learning for intrusion detection in cloud environments: A comparative study. Journal of Artificial Intelligence General Science (JAIGS). https://www.researchgate.net/profile/Qazi-Omair-Ahmed/publication/387076339_Machine_Learning_for_Intrusion_Detection_in_Cloud_Environments _A_Comparative_Study/links/676173c2e9b25e24af5c961a/Machine-Learning-for-Intrusion-Detectionin-Cloud-Environments-A-Comparative-Study.pdf
Oyucu, S., Doğan, F., Aksöz, A., Biçer, E., 2024. Comparative analysis of commonly used machine learning approaches for Li-ion battery performance prediction and management in electric vehicles. Applied Sciences, 14(6), 2306. https://www.mdpi.com/2076-3417/14/6/2306/pdf
Archana, R., Jeevaraj, PE., 2024. Deep learning models for digital image processing: a review. Artificial Intelligence Review, 57(1), 11. https://link.springer.com/content/pdf/10.1007/s10462-023-10631-z.pdf
Idowu, JA., 2024. Debiasing education algorithms. International Journal of Artificial Intelligence in Education, 1-31. https://link.springer.com/content/pdf/10.1007/s40593-023-00389-4.pdf
Ikegwu, AC., Nweke, HF., Anikwe, CV., 2024. Recent trends in computational intelligence for educational big data analysis. Iran Journal of Computer Science, 7(1), 103-129. https://www.researchgate.net/profile/Henry-Nweke-3/publication/373738289_Recent_trends_in_computational_intelligence_for_educational_big_data_ana lysis/links/67c74dc28311ce680c7cb159/Recent-trends-in-computational-intelligence-for-educationalbig-data-analysis.pdf
Hussain, T., Yu, L., Asim, M., Ahmed, A., Wani, MA., 2024. Enhancing e-learning adaptability with automated learning style identification and sentiment analysis: a hybrid deep learning approach for smart education. Information, 15(5), 277. https://www.sciencedirect.com/science/article/pii/S1532046424000406
Manzoor, A., Qureshi, MA., Kidney, E., Longo, L., 2024. A review on machine learning methods for customer churn prediction and recommendations for business practitioners. IEEE access. https://ieeexplore.ieee.org/iel7/6287639/6514899/10531735.pdf
Gul, MN., Waseem, A., Muhammad, ZB., Abeer, A., Muhammad, A., 2025. Data driven decisions in education using a comprehensive machine learning framework for student performance prediction, 28, 153. https://doi.org/10.1007/s10791-025-09585-3
Baniata, LH., Kang, S., Alsharaiah, MA., Baniata, MH., 2024. Advanced deep learning model for predicting the academic performances of students in educational institutions. Appl Sciences, 14(5), 1963. https://doi.org/10.3390/app14051963
Hussain, S., Khan, MQ., 2023. Student-performulator: predicting students’ academic performance at secondary and intermediate level using machine learning. Ann Data Science, 10(3), 637–55. https://link.springer.com/article/10.1007/s40745-021-00341-0
Sarker, IH., 2021. Machine Learning: Algorithms, real-world applications and research directions. SN Comput. Science, 2, 160. https://link.springer.com/article/10.1007/s42979-021-00592-x
Kotsiantis, SB., 2011. Use of machine learning techniques for educational proposes: a decision support system for forecasting students’ grades. Artificial Intelligence Review, 37(4), 331–344. https://doi.org/10.1007/s10462-011-9234-x
Kim, SJ., Bae, SJ., Jang, MW., 2022. Linear Regression Machine Learning Algorithms for Estimating Reference Evapotranspiration Using Limited Climate Data. Sustainability, 14, 11674. https://doi.org/10.3390/su141811674
Peling, IBA., Arnawan, IN., Arthawan, IPA., Janardana, IGN., 2017. Implementation of data mining to predict period of students study using naive bayes algorithm. International Journal of Engineering and Emerging Technology, 2(1), 53. 10.24843/IJEET.2017.v02.i01.p11

This work is licensed under a Creative Commons Attribution 4.0 International License.
All articles published in our journal are licensed under CC-BY 4.0, which permits authors to retain copyright of their work. This license allows for unrestricted use, sharing, and reproduction of the articles, provided that proper credit is given to the original authors and the source.