Predicting Pest and Disease Occurrence Using Synthetic Data and Explainable Machine Learning Methods

Article Sidebar

Main Article Content

Priyanka Balley
Prof. Kanchan K. Doke

Prediction of occurrence for pests and diseases is an essential problem for agriculture, as such events have a huge influence on the productivity of the crop with regard to the security of food production. Traditional methods lack datasets and tend not to incorporate domain knowledge, which leads to suboptimal performance with limited sets of interpretation. This study addresses such gaps by developing a systematic machine learning-based framework for combining synthetic data generation, robust predictive modeling, and explainability techniques to produce actionable insights in pest and disease dynamics. Synthetic datasets are first generated based on the domain-driven logic simulating the correlations between critical environmental and biological factors such as temperature, humidity, rainfall, pest lifecycle stage, and soil moisture and the incidence of pests or diseases. For interpretability, Local Interpretable Model-agnostic Explanations LIME with Random Forest provides localized, instance-level insights on feature contributions to individual predictions. For complement, permutation importance calculates the global relevance of every feature by assessing its effect on model performance. Both of these techniques ensure that fine-grained and holistic understanding is achieved regarding the model's behavior. This integrated approach therefore addresses the limitations of traditional methods by improving the predictive accuracy and enhancing interpretability. The findings have tremendous implications for precision agriculture in order to allow stakeholders to put into action data-driven strategies for pest and disease management. This framework is reproducible and therefore adaptable to different contexts in agriculture sets.

Predicting Pest and Disease Occurrence Using Synthetic Data and Explainable Machine Learning Methods. (2026). International Journal of Latest Technology in Engineering Management & Applied Science, 15(5), 413-421. https://doi.org/10.51583/IJLTEMAS.2026.150500037

Downloads

References

Chithambarathanu, M., Jeyakumar, M.K. Survey on crop pest detection using deep learning and machine learning approaches. Multimed Tools Appl 82, 42277–42310 (2023). https://doi.org/10.1007/s11042-023-15221-3

Sailaja, B., Gayatri, S., Rathod, S. et al. Spatial temperature prediction—a machine learning and GIS perspective. Theor Appl Climatol 155, 9619–9642 (2024). https://doi.org/10.1007/s00704-024-05167-3

Saravanan, K.S., Bhagavathiappan, V. Prediction of crop yield in India using machine learning and hybrid deep learning models. Acta Geophys. 72, 4613–4632 (2024). https://doi.org/10.1007/s11600-024-01312-8

Kuppan, P., Priya, V.V. Crop Yield Prediction Using Ensemble Machine Learning Techniques. SN COMPUT. SCI. 5, 1160 (2024). https://doi.org/10.1007/s42979-024-03536-3

Shinde, N., Ambhaikar, A. An efficient plant disease prediction model based on machine learning and deep learning classifiers. Evol. Intel. 18, 14 (2025). https://doi.org/10.1007/s12065-024-01000-y

Venkatasaichandrakanth, P., Iyapparaja, M. Review on Pest Detection and Classification in Agricultural Environments Using Image-Based Deep Learning Models and Its Challenges. Opt. Mem. Neural Networks 32, 295–309 (2023). https://doi.org/10.3103/S1060992X23040112

Attri, I., Awasthi, L.K. & Sharma, T.P. Machine learning in agriculture: a review of crop management applications. Multimed Tools Appl 83, 12875–12915 (2024). https://doi.org/10.1007/s11042-023-16105-2

Nithya, V., Josephine, M.S. & Jeyabalaraja, V. Comparative approach on crop detection using machine learning and deep learning techniques. Int J Syst Assur Eng Manag 15, 4636–4648 (2024). https://doi.org/10.1007/s13198-024-02483-9

Anand, S., Sandhu, S.K., Biswas, B. et al. Comparative analysis of different Karnal bunt disease prediction models developed by machine learning techniques for Punjab conditions. Int J Biometeorol 68, 1799–1810 (2024). https://doi.org/10.1007/s00484-024-02707-4

Verma, R., Kushwaha, K.P.S., Bijlwan, A. et al. Enhancing urad bean (Vigna mungo L.) crop management with machine learning: Predictive analysis of pod rot severity and pod bug incidence patterns. Australasian Plant Pathol. 53, 273–283 (2024). https://doi.org/10.1007/s13313-024-00967-7

Chacón-Maldonado, A.M., Melgar-García, L., Asencio-Cortés, G. et al. A novel method based on hybrid deep learning with explainability for olive fruit pest forecasting. Neural Comput & Applic (2024). https://doi.org/10.1007/s00521-024-10731-z

Nithya, V., Josephine, M.S. & Jeyabalaraja, V. IoT-Based Crop Yield Prediction System in Indian Sub-continent Using Machine Learning Techniques. Remote Sens Earth Syst Sci 6, 156–166 (2023). https://doi.org/10.1007/s41976-023-00097-6

Mandrapa, B., Spohrer, K., Wuttke, D. et al. Machine learning-based hyperspectral wavelength selection and classification of spider mite-infested cucumber leaves. Exp Appl Acarol 93, 627–644 (2024). https://doi.org/10.1007/s10493-024-00953-0

Abdel-salam, M., Kumar, N. & Mahajan, S. A proposed framework for crop yield prediction using hybrid feature selection approach and optimized machine learning. Neural Comput & Applic 36, 20723–20750 (2024). https://doi.org/10.1007/s00521-024-10226-x

Ahmed, I., Yadav, P.K. Predicting Apple Plant Diseases in Orchards Using Machine Learning and Deep Learning Algorithms. SN COMPUT. SCI. 5, 700 (2024). https://doi.org/10.1007/s42979-024-02959-2

Article Details

How to Cite

Predicting Pest and Disease Occurrence Using Synthetic Data and Explainable Machine Learning Methods. (2026). International Journal of Latest Technology in Engineering Management & Applied Science, 15(5), 413-421. https://doi.org/10.51583/IJLTEMAS.2026.150500037