INTERNATIONAL JOURNAL OF LATEST TECHNOLOGY IN ENGINEERING,
MANAGEMENT & APPLIED SCIENCE (IJLTEMAS)
ISSN 2278-2540 | DOI: 10.51583/IJLTEMAS | Volume XIV, Issue XII, December 2025
where policy support is most critical, underscoring the need for higher-resolution climate and soil data and more
consistent district-level reporting to improve future model robustness and fairness.
RECOMMENDATIONS FOR MODEL IMPROVEMENT
Future iterations should integrate satellite-derived vegetation indices (NDVI, EVI, LST), broader climate
features, and remote sensing for microclimate mapping. Expanding open-source datasets and cross-institutional
collaboration will enhance model robustness and transferability.
CONCLUSION
This research demonstrates that machine learning, when paired with systematic interpretation and granular
spatial analysis, can revolutionise district-level agricultural yield prediction in India. The developed framework
is transparent, data-driven, and fully reproducible, establishing a new benchmark for scalable agricultural
intelligence. By bridging machine learning rigours with actionable policy tools, we support a resilient and
sustainable path toward national food security.
FUTURE WORK
The current methodology provides a strong foundation for crop yield analytics but offers scope for significant
enhancement in several key areas. Future work will focus on integrating high-resolution remote sensing and
meteorological data such as satellite-derived vegetation indices (e.g., NDVI, LST), dynamic weather inputs, and
soil health metrics to improve the spatial-temporal adaptability of models, particularly in rainfed and rapidly
evolving districts. Advancements in temporal and spatiotemporal modelling through hybrid frameworks, such
as combining Random Forest (RF) with Long Short-Term Memory (LSTM) networks, will enable better learning
from time-series dependencies, enhancing near-term forecasting and anomaly detection. Automated, real-time
data pipelines developed with modular, API-driven platforms will facilitate seamless ingestion of district-level,
satellite, and weather data, enabling operational yield forecasting for government and institutional stakeholders.
Strengthening participatory data collection, including farmer-reported area and production data, will refine
ground truth inputs and improve model accuracy in hard-to-monitor regions. Also, open benchmarking and
reproducibility standards that allow for clear sharing of code, datasets, and performance metrics will encourage
collaboration, new ideas, and trust in digital agriculture. Collectively, these advancements will transform the
current yield estimation framework into a fully operational, nationally scalable agri-intelligence system that
supports India’s vision for sustainable, resilient, and food-secure agricultural development.
ACKNOWLEDGEMENTS
We gratefully acknowledge all contributors and advisors named in the originating documentation, as well as the
developers of open-source Python libraries instrumental to this research.
REFERENCES
2. “Ministry of Agriculture & Farmers Welfare Department of Agriculture and Farmers’ Welfare releases
Final Estimates of major agricultural crops for 2023-24.” [Online]. Available:
3. S. Saiful and N. B. Wibisono, “Crop Yield Prediction Using Random Forest Algorithm and XGBoost
Machine Learning Model,” International Journal of Research and Innovation in Social Science, vol. IX,
no. III, pp. 1983–1994, Apr. 2025, doi: 10.47772/IJRISS.2025.90300155.
4. R. Prathiba, D. Sri Harsha, D. Madhu, D. Chaitanya Venkata Ajay, and D. Harsha Vardhan Assistant
Professor, “International Journal of Innovative Research in Science Engineering and Technology
(IJIRSET) Crop Yield Prediction using Random Forest Algorithm”, doi:
10.15680/IJIRSET.2025.1404465.
Page 249