Abstract
Rice cultivation is the major source of earning revenues worldwide. The productivity and yield of rice crops mainly depend on soil water balance and soil fertility. Irrigation water requirement (IWR) analysis helps to retain appropriate soil water balance and judiciously allocate water resources considering vegetative, reproductive, and ripening stages of rice growth. To restore fertility, the application of fertilizers is inevitable but most of these are squandered owing to improper fertilizer selection without evaluation of soil macro-nutrients. So, the enhancement of rice yield demands the well-balanced application of fertilizers along with specific IWR analysis in each growth stage. In this paper, eXtreme Gradient Boosting (XGBoost) is used to extract high-scoring, correlated environmental parameters with IWR. Stacking-based ensemble learning is used to predict evapotranspiration since it is a very crucial indicator of rice water demand in different growth stages. Based on selected features of XGBoost and predicted evapotranspiration, IWR specific to all the rice growth stages is predicted using the Bayesian genetic algorithm (\(Bay_{GA}\)) hyper-tuned random forest (RF). The parameters of the maximum and the minimum number of samples required to be at the leaf node of RF are hyper-tuned using \(Bay_{GA}\) to optimize performance. Comparative results indicate that IWR prediction using \(Bay_{GA}-RF\) outperforms other methods with Accuracy (86.12, 92.42, 91.24), MSE (0.182, 0.162, 0.196), RMSE (0.426, 0.402, 0.442), MAE (0.193, 0.174, 0.205) and NSE (0.911, 0.952, 0.944) in Vegetative, Reproductive and Ripening rice growth stages and accuracy of \(98\%\) to predict suitable fertilizer depending on Nitrogen, Phosphorous, and Potassium soil macro-nutrients.
Similar content being viewed by others
Data availability
Data will be made available on request.
Code availability
No codes are made available for sharing at present.
References
Ali M, Mubarak S (2017) Effective rainfall calculation methods for field crops: an overview, analysis and new formulation. Asian Res J Agric 7:1–12
Allen R, Pereira L, Raes D et al (1998) Crop evapotranspiration-guidelines for computing crop water requirements-FAO irrigation and drainage paper 56. Fao, Rome 300(9):D05109
Antanasijevic D, Pocajt V, Peric-Grujic A et al (2014) Modelling of dissolved oxygen in the Danube river using artificial neural networks and monte carlo simulation uncertainty analysis. J Hydrol 519:1895–1907. https://doi.org/10.1016/j.jhydrol.2014.10.009
Bai Y, Yue W, Ding C (2022) Optimize the irrigation and fertilizer schedules by combining DSS at and genetic algorithm. Environ Sci Pollut Res. https://doi.org/10.1007/s11356-022-19525-z
Breiman L (2001) Random forests. Mach Learn 45(1):5–32. https://doi.org/10.1023/A:1010933404324
Chen T, Guestrin C (2016a) Xgboost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining pp 785–794. https://doi.org/10.3390/atmos10070373
Chen T, Guestrin C (2016b) Xgboost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery data mining, San Francisco, pp 785-794. https://doi.org/10.48550/arXiv.1603.02754
Chia M, Huang Y, Koo C (2021) Swarm-based optimization as stochastic training strategy for estimation of reference evapotranspiration using extreme learning machine. Agric Water Manag 243(106):447. https://doi.org/10.1016/j.agwat.2020.106447
Chung Y, Char I, Guo H, et al (2021) Uncertainty toolbox: an open-source library for assessing, visualizing, and improving uncertainty quantification.
Dasgupta A, Daruka A, Pandey A, et al (2019) Smart irrigation: Iot-based irrigation monitoring system. In: Proceedings of international ethical hacking conference Springer, Singapore pp 395–403. https://doi.org/10.1007/978-981-13-1544-2_32
Djaman K, Mel V, Boye A et al (2020) Rice genotype and fertilizer management for improving rice productivity under saline soil conditions. Paddy Water Environ 18(1):43–57. https://doi.org/10.1007/s10333-019-00763-w
Gao M, Yin L, Ning J (2018) Artifcial neural network model for ozone concentration estimation and monte carlo analysis. Atmos Environ 184:129–139. https://doi.org/10.1016/j.atmosenv.2018.03.027
Ghamarnia H (2019) Estimation of rice cultivar (AMBERBO) water requirement and crop coefficients using lysimeter under non-flooding irrigation conditions. J Rice Sci 1(2):1–6
Ghorbani M, Zadeh H, Isazadeh M et al (2016) A comparative study of artificial neural network (mlp, rbf) and support vector machine models for river flow prediction. Environ Earth Sci. https://doi.org/10.1007/s12665-015-5096-x
Goap A, Sharma D, Shukla AK et al (2018) An IOT based smart irrigation management system using machine learning and open source technologies. Comput Electron Agric 155:41–49. https://doi.org/10.1016/j.compag.2018.09.040
He B, Jia B, Zhao Y et al (2022) Estimate soil moisture of maize by combining support vector machine and chaotic whale optimization algorithm. Agric Water Manag 267(107):618. https://doi.org/10.1016/j.agwat.2022.107618
Jayalakshmi M, Gomathi V (2020) Sensor-cloud based precision agriculture approach for intelligent water management. Int J Plant Prod 14:177–186. https://doi.org/10.1007/s42106-019-00077-1
Khaydar D, Chen X, Huang Y et al (2021) Investigation of crop evapotranspiration and irrigation water requirement in the lower AMU DARYA river Basin, central Asia. J Arid Land 13(1):23–39. https://doi.org/10.1007/s40333-021-0054-9
Kuzman B, Petkovic B, Denic N et al (2021) Estimation of optimal fertilizers for optimal crop yield by adaptive neuro fuzzy logic. Rhizosphere 18(100):358. https://doi.org/10.1016/j.rhisph.2021.100358
Li Q, Wang Z, Shangguan W et al (2021) Improved daily SMAP satellite soil moisture prediction over china using deep learning model with transfer learning. J Hydrol 600(126):698. https://doi.org/10.1016/j.jhydrol.2021.126698
Luo W, Chen M, Kang Y et al (2022) Analysis of crop water requirements and irrigation demands for rice: implications for increasing effective rainfall. Agric Water Manag 260(107):285. https://doi.org/10.1016/j.agwat.2021.107285
Martin J, Saez JA, Corchado E (2021) On the suitability of stacking-based ensembles in smart agriculture for evapotranspiration prediction. Appl Soft Comput 108(107):509. https://doi.org/10.1016/j.asoc.2021.107509
Ming D, Zhou T, Wang M et al (2016) Land cover classification using random forest with genetic algorithm-based parameter optimization. J Appl Remote Sens 10(3):35021. https://doi.org/10.1117/1.JRS.10.035021
Moazenzadeh R, Mohammadi B, Safari M et al (2022) Soil moisture estimation using novel bio-inspired soft computing approaches. Eng Appl Comput Fluid Mech 16:826–840. https://doi.org/10.1080/19942060.2022.2037467
Mohammadi B, Mehdizadeh S (2020) Modeling daily reference evapotranspiration via a novel approach based on support vector regression coupled with whale optimization algorithm. Agric Water Manag 237(106):145. https://doi.org/10.1016/j.agwat.2020.106145
Mostafa S (2019) Imputing missing values using cumulative linear regression. CAAI Trans Intell Technol 4:182–200. https://doi.org/10.1049/trit.2019.0032
Noori R, Hoshyaripour G, Ashra K et al (2010) Uncertainty analysis of developed ANN and ANFIS models in prediction of carbon monoxide daily concentration. Atmos Environ 44:476–482. https://doi.org/10.1016/j.atmosenv.2009.11.005
Ogasawara E, Martinez L, De Oliveira D, et al (2010) Adaptive normalization: a novel data normalization approach for non- stationary time series. In: The 2010 International Joint Conference on Neural Networks (IJCNN) IEEE. https://doi.org/10.1109/IJCNN.2010.5596746
Reddy AGS (2012) Water level variations in fractured, semi-confined aquifers of Anantapur district, southern India. J Geol Soc India 80:111–118. https://doi.org/10.1007/s12594-012-0124-x
Ren X, Qu Z, Martins DS et al (2016) Daily reference evapotranspiration for hyper-arid to moist sub-humid climates in inner Mongolia, china: I. assessing temperature methods and spatial variability. Water Resour Manag 30:3769–3791. https://doi.org/10.1007/s11269-016-1385-8
Rodriguez-Galiano V, Ghimire B, Rogan J et al (2012) An assessment of the effectiveness of a random forest classifer for land-cover classifcation. ISPRS J Photogramm Remote Sens 67:93–104. https://doi.org/10.1016/j.isprsjprs.2011.11.002
Roy SK, De D (2020) Genetic algorithm based internet of precision agricultural things (IOPAT) for agriculture 4.0. Int Things. https://doi.org/10.1016/j.iot.2020.100201
Sagar BM, Cauvery NK, Abbi P, et al (2022) Analysis and prediction of cotton yield with fertilizer recommendation using gradient boost algorithm. In: Information and communication technology for competitive strategies (ICTCS 2020) Springer, Singapore pp 1143–1152. https://doi.org/10.1007/978-981-16-0739-4_105
Sharma DN, Tare V (2021) Assessment of irrigation requirement and scheduling under canal command area of upper ganga canal using crop at model. Modeling Earth Systems and Environment pp 1–11. https://doi.org/10.1007/s40808-021-01184-7
Sidhu RK, Kumar R, Rana PS (2020) Machine learning based crop water demand forecasting using minimum climatological data. Multimed Tools Appl 79(19):3109–13124. https://doi.org/10.1007/s11042-019-08533-w
Sivakumar B, Nanjundaswamy C (2021) Weather monitoring and forecasting system using IoT. Global J Eng Technol Adv 8(02):008–016
Tao H, Al-Bedyry N, Khedher K et al (2021) River water level prediction in coastal catchment using hybridized relevance vector machine model with improved grasshopper optimization. J Hydrol 598(126):477. https://doi.org/10.1016/j.jhydrol.2021.126477
Vergara BS (1991) Rice plant growth and development. In Rice Springer, pp 13–22. https://doi.org/10.1007/978-1-4899-3754-4_2
Vij A, Vijendra S, Jain A et al (2020) IOT and machine learning approaches for automation of farm irrigation system. Procedia Comput Sci 167:1250–1257. https://doi.org/10.1016/j.compag.2018.09.040
Wu J, Sun J, Liang L et al (2011) Determination of weights for ultimate cross efficiency using Shannon entropy. Expert Syst Appl 38(5):5162–5165. https://doi.org/10.1016/j.eswa.2010.10.046
Zhang L, Tan F, Li S et al (2020) Potential dynamic of irrigation water requirement for rice across northeast China. Theoret Appl Climatol 142(3):1283–1293. https://doi.org/10.1007/s00704-020-03366-2
Zhao Q, Fan J, Ning S, et al (2022) Prediction and guidance of fertilizer requirement in different growth stages of crops based on artificial neural network. In: Innovative computing, pp 1651–1655. https://doi.org/10.1007/978-981-16-4258-6_210
Zheng J, Fan J, Zhang F et al (2021) Estimation of rainfed maize transpiration under various mulching methods using modified Jarvis–Stewart model and hybrid support vector machine model with whale optimization algorithm. Agric Water Manag 249(106):799. https://doi.org/10.1016/j.agwat.2021.106799
Acknowledgements
None of the Authors received any financial support from any funding agency to carry out this research work. Computing Infrastructure of Tripura Institute of Technology, Agartala as well as National Institute of Technology, Agartala was used to prepare this research article.
Funding
The authors did not receive support from any organization for the submitted work.
Author information
Authors and Affiliations
Contributions
Conceptualization:- PM and SM Formal Analysis:- PM and BB Investigation and Methodology:- SM and PM Data Curation and Software:- PM and DB Validation:- SM and RS Visualization:- RS,DO and BB Writing—Original Draft:- PM, SM and DB Writing - Review and Editing:- SM, RS and DO Supervision:- DB and SM
Corresponding author
Ethics declarations
Competing interest
The authors have no competing interests to declare that are relevant to the content of this article. All authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or non-financial interest in the subject matter or materials discussed in this manuscript.
Ethical approval
This Research does not involve Human Participants and/or Animals. Hence ethics approval not required.
Consent to participate
As this research does not involve Human Participants and/or Animals, hence, Consent to participate not applicable.
Consent to Publish
As this research does not involve Human Participants and/or Animals, hence, Consent to publish not applicable.
Appendix 1: EXtreme Gradient Boosting
Appendix 1: EXtreme Gradient Boosting
In eXtreme Gradient Boosting (XGBoost), the training data \(a_{i}\) has been trained to predict a target variable \(b_{i}\) and an ensemble of K Classification and Regression Trees. T1 (\(a_{i}\), \(b_{i}\))...TN (\(a_{i}\), \(b_{i}\)) where \(a_{i}\) is the descriptor’s training set. To acquire the overall prediction, the XGBoost ensemble incorporates a gradient descent approach to minimize loss and evaluate the errors of the previous model.
The data d(\(a_{i}\), \(b_{i}\)): i = \(1 \rightarrow n\) with s sample of f features, while \(b_{i}\) is the predicted value expressed as:
here \(f_{j}\) is the regression tree, and \(f_{j}(a)\) is the prediction score of the jth tree to the data sample. \(N =f(a)=W_{p(a)}\) (p: \({\mathbb {R}}^{m}\) to T, \(W \in {{\mathbb {R}}}^{T}\) ), the space of regression tree, where W is the leaf weight and p is the tree mapping structure to its leaf index. T is the number of leaf nodes in the tree. Learning the function \(f_{j}\) is based on minimizing the objective function,
where l is the training loss and the regularization term \(\Omega\) penalizes model complexity. The optimal weight of the leaf can be represented as:
where \(\lambda _{1}\) and \(\lambda _{2}\) are the regularization degrees. T and \(w_{t}\) are the leaf nodes and score. Considering \(b_{i}^{'}(t)\) is predicted at t iteration, \(f_{t}\) is added to minimize the objective,
The first and the second-order gradient on l are \(\delta _{b^{'}(t-1)} l(b,b^{'(t-1)})\) and \(\delta ^{2}_{b^{'}(t-1)} l(b,b^{'(t-1)})\) denoted by \(g_{i}\) and \(h_{i}\). Thus, using the second-order Taylor expansion, the above equation can be rewritten as:
where \(g_{i}\) and \(h_{i}\) is the second-order gradient on l. It can be defined as \(I_{k}\)=\(f_{i} \vert p(a_{i})=k\) which is the instance of leaf k. Thus, the above equation can be written as:
The optimal weight, \(w^{*}_{k}\) of leaf k on a fixed structure q(a) is expressed as:
whose values can be expressed as:
where \(G_{k}=\sum _{i \in I_{k}} g_{i}\), \(H_{k}= \sum _{i \in I_{k}} h_{i}\) and \(\Phi\) is the scoring function for the tree structure where smaller value indicates better tree structure. Both the gradient and second-order gradient statistics on each leaf needs to be added to get the overall reliable score before implementing the scoring algorithm.
The optimal split finding algorithm, as well as the loss reduction are similar to the ideas in Chen and Guestrin (2016b).
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Majumdar, P., Bhattacharya, D., Mitra, S. et al. Demand prediction of rice growth stage-wise irrigation water requirement and fertilizer using Bayesian genetic algorithm and random forest for yield enhancement. Paddy Water Environ 21, 275–293 (2023). https://doi.org/10.1007/s10333-023-00930-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10333-023-00930-0