Demand prediction of rice growth stage-wise irrigation water requirement and fertilizer using Bayesian genetic algorithm and random forest for yield enhancement

Majumdar, Parijata; Bhattacharya, Diptendu; Mitra, Sanjoy; Solgi, Ryan; Oliva, Diego; Bhusan, Bharat

doi:10.1007/s10333-023-00930-0

Article
Published: 28 February 2023

Paddy and Water Environment, Volume 21, pages 275–293 (2023)

Parijata Majumdar¹,
Diptendu Bhattacharya¹,
Sanjoy Mitra²,
Ryan Solgi³,
Diego Oliva⁴ &
Bharat Bhusan⁵

Abstract

Rice cultivation is a major source of revenue worldwide. The productivity and yield of rice crops depend mainly on soil water balance and soil fertility. Irrigation water requirement (IWR) analysis helps to maintain an appropriate soil water balance and to allocate water resources judiciously across the vegetative, reproductive, and ripening stages of rice growth. Fertilizer application is necessary to restore fertility, but most of it is wasted when fertilizers are selected without evaluating soil macro-nutrients. Enhancing rice yield therefore demands well-balanced fertilizer application together with IWR analysis specific to each growth stage. In this paper, eXtreme Gradient Boosting (XGBoost) is used to extract high-scoring environmental parameters that are correlated with IWR. Stacking-based ensemble learning is used to predict evapotranspiration, since it is a crucial indicator of rice water demand in the different growth stages. Based on the features selected by XGBoost and the predicted evapotranspiration, IWR specific to each rice growth stage is predicted using a random forest (RF) hyper-tuned with the Bayesian genetic algorithm ($Bay_{GA}$). The maximum and minimum number of samples required at a leaf node of the RF are hyper-tuned using $Bay_{GA}$ to optimize performance. Comparative results indicate that IWR prediction using $Bay_{GA}$-RF outperforms other methods, with Accuracy (86.12, 92.42, 91.24), MSE (0.182, 0.162, 0.196), RMSE (0.426, 0.402, 0.442), MAE (0.193, 0.174, 0.205) and NSE (0.911, 0.952, 0.944) in the vegetative, reproductive, and ripening rice growth stages, respectively, and achieves an accuracy of $98\%$ in predicting a suitable fertilizer from the nitrogen, phosphorus, and potassium soil macro-nutrients.
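The error measures quoted in the abstract can be illustrated with a minimal Python sketch. It assumes the standard textbook definitions of MSE, RMSE, MAE and the Nash–Sutcliffe efficiency (NSE); the abstract does not state the formulas the authors used, and the function name `regression_metrics` and the sample values are hypothetical.

```python
import math


def regression_metrics(observed, predicted):
    """Return MSE, RMSE, MAE and NSE for paired observed/predicted values."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(observed)
    errors = [o - p for o, p in zip(observed, predicted)]
    sse = sum(e * e for e in errors)  # sum of squared errors
    mse = sse / n
    mean_obs = sum(observed) / n
    # Spread of the observations around their mean (must be non-zero for NSE).
    variance_sum = sum((o - mean_obs) ** 2 for o in observed)
    return {
        "MSE": mse,
        "RMSE": math.sqrt(mse),
        "MAE": sum(abs(e) for e in errors) / n,
        # NSE = 1 is a perfect fit; NSE = 0 is no better than the observed mean.
        "NSE": 1.0 - sse / variance_sum,
    }


# Hypothetical stage-wise IWR values, for illustration only (not the paper's data).
observed_iwr = [2.0, 4.0, 6.0, 8.0]
predicted_iwr = [3.0, 4.0, 5.0, 8.0]
print(regression_metrics(observed_iwr, predicted_iwr))
```

Under these definitions RMSE is the square root of MSE, which is consistent with the reported pairs (for example, the square root of 0.182 is approximately 0.427).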

Data availability

Data will be made available on request.

Code availability

No code is made available for sharing at present.

References

Ali M, Mubarak S (2017) Effective rainfall calculation methods for field crops: an overview, analysis and new formulation. Asian Res J Agric 7:1–12
Allen R, Pereira L, Raes D et al (1998) Crop evapotranspiration-guidelines for computing crop water requirements-FAO irrigation and drainage paper 56. FAO, Rome 300(9):D05109
Antanasijevic D, Pocajt V, Peric-Grujic A et al (2014) Modelling of dissolved oxygen in the Danube River using artificial neural networks and Monte Carlo simulation uncertainty analysis. J Hydrol 519:1895–1907. https://doi.org/10.1016/j.jhydrol.2014.10.009
Bai Y, Yue W, Ding C (2022) Optimize the irrigation and fertilizer schedules by combining DSSAT and genetic algorithm. Environ Sci Pollut Res. https://doi.org/10.1007/s11356-022-19525-z
Breiman L (2001) Random forests. Mach Learn 45(1):5–32. https://doi.org/10.1023/A:1010933404324
Chen T, Guestrin C (2016a) XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 785–794. https://doi.org/10.1145/2939672.2939785
Chen T, Guestrin C (2016b) XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, San Francisco, pp 785–794. https://doi.org/10.48550/arXiv.1603.02754
Chia M, Huang Y, Koo C (2021) Swarm-based optimization as stochastic training strategy for estimation of reference evapotranspiration using extreme learning machine. Agric Water Manag 243:106447. https://doi.org/10.1016/j.agwat.2020.106447
Chung Y, Char I, Guo H et al (2021) Uncertainty toolbox: an open-source library for assessing, visualizing, and improving uncertainty quantification.
Dasgupta A, Daruka A, Pandey A et al (2019) Smart irrigation: IoT-based irrigation monitoring system. In: Proceedings of international ethical hacking conference. Springer, Singapore, pp 395–403. https://doi.org/10.1007/978-981-13-1544-2_32
Djaman K, Mel V, Boye A et al (2020) Rice genotype and fertilizer management for improving rice productivity under saline soil conditions. Paddy Water Environ 18(1):43–57. https://doi.org/10.1007/s10333-019-00763-w
Gao M, Yin L, Ning J (2018) Artificial neural network model for ozone concentration estimation and Monte Carlo analysis. Atmos Environ 184:129–139. https://doi.org/10.1016/j.atmosenv.2018.03.027
Ghamarnia H (2019) Estimation of rice cultivar (AMBERBO) water requirement and crop coefficients using lysimeter under non-flooding irrigation conditions. J Rice Sci 1(2):1–6
Ghorbani M, Zadeh H, Isazadeh M et al (2016) A comparative study of artificial neural network (MLP, RBF) and support vector machine models for river flow prediction. Environ Earth Sci. https://doi.org/10.1007/s12665-015-5096-x
Goap A, Sharma D, Shukla AK et al (2018) An IoT based smart irrigation management system using machine learning and open source technologies. Comput Electron Agric 155:41–49. https://doi.org/10.1016/j.compag.2018.09.040
He B, Jia B, Zhao Y et al (2022) Estimate soil moisture of maize by combining support vector machine and chaotic whale optimization algorithm. Agric Water Manag 267:107618. https://doi.org/10.1016/j.agwat.2022.107618
Jayalakshmi M, Gomathi V (2020) Sensor-cloud based precision agriculture approach for intelligent water management. Int J Plant Prod 14:177–186. https://doi.org/10.1007/s42106-019-00077-1
Khaydar D, Chen X, Huang Y et al (2021) Investigation of crop evapotranspiration and irrigation water requirement in the lower Amu Darya River Basin, Central Asia. J Arid Land 13(1):23–39. https://doi.org/10.1007/s40333-021-0054-9
Kuzman B, Petkovic B, Denic N et al (2021) Estimation of optimal fertilizers for optimal crop yield by adaptive neuro fuzzy logic. Rhizosphere 18:100358. https://doi.org/10.1016/j.rhisph.2021.100358
Li Q, Wang Z, Shangguan W et al (2021) Improved daily SMAP satellite soil moisture prediction over China using deep learning model with transfer learning. J Hydrol 600:126698. https://doi.org/10.1016/j.jhydrol.2021.126698
Luo W, Chen M, Kang Y et al (2022) Analysis of crop water requirements and irrigation demands for rice: implications for increasing effective rainfall. Agric Water Manag 260:107285. https://doi.org/10.1016/j.agwat.2021.107285
Martin J, Saez JA, Corchado E (2021) On the suitability of stacking-based ensembles in smart agriculture for evapotranspiration prediction. Appl Soft Comput 108:107509. https://doi.org/10.1016/j.asoc.2021.107509
Ming D, Zhou T, Wang M et al (2016) Land cover classification using random forest with genetic algorithm-based parameter optimization. J Appl Remote Sens 10(3):035021. https://doi.org/10.1117/1.JRS.10.035021
Moazenzadeh R, Mohammadi B, Safari M et al (2022) Soil moisture estimation using novel bio-inspired soft computing approaches. Eng Appl Comput Fluid Mech 16:826–840. https://doi.org/10.1080/19942060.2022.2037467
Mohammadi B, Mehdizadeh S (2020) Modeling daily reference evapotranspiration via a novel approach based on support vector regression coupled with whale optimization algorithm. Agric Water Manag 237:106145. https://doi.org/10.1016/j.agwat.2020.106145
Mostafa S (2019) Imputing missing values using cumulative linear regression. CAAI Trans Intell Technol 4:182–200. https://doi.org/10.1049/trit.2019.0032
Noori R, Hoshyaripour G, Ashrafi K et al (2010) Uncertainty analysis of developed ANN and ANFIS models in prediction of carbon monoxide daily concentration. Atmos Environ 44:476–482. https://doi.org/10.1016/j.atmosenv.2009.11.005
Ogasawara E, Martinez L, De Oliveira D et al (2010) Adaptive normalization: a novel data normalization approach for non-stationary time series. In: The 2010 international joint conference on neural networks (IJCNN). IEEE. https://doi.org/10.1109/IJCNN.2010.5596746
Reddy AGS (2012) Water level variations in fractured, semi-confined aquifers of Anantapur district, southern India. J Geol Soc India 80:111–118. https://doi.org/10.1007/s12594-012-0124-x
Ren X, Qu Z, Martins DS et al (2016) Daily reference evapotranspiration for hyper-arid to moist sub-humid climates in Inner Mongolia, China: I. Assessing temperature methods and spatial variability. Water Resour Manag 30:3769–3791. https://doi.org/10.1007/s11269-016-1385-8
Rodriguez-Galiano V, Ghimire B, Rogan J et al (2012) An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS J Photogramm Remote Sens 67:93–104. https://doi.org/10.1016/j.isprsjprs.2011.11.002
Roy SK, De D (2020) Genetic algorithm based internet of precision agricultural things (IOPAT) for agriculture 4.0. Internet Things. https://doi.org/10.1016/j.iot.2020.100201
Sagar BM, Cauvery NK, Abbi P et al (2022) Analysis and prediction of cotton yield with fertilizer recommendation using gradient boost algorithm. In: Information and communication technology for competitive strategies (ICTCS 2020). Springer, Singapore, pp 1143–1152. https://doi.org/10.1007/978-981-16-0739-4_105
Sharma DN, Tare V (2021) Assessment of irrigation requirement and scheduling under canal command area of Upper Ganga Canal using CROPWAT model. Modeling Earth Systems and Environment, pp 1–11. https://doi.org/10.1007/s40808-021-01184-7
Sidhu RK, Kumar R, Rana PS (2020) Machine learning based crop water demand forecasting using minimum climatological data. Multimed Tools Appl 79(19):13109–13124. https://doi.org/10.1007/s11042-019-08533-w
Sivakumar B, Nanjundaswamy C (2021) Weather monitoring and forecasting system using IoT. Global J Eng Technol Adv 8(02):008–016
Tao H, Al-Bedyry N, Khedher K et al (2021) River water level prediction in coastal catchment using hybridized relevance vector machine model with improved grasshopper optimization. J Hydrol 598:126477. https://doi.org/10.1016/j.jhydrol.2021.126477
Vergara BS (1991) Rice plant growth and development. In: Rice. Springer, pp 13–22. https://doi.org/10.1007/978-1-4899-3754-4_2
Vij A, Vijendra S, Jain A et al (2020) IoT and machine learning approaches for automation of farm irrigation system. Procedia Comput Sci 167:1250–1257. https://doi.org/10.1016/j.compag.2018.09.040
Wu J, Sun J, Liang L et al (2011) Determination of weights for ultimate cross efficiency using Shannon entropy. Expert Syst Appl 38(5):5162–5165. https://doi.org/10.1016/j.eswa.2010.10.046
Zhang L, Tan F, Li S et al (2020) Potential dynamic of irrigation water requirement for rice across northeast China. Theoret Appl Climatol 142(3):1283–1293. https://doi.org/10.1007/s00704-020-03366-2
Zhao Q, Fan J, Ning S et al (2022) Prediction and guidance of fertilizer requirement in different growth stages of crops based on artificial neural network. In: Innovative computing, pp 1651–1655. https://doi.org/10.1007/978-981-16-4258-6_210
Zheng J, Fan J, Zhang F et al (2021) Estimation of rainfed maize transpiration under various mulching methods using modified Jarvis–Stewart model and hybrid support vector machine model with whale optimization algorithm. Agric Water Manag 249:106799. https://doi.org/10.1016/j.agwat.2021.106799

Acknowledgements

None of the authors received financial support from any funding agency to carry out this research work. The computing infrastructure of Tripura Institute of Technology, Agartala, and National Institute of Technology, Agartala, was used to prepare this research article.

Funding

The authors did not receive support from any organization for the submitted work.

Author information

Authors and Affiliations

National Institute of Technology, Agartala, West Tripura, 799046, India
Parijata Majumdar & Diptendu Bhattacharya
Tripura Institute of Technology, Agartala, West Tripura, 799009, India
Sanjoy Mitra
University of California, Santa Barbara, USA
Ryan Solgi
Depto. de Ciencias Computacionales, Universidad de Guadalajara, CUCEI, Guadalajara, Mexico
Diego Oliva
Sharda University, Greater Noida, India
Bharat Bhusan

Contributions

Conceptualization: PM and SM. Formal analysis: PM and BB. Investigation and methodology: SM and PM. Data curation and software: PM and DB. Validation: SM and RS. Visualization: RS, DO and BB. Writing (original draft): PM, SM and DB. Writing (review and editing): SM, RS and DO. Supervision: DB and SM.

Corresponding author

Correspondence to Sanjoy Mitra.

Ethics declarations

Competing interest

The authors have no competing interests to declare that are relevant to the content of this article. All authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or non-financial interest in the subject matter or materials discussed in this manuscript.

Ethical approval

This research does not involve human participants and/or animals; hence, ethics approval is not required.

Consent to participate

As this research does not involve human participants and/or animals, consent to participate is not applicable.

Consent to Publish

As this research does not involve human participants and/or animals, consent to publish is not applicable.

Appendix 1: eXtreme Gradient Boosting

In eXtreme Gradient Boosting (XGBoost), an ensemble of J Classification and Regression Trees, $T_{1}(a_{i}, b_{i}), \ldots , T_{J}(a_{i}, b_{i})$, is trained on the data $a_{i}$ to predict a target variable $b_{i}$, where $a_{i}$ is the descriptor’s training set. To obtain the overall prediction, XGBoost builds the ensemble additively, using a gradient descent approach in which each new tree is fitted to reduce the loss left by the errors of the previous model.

For the data $d = \{(a_{i}, b_{i}) : i = 1, \ldots , n\}$ with n samples of m features, the predicted value $b_{i}^{'}$ is expressed as:

$$\begin{aligned} b_{i}^{'}= \sum _{j=1}^{J} f_{j}(a_{i}), f_{j} \in N \end{aligned}$$

(20)

Here $f_{j}$ is the jth regression tree, and $f_{j}(a_{i})$ is the prediction score of that tree for the data sample $a_{i}$. $N = \{f(a) = w_{p(a)}\}$ ($p: {\mathbb {R}}^{m} \rightarrow \{1, \ldots , T\}$, $w \in {\mathbb {R}}^{T}$) is the space of regression trees, where w is the vector of leaf weights, p is the tree structure that maps a sample to its leaf index, and T is the number of leaf nodes in the tree. The function $f_{j}$ is learned by minimizing the objective function,

$$\begin{aligned} \phi = \sum _{i=1}^{n} l(b_{i}, b_{i}^{'}) + \sum _{j=1}^{J} \Omega (f_{j}) \end{aligned}$$

(21)

where l is the training loss and the regularization term $\Omega$ penalizes model complexity. The regularization term can be represented as:

$$\begin{aligned} \Omega (f_{j}) = \lambda _{1}T + \frac{1}{2} \lambda _{2} \vert \vert w \vert \vert ^{2} \end{aligned}$$

(22)

where $\lambda _{1}$ and $\lambda _{2}$ are the regularization coefficients, T is the number of leaf nodes, and w is the vector of leaf scores. With $b_{i}^{'(t)}$ denoting the prediction for sample i at iteration t, the tree $f_{t}$ is added to minimize the objective,

$$\begin{aligned} \phi ^{t} = \sum _{i=1}^{n} l (b_{i}, b_{i}^{'(t-1)} + f_{t}(a_{i})) + \Omega (f_{t}) \end{aligned}$$

(23)

The first- and second-order gradients of l, $\partial _{b_{i}^{'(t-1)}} l(b_{i},b_{i}^{'(t-1)})$ and $\partial ^{2}_{b_{i}^{'(t-1)}} l(b_{i},b_{i}^{'(t-1)})$, are denoted by $g_{i}$ and $h_{i}$. Thus, using the second-order Taylor expansion and dropping the constant term $l(b_{i},b_{i}^{'(t-1)})$, the above objective can be approximated as:

$$\begin{aligned} \phi ^{t} = \sum _{i=1}^{n} [g_{i}f_{t}(a_{i}) + \frac{1}{2}h_{i}f_{t}^{2}(a_{i})] + \Omega (f_{t}) \end{aligned}$$

(24)

where $g_{i}$ and $h_{i}$ are the first- and second-order gradients of l. Let $I_{k} = \{ i \vert p(a_{i})=k \}$ be the set of instances assigned to leaf k. Expanding $\Omega$ and grouping the samples by leaf, the above equation can be written as:

$$\begin{aligned} \phi ^{t}= & {} \sum _{i=1}^{n} [g_{i}f_{t}(a_{i}) + \frac{1}{2}h_{i}f_{t}^{2}(a_{i})] + \lambda _{1} T + \frac{1}{2} \lambda _{2} \sum _{k=1}^{T} w_{k}^{2} \end{aligned}$$

(25)

$$\begin{aligned} \phi ^{t}= & {} \sum _{k=1}^{T} [(\sum _{i \in I_{k}}g_{i})w_{k} + \frac{1}{2} (\sum _{i \in I_{k}} h_{i} + \lambda _{2})w_{k}^{2}] + \lambda _{1} T \end{aligned}$$

(26)
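The minimization over the leaf weights can be made explicit with a short worked step. This is a sketch, not part of the original derivation; it assumes that $\lambda _{2}$ is the coefficient of the squared leaf weights, as in the definition of $\Omega$, and uses the shorthand $G_{k}=\sum _{i \in I_{k}} g_{i}$ and $H_{k}= \sum _{i \in I_{k}} h_{i}$. For a fixed tree structure, each leaf k contributes an independent quadratic in $w_{k}$, so setting its derivative to zero gives:

$$\begin{aligned} \frac{\partial }{\partial w_{k}} \left[ G_{k}w_{k} + \frac{1}{2}(H_{k} + \lambda _{2})w_{k}^{2} \right] = G_{k} + (H_{k} + \lambda _{2})w_{k} = 0 \quad \Rightarrow \quad w_{k} = - \frac{G_{k}}{H_{k} + \lambda _{2}} \end{aligned}$$

Substituting this weight back into the quadratic leaves a contribution of

$$\begin{aligned} - \frac{G_{k}^{2}}{H_{k} + \lambda _{2}} + \frac{1}{2} \frac{G_{k}^{2}}{H_{k} + \lambda _{2}} = - \frac{1}{2} \frac{G_{k}^{2}}{H_{k} + \lambda _{2}} \end{aligned}$$

per leaf, which is a minimum provided $H_{k} + \lambda _{2} > 0$.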

The optimal weight $w^{*}_{k}$ of leaf k for a fixed structure p(a) is expressed as:

$$\begin{aligned} w^{*}_{k}= - \frac{G_{k}}{H_{k} + \lambda _{2}} \end{aligned}$$

(27)

and the corresponding optimal value of the objective can be expressed as:

$$\begin{aligned} \phi ^{*}= - \frac{1}{2} \sum _{k=1}^{T} \frac{G_{k}^{2}}{H_{k} + \lambda _{2}} + \lambda _{1}T \end{aligned}$$

(28)

where $G_{k}=\sum _{i \in I_{k}} g_{i}$, $H_{k}= \sum _{i \in I_{k}} h_{i}$, and $\phi ^{*}$ is the scoring function for the tree structure, where a smaller value indicates a better tree structure. The first- and second-order gradient statistics on each leaf need to be summed to obtain the overall score before the scoring algorithm is applied.

The optimal split-finding algorithm and the loss reduction are similar to the ideas in Chen and Guestrin (2016b).
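The leaf-weight and structure-score formulas above can be checked numerically with a minimal Python sketch. It is illustrative only and is not the authors' code: it assumes a squared-error loss $l = (b_{i} - b_{i}^{'})^{2}$, takes $\lambda _{1}$ as the per-leaf penalty and $\lambda _{2}$ as the coefficient of the squared leaf weights (as in the definition of $\Omega$), and uses hypothetical function names and sample values.

```python
from collections import defaultdict


def leaf_statistics(targets, previous_predictions, leaf_index):
    """Sum the gradients g_i and h_i over the instances of each leaf (G_k, H_k)."""
    G = defaultdict(float)
    H = defaultdict(float)
    for b, b_prev, k in zip(targets, previous_predictions, leaf_index):
        g = 2.0 * (b_prev - b)  # first derivative of (b - b')^2 with respect to b'
        h = 2.0                 # second derivative of (b - b')^2 with respect to b'
        G[k] += g
        H[k] += h
    return dict(G), dict(H)


def optimal_leaf_weights(G, H, lambda_2):
    """Optimal weight of each leaf: w*_k = -G_k / (H_k + lambda_2)."""
    return {k: -G[k] / (H[k] + lambda_2) for k in G}


def structure_score(G, H, lambda_1, lambda_2):
    """Score of a fixed tree structure; a smaller value means a better structure."""
    T = len(G)  # number of leaves
    fit = sum(G[k] ** 2 / (H[k] + lambda_2) for k in G)
    return -0.5 * fit + lambda_1 * T


# Hypothetical data: four samples, previous prediction 0, two leaves.
targets = [1.0, 2.0, 4.0, 5.0]
previous = [0.0, 0.0, 0.0, 0.0]
leaves = [0, 0, 1, 1]  # p(a_i): leaf index of each sample

G, H = leaf_statistics(targets, previous, leaves)
print(optimal_leaf_weights(G, H, lambda_2=1.0))
print(structure_score(G, H, lambda_1=0.5, lambda_2=1.0))
```

With $\lambda _{2} = 0$ and this loss, each leaf weight reduces to the mean residual of the samples in that leaf; a positive $\lambda _{2}$ shrinks the weights toward zero.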

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Majumdar, P., Bhattacharya, D., Mitra, S. et al. Demand prediction of rice growth stage-wise irrigation water requirement and fertilizer using Bayesian genetic algorithm and random forest for yield enhancement. Paddy Water Environ 21, 275–293 (2023). https://doi.org/10.1007/s10333-023-00930-0

Received: 11 August 2022
Revised: 14 January 2023
Accepted: 06 February 2023
Published: 28 February 2023
Issue Date: April 2023
DOI: https://doi.org/10.1007/s10333-023-00930-0
