Skip to main content

Advertisement

Log in

Mining of soil data for predicting the paddy productivity by machine learning techniques

  • Article
  • Published:
Paddy and Water Environment Aims and scope Submit manuscript

Abstract

Crop yield prediction is a challenging task towards precision agriculture. In particular, paddy is one of the world’s significant cereal crops and thus crucial for crop management and decision making. Despite the number of crop yield prediction models, better performance in paddy yield prediction is still desirable. Keeping this in mind, the present study aimed to determine the most influencing features that impact paddy production. We employed a machine learning algorithm alongside the best data sources for paddy yield prediction in this study. A total of 5 regression machine learning algorithms were developed using the 16 input variables obtained from the soil health card. Note that we have carried out multiple approaches to improving the model performances. The model results were also validated using Monte Carlo methods. The result from our analysis depicts that XG boost ensembled random forest has demonstrated the highest prediction accuracy of 86% of the other models investigated in our study. It is worth mentioning that this is the first study on paddy crop yield prediction from the features of a soil health card. Indeed, farmers and agronomists could use this model to plan their paddy cultivation and procure the maximum yield.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

Download references

Acknowledgements

The authors thank VIT management for providing the facility for carrying out this research work.

Funding

The author(s) reported there is no funding associated with the work featured in this article.

Author information

Authors and Affiliations

Authors

Contributions

KR conceived and planned the present study. AA carried out the experiments. KR contributed to the interpretation of the results. AA wrote the first draft of the manuscript. KR provided critical feedback and helped to draft the manuscript and supervised the entire study. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Ramanathan Karuppasamy.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interests.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Antony, A., Karuppasamy, R. Mining of soil data for predicting the paddy productivity by machine learning techniques. Paddy Water Environ 21, 231–242 (2023). https://doi.org/10.1007/s10333-023-00924-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10333-023-00924-y

Keywords

Navigation