Abstract
This study conducts an in-depth exploration of the efficacy of deep learning and ensemble learning techniques for slope-unit-based landslide susceptibility prediction within the context of the Three Gorges Reservoir area in China, with a specific focus on Wanzhou District. Leveraging a dataset comprising twelve distinct landslide factors and 1909 Slope Units, the research evaluates three deep learning models (Long Short-Term Memory, Recurrent Neural Network, and Gated Recurrent Unit) as well as three ensemble learning models (LightGBM (LGBM), Extra Trees, and Random Forest) using five performance metrics. Central to this endeavor is the adept utilization of remote sensing technology, including Landsat 8 OLI images, Digital Elevation Model (DEM) data, and Google Earth Pro images. The Landsat 8 OLI images offer a panoramic view of the study area, capturing essential landscape features and variations. The DEM data, providing detailed elevation information, empowers the analysis of terrain morphology crucial for landslide susceptibility assessment. The findings conclusively showcase that ensemble learning models harnessed in this study, augmented by the integration of diverse remote sensing data, exhibit exceptional predictive capabilities in accurately anticipating landslide susceptibility. These models outperform their deep learning model counterparts, attributing their success to the multi-faceted insights derived from the synergy between remote sensing imagery and advanced machine learning algorithms. The ensemble models’ enhanced performance metrics, such as F1-score, recall, precision, and area under the curve (AUC) values, underscore their potential utility in real-world landslide prediction scenarios. Especially noteworthy among the ensemble models is LGBM, which emerges as the most promising candidate with the highest F1-score (0.561) and Recall (0.524), indicating that the LGBM model possesses a more robust predictive capability for landslide samples. In-depth interpretability analysis using SHapley Additive exPlanations (SHAP) values and Partial Dependence Plots (PDP) assessments delves into the mechanics of LGBM’s predictive prowess. This analysis, reliant on remote sensing data, provides clarity into the contributions of various evaluation factors, emphasizing the roles of attributes such as proximity to the river, rainfall, and elevation. The correlation patterns revealed between these factors and landslide susceptibility add layers of understanding, while the intricate interplay of distance to the river unveils the complex interactions between geological and climatic variables.
Similar content being viewed by others
Data availability
For the data used in this paper, please contact the corresponding author.
Abbreviations
- AUC:
-
Area under ROC curve
- DEM:
-
Digital elevation model
- ET:
-
Extra trees
- GIS:
-
Geographic information science
- GRU:
-
Gated recurrent unit
- KNN:
-
K-nearest neighbors
- LGBM:
-
LightGBM
- LPI:
-
Landslide Prediction Index
- LR:
-
Logistic regression
- LSP:
-
Landslide susceptibility prediction
- LSTM:
-
Long short-term memory
- PDP:
-
Partial dependence plots
- RF:
-
Random forest
- RNN:
-
Recurrent neural network
- SHAP:
-
SHapley additive exPlanations
- ROC:
-
Receiver operating characteristic curve
- RS:
-
Remote sensing
- SU:
-
Slope unit
References
Abbas F, Zhang F, Abbas F et al (2023) Landslide susceptibility mapping: analysis of different feature selection techniques with artificial neural network tuned by bayesian and metaheuristic algorithms. Remote Sens. https://doi.org/10.20944/preprints202307.1467.v1
Aditian A, Kubota T, Shinohara Y (2018) Comparison of GIS-based landslide susceptibility models using frequency ratio, logistic regression, and artificial neural network in a tertiary region of Ambon, Indonesia. Geomorphology 318:101–111. https://doi.org/10.1016/j.geomorph.2018.06.006
Aksoy B, Ercanoglu M (2012) Landslide identification and classification by object-based image analysis and fuzzy logic: an example from the Azdavay region (Kastamonu, Turkey). Comput Geosci 38:87–98. https://doi.org/10.1016/j.cageo.2011.05.010
Al-Najjar HAH, Pradhan B, Beydoun G et al (2023) A novel method using explainable artificial intelligence (XAI)-based Shapley additive explanations for spatial landslide prediction using time-series SAR dataset. Gondwana Res 123:107–124. https://doi.org/10.1016/j.gr.2022.08.004
Anbalagan R, Kumar R, Lakshmanan K et al (2015) Landslide hazard zonation mapping using frequency ratio and fuzzy logic approach, a case study of Lachung Valley, Sikkim. Geoenvironmental Dis. https://doi.org/10.1186/s40677-014-0009-y
Arabameri A, Karimi-Sangchini E, Pal SC et al (2020) Novel credal decision tree-based ensemble approaches for predicting the landslide susceptibility. Remote Sens 12:3389. https://doi.org/10.3390/rs12203389
Austin NJ, Muller JP, Gong L, Zhang J (2013) A regional investigation of urban land-use change for potential landslide hazard assessment in the Three Gorges Reservoir Area, People’s Republic of China: Zigui to Wanzhou. Int J Remote Sens 34:2983–3011. https://doi.org/10.1080/01431161.2012.744528
Baeza C, Lantada N, Amorim S (2016) Statistical and spatial analysis of landslide susceptibility maps with different classification systems. Environ Earth Sci. https://doi.org/10.1007/s12665-016-6124-1
Barredo J, Benavides A, Hervás J, van Westen CJ (2000) Comparing heuristic landslide hazard assessment techniques using GIS in the Tirajana basin, Gran Canaria Island, Spain. Int J Appl Earth Obs Geoinf 2:9–23. https://doi.org/10.1016/s0303-2434(00)85022-9
Baum RL, Savage WZ, Godt JW (2002) Trigr-A Fortran program for transient rainfall infiltration and grid-based regional slope-stability analysis. Open File Rep. https://doi.org/10.3133/ofr02424
Bragagnolo L, Silva RVD, Grzybowski JMV (2020) Artificial neural network ensembles applied to the mapping of landslide susceptibility. CATENA 184:104240. https://doi.org/10.1016/j.catena.2019.104240
Breiman L (2001) Random forests. Mach Learn 45:5–32. https://doi.org/10.1023/a:1010933404324
Bui DT, Tuan TA, Klempe H et al (2016) Spatial prediction models for shallow landslide hazards: a comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 13:361–378. https://doi.org/10.1007/s10346-015-0557-6
Caleca F, Tofani V, Segoni S et al (2022) A methodological approach of QRA for slow-moving landslides at a regional scale. Landslides 19:1539–1561. https://doi.org/10.1007/s10346-022-01875-x
Camilo DC, Lombardo L, Mai PM et al (2017) Handling high predictor dimensionality in slope-unit-based landslide susceptibility models through LASSO-penalized Generalized Linear Model. Environ Model Softw 97:145–156. https://doi.org/10.1016/j.envsoft.2017.08.003
Chakrabortty R, Pal SC, Roy P et al (2022) Novel ensemble approach for landslide susceptibility index assessment in a mountainous environment of India. Geocarto Int 37:13311–13336. https://doi.org/10.1080/10106049.2022.2076924
Chen C, Fan L (2023) An attribution deep learning interpretation model for landslide susceptibility mapping in the three gorges reservoir area. IEEE Trans Geosci Remote Sens 61:1–15. https://doi.org/10.1109/tgrs.2023.3323668
Chen T, Niu R, Jia X (2016) A comparison of information value and logistic regression models in landslide susceptibility mapping by using GIS. Environ Earth Sci 75:867. https://doi.org/10.1007/s12665-016-5317-y
Chen W, Pourghasemi HR, Zhao Z (2017) A GIS-based comparative study of Dempster-Shafer, logistic regression and artificial neural network models for landslide susceptibility mapping. Geocarto Int 32:367–385. https://doi.org/10.1080/10106049.2016.1140824
Chen W, Peng J, Hong H et al (2018) Landslide susceptibility modelling using GIS-based machine learning techniques for Chongren County, Jiangxi Province, China. Sci Total Environ 626:1121–1135. https://doi.org/10.1016/j.scitotenv.2018.01.124
Cho K, van Merrienboer B, Gulcehre C, et al (2014) Learning phrase representations using RNN Encoder–Decoder for statistical machine translation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics
Chowdhuri I, Pal SC, Arabameri A et al (2020a) Ensemble approach to develop landslide susceptibility map in landslide dominated Sikkim Himalayan region, India. Environ Earth Sci. https://doi.org/10.1007/s12665-020-09227-5
Chowdhuri I, Roy P, Chakrabortty R et al (2020b) Development of hybrid computational approaches for landslide susceptibility mapping using remotely sensed data in east Sikkim, India. In: Das H, Rout JK, Moharana SC, Dey N (Eds) Applied intelligent decision making in machine learning. CRC Press, Boca Raton, pp 71–92. https://doi.org/10.1201/9781003049548
Chowdhuri I, Pal SC, Chakrabortty R et al (2021a) Torrential rainfall-induced landslide susceptibility assessment using machine learning and statistical methods of eastern Himalaya. Nat Hazards 107:697–722. https://doi.org/10.1007/s11069-021-04601-3
Chowdhuri I, Pal SC, Chakrabortty R et al (2021b) Spatial prediction of landslide susceptibility using projected storm rainfall and land use in Himalayan region. Bull Eng Geol Environ 80:5237–5258. https://doi.org/10.1007/s10064-021-02252-z
Corominas J, van Westen C, Frattini P et al (2013) Recommendations for the quantitative analysis of landslide risk. Bull Eng Geol Env. https://doi.org/10.1007/s10064-013-0538-8
Dahal A, Lombardo L (2023) Explainable artificial intelligence in geoscience: a glimpse into the future of landslide susceptibility modeling. Comput Geosci 176:105364. https://doi.org/10.1016/j.cageo.2023.105364
Dahal RK, Hasegawa S, Nonomura A et al (2007) GIS-based weights-of-evidence modelling of rainfall-induced landslides in small catchments for landslide susceptibility mapping. Environ Geol 54:311–324. https://doi.org/10.1007/s00254-007-0818-3
Ercanoglu M, Gokceoglu C (2002) Assessment of landslide susceptibility for a landslide-prone area (north of Yenice, NW Turkey) by fuzzy approach. Environ Geol 41:720–730. https://doi.org/10.1007/s00254-001-0454-2
Fang Z, Wang Y, Peng L, Hong H (2020a) A comparative study of heterogeneous ensemble-learning techniques for landslide susceptibility mapping. Int J Geogr Inf Sci 35:321–347. https://doi.org/10.1080/13658816.2020.1808897
Fang Z, Wang Y, Peng L, Hong H (2020b) Integration of convolutional neural network and conventional machine learning classifiers for landslide susceptibility mapping. Comput Geosci 139:104470. https://doi.org/10.1016/j.cageo.2020.104470
Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Ann Stat 29:78. https://doi.org/10.1214/aos/1013203451
Gao K, Cui P, Zhao C, Wei F (2006) Landslide hazard evaluation of Wanzhou based on GIS information value method in the Three Gorges Reservoir. Yanshilixue Yu Gongcheng Xuebao/chin J Rock Mech Eng 25:991–996
Geurts P, Ernst D, Wehenkel L (2006) Extremely randomized trees. Mach Learn 63:3–42. https://doi.org/10.1007/s10994-006-6226-1
Ghorbanzadeh O, Blaschke T, Gholamnia K et al (2019) Evaluation of different machine learning methods and deep-learning convolutional neural networks for landslide detection. Remote Sens 11:196. https://doi.org/10.3390/rs11020196
Guzzetti F, Carrara A, Cardinali M et al (1999) Landslide hazard evaluation; a review of current techniques and their application in a multi-scale study, central Italy. Geomorphology 31:181–216. https://doi.org/10.1016/S0169-555X(99)00078-1
Guzzetti F, Reichenbach P, Ardizzone F et al (2006) Estimating the quality of landslide susceptibility models. Geomorphology 81:166–184. https://doi.org/10.1016/j.geomorph.2006.04.007
Hakim WL, Fadhillah MF, Park S et al (2023) InSAR time-series analysis and susceptibility mapping for land subsidence in Semarang, Indonesia using convolutional neural network and support vector regression. Remote Sens Environ 287:113453. https://doi.org/10.1016/j.rse.2023.113453
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9:1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
Hong H (2023) Assessing landslide susceptibility based on hybrid best-first decision tree with ensemble learning model. Ecol Indic 147:109968. https://doi.org/10.1016/j.ecolind.2023.109968
Hou S, Liu Y, Yang Q (2022) Real-time prediction of rock mass classification based on TBM operation big data and stacking technique of ensemble learning. J Rock Mech Geotech Eng 14:123–143. https://doi.org/10.1016/j.jrmge.2021.05.004
Huang Y, Zhao L (2018) Review on landslide susceptibility mapping using support vector machines. CATENA 165:520–529. https://doi.org/10.1016/j.catena.2018.03.003
Ilanloo M (2011) A comparative study of fuzzy logic approach for landslide susceptibility mapping using GIS: an experience of Karaj dam basin in Iran. Procedia Soc Behav Sci 19:668–676. https://doi.org/10.1016/j.sbspro.2011.05.184
Islam ARMdT, Saha A, Ghose B et al (2021) Landslide susceptibility modeling in a complex mountainous region of Sikkim Himalaya using new hybrid data mining approach. Geocarto Int 37:9021–9046. https://doi.org/10.1080/10106049.2021.2009920
Ji S, Yu D, Shen C et al (2020) Landslide detection from an open satellite imagery and digital elevation model dataset using attention boosted convolutional neural networks. Landslides 17:1337–1352. https://doi.org/10.1007/s10346-020-01353-2
Kadavi P, Lee C-W, Lee S (2018) Application of ensemble-based machine learning models to landslide susceptibility mapping. Remote Sens 10:1252. https://doi.org/10.3390/rs10081252
Kayastha (2012) Application of fuzzy logic approach for landslide susceptibility mapping in Garuwa sub-basin, East Nepal. Front Earth Sci 6:420–432. https://doi.org/10.1007/s11707-012-0337-8
Kayastha P, Dhital MR, Smedt FD (2013) Application of the analytical hierarchy process (AHP) for landslide susceptibility mapping: a case study from the Tinau watershed, west Nepal. Comput Geosci 52:398–408. https://doi.org/10.1016/j.cageo.2012.11.003
Ke G, Meng Q, Finley T, et al (2017) Lightgbm: A highly efficient gradient boosting decision tree. In: Advances in Neural Information Processing Systems. Long Beach, CA, USA, pp 3146–3154
Kumar R, Anbalagan R (2015) Landslide susceptibility zonation in part of Tehri reservoir region using frequency ratio, fuzzy logic and GIS. J Earth Syst Sci 124:431–448. https://doi.org/10.1007/s12040-015-0536-2
Lima P, Steger S, Glade T, Murillo-García FG (2022) Literature review and bibliometric analysis on data-driven assessment of landslide susceptibility. J Mt Sci 19:1670–1698. https://doi.org/10.1007/s11629-021-7254-9
Lundberg SM, Lee S-I (2017) A unified approach to interpreting model predictions. In: Advances in neural information processing systems. Long Beach, CA, USA
Merghadi A, Yunus AP, Dou J et al (2020) Machine learning methods for landslide susceptibility studies: a comparative overview of algorithm performance. Earth Sci Rev 207:103225. https://doi.org/10.1016/j.earscirev.2020.103225
Mohammady M, Pourghasemi HR, Pradhan B (2012) Landslide susceptibility mapping at Golestan Province, Iran: a comparison between frequency ratio, Dempster-Shafer, and weights-of-evidence models. J Asian Earth Sci 61:221–236. https://doi.org/10.1016/j.jseaes.2012.10.005
Montgomery DR, Dietrich WE (1994) A physically based model for the topographic control on shallow landsliding. Water Resour Res 30:1153–1171. https://doi.org/10.1029/93WR02979
Ozdemir A, Altural T (2013) A comparative study of frequency ratio, weights of evidence and logistic regression methods for landslide susceptibility mapping: Sultan Mountains, SW Turkey. J Asian Earth Sci 64:180–197. https://doi.org/10.1016/j.jseaes.2012.12.014
Pack RT (1998) The SINMAP approach to terrain stability mapping. Congress of the International Association of Engineering Geology, pp 21–25
Pal SC, Chowdhuri I (2019) GIS-based spatial prediction of landslide susceptibility using frequency ratio model of Lachung River basin, North Sikkim, India. SN Appl Sci. https://doi.org/10.1007/s42452-019-0422-7
Peng L, Niu R, Huang B et al (2014) Landslide susceptibility mapping based on rough set theory and support vector machines: a case of the Three Gorges area, China. Geomorphology 204:287–301. https://doi.org/10.1016/j.geomorph.2013.08.013
Pourghasemi HR, Pradhan B, Gokceoglu C (2012) Application of fuzzy logic and analytical hierarchy process (AHP) to landslide susceptibility mapping at Haraz watershed, Iran. Nat Hazards 63:965–996. https://doi.org/10.1007/s11069-012-0217-2
Pourghasemi HR, TeimooriYansari Z, Panagos P, Pradhan B (2018) Analysis and evaluation of landslide susceptibility: a review on articles published during 2005–2016 (periods of 2005–2012 and 2013–2016). Arab J Geosci 11:193. https://doi.org/10.1007/s12517-018-3531-5
Pradhan B, Dikshit A, Lee S, Kim H (2023) An explainable AI (XAI) model for landslide susceptibility modeling. Appl Soft Comput 142:110324. https://doi.org/10.1016/j.asoc.2023.110324
Reichenbach P, Rossi M, Malamud BD et al (2018) A review of statistically-based landslide susceptibility models. Earth Sci Rev 180:60–91. https://doi.org/10.1016/j.earscirev.2018.03.001
Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors. Nature 323:533–536. https://doi.org/10.1038/323533a0
Saha A, Pal SC, Chowdhuri I et al (2022) Understanding the scale effects of topographical variables on landslide susceptibility mapping in Sikkim Himalaya using deep learning approaches. Geocarto Int 37:17826–17852. https://doi.org/10.1080/10106049.2022.2136255
Song Y, Niu R, Xu S et al (2018) Landslide susceptibility mapping based on weighted gradient boosting decision tree in Wanzhou Section of the Three Gorges Reservoir Area (China). Isprs Int J Geo Inf 8:4. https://doi.org/10.3390/ijgi8010004
Tanyas H, Rossi M, Alvioli M et al (2019) A global slope unit-based method for the near real-time prediction of earthquake-induced landslides. Geomorphology 327:126–146. https://doi.org/10.1016/j.geomorph.2018.10.022
Tien Bui D, Pradhan B, Lofman O, Revhaug I (2012) Landslide susceptibility assessment in Vietnam using support vector machines, decision tree, and Naïve Bayes models. Math Probl Eng 2012:1–26. https://doi.org/10.1155/2012/974638
Wang J, Yin K, Xiao L (2014) Landslide susceptibility assessment based on GIS and weighted information value: a case study of Wanzhou district, Three Gorges Reservoir. Chin J Rock Mech Eng 33:797–808
Wang Y, Fang Z, Wang M et al (2020) Comparative study of landslide susceptibility mapping with different recurrent neural networks. Comput Geosci 138:104445. https://doi.org/10.1016/j.cageo.2020.104445
Yoshimatsu H, Abe S (2006) A review of landslide hazards in Japan and assessment of their susceptibility using an analytical hierarchic process (AHP) method. Landslides 3:149–158. https://doi.org/10.1007/s10346-005-0031-y
Yu X, Wang Y, Niu R, Hu Y (2016) A combination of geographically weighted regression, particle swarm optimization and support vector machine for landslide susceptibility mapping: a case study at Wanzhou in the Three Gorges Area, China. Int J Environ Res Public Health 13:487. https://doi.org/10.3390/ijerph13050487
Zhang H, Song Y, Xu S et al (2022) Combining a class-weighted algorithm and machine learning models in landslide susceptibility mapping: a case study of Wanzhou section of the Three Gorges Reservoir, China. Comput Geosci 158:104966. https://doi.org/10.1016/j.cageo.2021.104966
Zhao Z, Liu Z, Xu C (2021) Slope unit-based landslide susceptibility mapping using certainty factor, support vector machine, random forest, CF-SVM and CF-RF models. Front Earth Sci 9:78. https://doi.org/10.3389/feart.2021.589630
Zhu AX, Wang R, Qiao J et al (2014) An expert knowledge-based approach to landslide susceptibility mapping using GIS and fuzzy logic. Geomorphology 214:128–138. https://doi.org/10.1016/j.geomorph.2014.02.003
Acknowledgements
We would like to express our sincere gratitude to the funding agencies that supported this research. Specifically, we acknowledge the financial support from Project Digital frequency spectrum analysis and mineralization precise prediction for continental supergene U-Re (No.41872243), Open Fund from Engineering Research Center for Seismic Disaster Prevention and Engineering Geological Disaster Detection of Jiangxi Province (No.SDGD202203), and Open Fund from Key Laboratory for Digital Land and Resources of Jiangxi Province, East China University of Technology (No.DLLJ202204).
Funding
This work was jointly funded by Project Digital frequency spectrum analysis and mineralization precise prediction for continental supergene U-Re (no. 41872243), Open Fund from Engineering Research Center for Seismic Disaster Prevention and Engineering Geological Disaster Detection of Jiangxi Province (no. SDGD202203), and Open Fund from Key Laboratory for Digital Land and Resources of Jiangxi Province, East China University of Technology (no. DLLJ202204); Hubei Geological Bureau Science and Technology Project (no. KJ2023-18); Key Research and Development Program of Hubei Province (no. 2021BCA219).
Author information
Authors and Affiliations
Contributions
Conceptualization, Yingxu Song, Yueshun He and Weicheng Wu; methodology, Yingxu Song, Yuan Li; software, Yujia Zou, Ye Liang; validation, Shiluo Xu; formal analysis, Yueshun He; investigation, Xianyu Yu; resources, Shiluo Xu; writing—original draft preparation, Yingxu Song and Yuan Li; writing—review and editing, Yuan Li; visualization, Yuan Li; supervision, Weicheng Wu; project administration, Yingxu Song. All authors have read and agreed to the published version of the manuscript.
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that there is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Song, Y., Li, Y., Zou, Y. et al. Synergizing multiple machine learning techniques and remote sensing for advanced landslide susceptibility assessment: a case study in the Three Gorges Reservoir Area. Environ Earth Sci 83, 227 (2024). https://doi.org/10.1007/s12665-024-11521-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12665-024-11521-5