Assessment of large-scale multiple forest disturbance susceptibilities with AutoML framework: an Izmir Regional Forest Directorate case

Eker, Remzi; Alkiş, Kamber Can; Aydın, Abdurrahim

doi:10.1007/s11676-024-01723-9

Assessment of large-scale multiple forest disturbance susceptibilities with AutoML framework: an Izmir Regional Forest Directorate case

Original Paper
Open access
Published: 02 April 2024

Volume 35, article number 65, (2024)
Cite this article

Download PDF

You have full access to this open access article

Journal of Forestry Research Aims and scope Submit manuscript

Assessment of large-scale multiple forest disturbance susceptibilities with AutoML framework: an Izmir Regional Forest Directorate case

Download PDF

260 Accesses
3 Altmetric
Explore all metrics

Abstract

Disturbances such as forest fires, intense winds, and insect damage exert strong impacts on forest ecosystems by shaping their structure and growth dynamics, with contributions from climate change. Consequently, there is a need for reliable and operational methods to monitor and map these disturbances for the development of suitable management strategies. While susceptibility assessment using machine learning methods has increased, most studies have focused on a single disturbance. Moreover, there has been limited exploration of the use of “Automated Machine Learning (AutoML)” in the literature. In this study, susceptibility assessment for multiple forest disturbances (fires, insect damage, and wind damage) was conducted using the PyCaret AutoML framework in the Izmir Regional Forest Directorate (RFD) in Turkey. The AutoML framework compared 14 machine learning algorithms and ranked the best models based on AUC (area under the curve) values. The extra tree classifier (ET) algorithm was selected for modeling the susceptibility of each disturbance due to its good performance (AUC values > 0.98). The study evaluated susceptibilities for both individual and multiple disturbances, creating a total of four susceptibility maps using fifteen driving factors in the assessment. According to the results, 82.5% of forested areas in the Izmir RFD are susceptible to multiple disturbances at high and very high levels. Additionally, a potential forest disturbances map was created, revealing that 15.6% of forested areas in the Izmir RFD may experience no damage from the disturbances considered, while 54.2% could face damage from all three disturbances. The SHAP (Shapley Additive exPlanations) methodology was applied to evaluate the importance of features on prediction and the nonlinear relationship between explanatory features and susceptibility to disturbance.

Urban flood susceptibility mapping using frequency ratio and multiple decision tree-based machine learning models

Article 23 April 2024

Strengthening Forest Landscape Restoration through Understanding Land Use Dynamics: Case Study of Tchamba Prefecture (Togo)

Article Open access 20 April 2024

In temperate Europe, fire is already here: The case of The Netherlands

Article Open access 05 February 2024

Introduction

Forests have been utilized by humans for centuries, leading to significant transformations in species compositions, structure, and the disturbance regimes of these ecosystems (Bebi et al 2017; Senf and Seidl 2021). Forest disturbance is defined as a short-term event causing crown cover or biomass loss (Hirschmugl et al 2017). The Food and Agriculture Organization of the United Nations (FAO) characterizes it as "damage caused by any factor (biotic or abiotic) that adversely affects the vigour and productivity of the forest and is not a direct result of human activities" (FRA 2020). Biotic disturbances, such as drought, forest fires, insect epidemics, windthrow, and hurricanes, as well as abiotic disturbances like illegal logging, harvesting operations, and artificial afforestation, can exert profound impacts on forests. These disturbances shape the structure and dynamics of forests, interacting with climate and other factors, including human land use (He and Mladenoff 1999; Turner 2010; Senf and Seidl 2021). Nepstad et al (2008) pointed out that climate change and the growing demand for wood products are increasing the rates of forest disturbance. Additionally, Lindenmayer et al (2004) emphasized that forest management practices can alter biological legacies and landscape structure, with potential feedback on subsequent disturbances. Therefore, gaining detailed information on forest disturbances and understanding changes in their regimes becomes crucial for the development of appropriate land management strategies. Consequently, the mapping of forest disturbances becomes even more critical, necessitating the use of reliable, operational methods for sustainable forest management (European Commission 2013; Chen et al 2021).

Numerous approaches have been developed to monitor and map forest disturbances, and one effective method is the analysis of susceptibility, which quantifies the probability of a specific phenomenon (e.g., landslides, forest fires) occurring under defined conditions (Li et al 2017). Susceptibility assessments typically utilize two quantitative methods: "knowledge base methods" and "data-based methods" (Corominas et al 2014). Alternatively, these methods can be categorized into heuristic, classical statistics, and machine learning methods (Reichenbach et al 2018; Bruzón et al 2021). The fundamental premise of susceptibility assessment is grounded in the idea that an event is more likely to occur under conditions similar to those that led to past events (Tien Bui et al 2012). While forest fire susceptibility has been extensively studied in the literature (Dolui 2023; Kantarcioglu et al 2023; Rihan et al 2023), other disturbances like severe windstorms (Garamszegl et al. 2022) and insect outbreaks (Sivrikaya et al 2023) have received limited attention. Beyond specific disturbances, susceptibility assessments have been applied to various subjects related to forests. For instance, Batar et al (2021) proposed a novel approach—weight of evidence—for mapping and assessing forest fragmentation susceptibility in the Indian Himalayan Region. Sahana et al (2018) assessed susceptibility to deforestation using a fragmentation approach and the frequency ratio model. Shabani et al. (2020) mapped forest stand susceptibility during harvesting using logistic regression and boosted regression tree machine learning models.

In susceptibility mapping studies related to forestry, the focus has traditionally been on single phenomena, such as forest fires, windthrow, or insect outbreaks. However, in recent studies, there has been a shift towards assessing susceptibility to multiple phenomena, primarily centred around various hazards. For instance, Guerriero et al (2022) conducted a comprehensive multi-hazard susceptibility assessment using the analytic hierarchy process in the Derwent Valley Mills UNESCO World Heritage Site (United Kingdom). Their study encompassed surface flooding, groundwater flooding, compressible ground, landslides, and running stages as hazardous events. Similarly, Ullah et al (2022) employed convolutional neural networks to map multi-hazard susceptibility, considering flash floods, debris flow, and landslides. Rusk et al (2022) utilized the maximum entropy (MaxEnt) machine learning method to map multi-hazard susceptibility in the Hindu Kush Himalaya. Pourghasemi et al (2020) adopted the random forest machine learning method to conduct a multi-hazard probability assessment in Fars Province (Shiraz City, Iran), encompassing floods, landslides, and forest fires. In Ankara City (Turkey), Yanar et al (2020) applied the Mamdani fuzzy algorithm to assess multi-hazard susceptibility, focusing on floods and landslides. Regarding forest disturbances, the literature has limited studies in this domain. A recent noteworthy study by Stritih et al (2021) assessed the susceptibility of forests to various natural disturbances, including avalanches, bark beetles, fires, heavy snow, windthrow, and harvests in Graubünden, Switzerland. This study utilized the random forest machine learning method.

It appears that the utilization of machine learning methods in susceptibility assessment is on a trajectory of continual growth across various research domains. However, the success of developing models is critically dependent on the expertise of machine learning specialists. In traditional machine learning, models are crafted by handling each step in the process separately. In simpler terms, the efficacy of machine learning models relies on the collaboration between the right tools and the right expertise (Özdemir and Örslü 2019). Developing machine learning models presents technical complexities that extend beyond the grasp of non-experts in machine learning (Waring et al 2020). This technical complexity is attributed to various steps in the process, including: (1) pre-processing and cleaning data, (2) selecting and constructing appropriate features, (3) choosing an appropriate model family, (4) optimizing model hyperparameters, (5) designing the topology of neural networks if deep learning is employed, (6) post-processing machine learning models, and (7) analyzing obtained results (Hutter et al 2019; Waring et al 2020; URL 2023). The tasks of finding suitable models and optimizing hyperparameters prove challenging and time-consuming for human experts (Halvari et al. 2020). The heightened demand for machine learning methods that are readily applicable without the need for expert knowledge has led to the concept of automated machine learning (AutoML). AutoML is defined as meta-level machine learning algorithms functioning as a platform or open-source library for automating each step in the machine learning process (Hutter et al 2019; Halvari et al 2020).

The utilization of the AutoML framework in susceptibility assessment is relatively scarce in the existing literature, as noted by Bruzón et al (2021) and Renza et al (2021). A recent study in the field of forestry, conducted by Ren et al (2022), delved into the spatiotemporal assessment of forest fire vulnerability. Building on this, this study aims to assess forest disturbance susceptibility employing the AutoML framework, specifically utilizing Python PyCaret. In this study, three distinct forest disturbances— forest fires, insect damage, and wind damage—were considered. The susceptibility assessment was conducted individually for each disturbance, as well as collectively for multiple disturbances. Additionally, a comprehensive map illustrating the potential occurrences of forest disturbances was generated.

Material and methods

Study area

The study area selected for this research is the Izmir Regional Forest Directorate (RFD), one of 28 forest regional directorates in Turkey. Situated in the northern longitude range of 39°35ʹ38"–37°52ʹ30" and eastern latitudes of 26°11ʹ42"–28°52ʹ28"(Fig. 1), Izmir RFD encompasses almost one-third of the Aegean region of Turkey, including the cities of Izmir and Manisa. The region's forests cover an area of 2,509,336 ha, approximately 46% of the total area. The primary forest species in this region are Turkish pine (Pinus brutia Ten.) and black pine (Pinus nigra Arn.) (Kucukosmanoglu and Uzmez 2019). The distribution of forest types across the region is shown in Table 1. Elevation within the Izmir RFD varies from sea level to 2148 m, featuring slopes ranging from zero to 90%. The region is crisscrossed by a network of roads spanning 40,948 km, with a density up to 10 m/km². The region has a typical Mediterranean climate characterized by warm and rainy winters and hot, dry summers. While the coastal areas more directly experience the Mediterranean climate, the interior part of the region, influenced by local topography, also reflects these climatic patterns. In Izmir RFD, winter precipitation is predominantly rain, and the annual precipitation fluctuates between 700 and 1000 mm. The hottest months are July and August.

Table 1 Distribution of forest types in Izmir RFD

Full size table

Situation of forest disturbances in the Izmir RFD

The Izmir RFD is one of the more vulnerable forest regions to various disturbances in Turkey, with a particular emphasis on forest fires (Akyuz and Kucukosmanoglu 1997; Kucukosmanoglu and Uzmez 2019). According to forest fire statistics from the General Directorate of Forestry (GDF) for 2022, 23,938 ha were burned between 2004 and 2022 (GDF 2023). The annual breakdown of burned areas in Izmir is shown in Fig. 2. In contrast to the availability of statistics on forest fires, there is a lack of published data concerning wind and insect damage. The examination of forest disturbances in this study extended to an investigation of official reports accessible in the Izmir RFD archives covering the years 2010–2021. These records provided valuable details, including the extent of salvage logging for each disturbance (fire, insect, wind), as well as information on forest stand number and type, and the name of the forest district. Based on these records, it is evident that four main insect species—Tomicus destruens Woll. (Coleoptera: Scolytinae), Orthotomicus erosus Col. (Curculionidae: Scolytinae), Thaumetopoea pityocampa (Den. & Schiff.) (Lepidoptera: Thaumetopoeidae), and Thaumetopoea wilkinsoni Tams. (Lepidoptera: Thaumatopoeidae)—have predominantly contributed to damage in the region's forests. Sarıkaya et al. (2013) emphasized that bark beetles (Scolytinae) are particularly impactful defoliators in Izmir province, with the Mediterranean pine engraver, Orthotomicus erosus Woll. ranking high among Scolytinae species, causing significant annual economic losses.

Figure 3 illustrates the volumes of salvage logging attributed to forest fires, insect damage, and wind damage for each year. Examining the records of fire damage from 2010 to 2021, a 12-year period, reveals a discernible increasing trend despite occasional fluctuations in salvage volumes. The peak of salvage volume due to fire damage was in 2019, reaching 506,698 m³. Similarly, both insect and wind damages exhibited an upward trajectory during this 12-year period, accompanied by intermittent fluctuations in salvage volumes. The highest salvage volume attributed to insect damage occurred in 2018, totalling 26,625 m³. For wind damage, the peak volume was 23,686 m³ in 2015.

Dataset of forest disturbances

This study focused on three forest disturbances—forest fires, insect damage, and wind damage—that transpired between 2010 and 2021. Official reports from the Izmir RFD documenting each disturbance were incorporated into the GIS environment by vectorizing them as point layers. These points are referred to as "presence data (1)" (Fig. 4). Due to the absence of precise location information in the reports, presence points were digitized randomly within the damage area associated with each disturbance. The reports only provided details on forest stand numbers and the type of damage areas within the relevant forest districts. The vectorization process was executed on a base map of forest stand types obtained from the General Directorate of Forestry (GDF) using ESRI ArcGIS software. Consequently, the total number of points varied based on the affected area's size: 4347 points for insect damage, 1217 points for wind damage, and 54,593 points for forest fire damage in Izmir RFD.

Forest disturbances susceptibility assessment with an AutoML framework

The susceptibility assessment was conducted using PyCaret 3.0 (https://pycaret.org/), an AutoML framework. PyCaret is an open-source, low-code machine-learning library in Python designed to automate machine-learning workflows (https://pycaret.gitbook.io/docs/). It was selected for susceptibility assessment due to its capability to facilitate complex machine learning tasks, allowing data scientists to efficiently conduct end-to-end experiments. The workflow is illustrated in Fig. 5. The application of Python AutoML (PyCaret 3.0) was executed within the Jupyter Notebook environment, a web application designed for creating and sharing computational documents (https://jupyter.org/). This environment provided a versatile platform for implementing and documenting the susceptibility assessment workflow.

In the data preparation step, alongside the presence point data for each forest disturbance, a set of driving factors (or features)—categorized as topographic, structural, climatic, and stand features—were identified based on the literature. A total of 15 driving factors were considered: elevation, slope, aspect, topographic wetness index (TWI), distances to roads, streams, agricultural areas, and settlements, road density, stand types, degree of canopy closure, stand development stages, total annual precipitation, mean temperature of the warmest quarter, and maximum wind speed. All driving factors were created in raster data format with a 50 m × 50 m pixel size and uniform dimensions (5244 pixels in rows and 3589 pixels in columns). Supplementary Material (Figs. S1 − S15) provides an overview of these driving factors. Topographic features such as elevation, slope, aspect, and TWI were derived from a digital elevation model (DEM) from the GDF. Forest features, including stand types, canopy closure, and stand development stages, were extracted from a forest map from the Izmir RDF. Additionally, road density, distances to roads, streams, agricultural areas, and settlements were created using maps obtained from Izmir RDF. Climatic features were generated from a dataset acquired from worldclim.org (Fick and Hijmans 2017). Of the 15 driving factors, 11 had continuous data, while four (aspect, stand types, canopy closure, stand development stages) were categorical. The values of each driving factor at presence points were extracted and formatted into a CSV file for use in the AutoML process. In the data analysis step, before running the AutoML process, Pearson correlation analysis was performed to assess multicollinearity among driving factors with non-categorical data types. This step was crucial as the strength and direction of the relationship between two factors could impact model performance. The Pearson correlation matrix for each forest disturbance is provided in Supplementary Material (Figs. S16 − S18). The PyCaret AutoML process incorporates multicollinearity testing, where highly correlated factors are assessed to eliminate one from each pair. When the parameter is set to "True," features with inter-correlations surpassing the defined threshold are removed. Specifically, for each group, only the feature with the highest correlation to the y-axis is retained (more details can be found at https://pycaret.gitbook.io/docs/). In this study, a threshold value of 0.90 was set for this parameter concerning each forest disturbance. PyCaret AutoML conducted tests on 14 out of the 17 available machine learning algorithms (Table 2) and subsequently ranked their classification performance based on AUC (area under the curve) values.

Table 2 Machine learning algorithms compared in PyCaret AutoML process

Full size table

The best model based on the highest AUC, was employed to predict forest disturbance susceptibility for all pixels within forested areas. In addition to AUC, key evaluation metrics such as accuracy, recall, precision, F1 score, Kappa, and Matthews correlation coefficient (MCC) were considered for a comprehensive assessment (Wang et al 2021). To calculate these metrics, it is essential to clarify the four kinds of predicted samples for classification learning (Wang et al 2021): (1)True Positive (TP): The predicted class is positive, and the prediction aligns with the actual class; (2) False Positive (FP): The predicted class is positive, but the prediction disagrees with the actual class; (3)True Negative (TN): The predicted class is negative, and the prediction aligns with the actual class; (4) False Negative (FN): The predicted class is negative, but the prediction disagrees with the actual class.

Equations Eqs. 1 − 6 were used to calculate these parameters:

$$Accuracy=\frac{TP+TN}{TP+FP+TN+FN}$$

(1)

$$Recall=\frac{TN}{TN+FN}$$

(2)

$$Precision=\frac{TP}{TP+FP}$$

(3)

$${F}_{1}Score=\frac{2\times Precision\times Recall}{Precision+Recall}$$

(4)

$$Kappa=\frac{2\times (TP TN-FP FN)}{(TP+FP)\times (TP+FN)\times (TN+FP)\times (TN+FN)}$$

(5)

$$MCC=\frac{TP\times TN-FP\times FN}{\sqrt{(TP+FP)\times (TP+FN)\times (TN+FP)\times (TN+FN)}}$$

(6)

These equations provide a quantitative assessment of the model's performance in predicting forest disturbance susceptibility.

In the mapping and assessment step, the predicted results from AutoML were transformed into raster data format to create maps for each disturbance. To generate a multiple forest disturbance susceptibility map, a weighted overlay method was implemented in GIS. In this method, each susceptibility map served as input, with weights assigned as 0.5 for forest fires, 0.2 for wind damage, and 0.3 for insect damage. The resulting susceptibility maps were then classified into five categories: very low (0 − 0.2), low (0.2 − 0.4), moderate (0.4 − 0.6), high (0.6 − 0.8), and very high (0.8 − 1.0). In addition, a potential forest disturbance map was created by employing an overlapping method using the susceptibility map of each disturbance. Prior to the overlapping process in GIS, a binary classification was applied to each susceptibility map based on defined rules: if the susceptibility value was higher than 0.75, the pixel value was set to 1; otherwise, the pixel value was set to 0. The resulting map categorized areas into eight classes: (1) no damage, (2) forest fire damage, (3) wind damage, (4) wind + forest fire damage, (5) insect damage, (6) insect + forest fire damage, (7) wind + insect damage, and (8) wind + insect + forest fire damage. While a susceptibility map represents the vulnerability of an area to specific disturbances, a potential forest disturbances map integrates information from multiple susceptibility maps through an overlapping process, offering a comprehensive visualization of areas prone to various disturbances categorized into distinct classes. Additionally, the SHAP (Shapley Additive exPlanations) methodology (Lundberg and Lee 2017) was used as an innovative tool to illustrate the nonlinear relationship between explanatory features and forest disturbance susceptibility. SHAP values, computed through a game theoretical approach, elucidate the contribution of each feature within a model (Li et al 2020; Bugaj et al 2021). For the SHAP analysis, the "shap" Python library was utilized, and SHAP summary plots were generated to provide feature attribution information for all instances in every model.

Results

Forest disturbance susceptibility assessment

An AutoML framework, specifically PyCaret 3.0, was employed to assess the susceptibility to forest disturbances in Izmir RFD. The assessment considered three primary forest disturbances: forest fires, insect damage, and wind damage. Additionally, a comprehensive analysis of susceptibility to multiple forest disturbances was conducted. After comparing various algorithms using AutoML, the extra trees classifier (ET) algorithm emerged as the best model for susceptibility analysis in each disturbance category. Detailed model performances for each dataset are provided in Supplementary Data (Figs. S19 − S21). The forest disturbance susceptibility maps for each individual disturbance (Figs. 6, 7 and 8) and the map depicting susceptibility to multiple disturbances (Fig. 9) were generated based on the selected ET algorithm. Each susceptibility map was categorized into five classes, and the areal distributions of susceptibility classes in the forested areas are shown in Table 3. Notably, the results reveal that 82.5% of the forests in Izmir RFD are susceptible to multiple disturbances at high and very high levels. Additionally, 88.9% are susceptible to forest fires, primarily falling into high and very high classes. Wind damage susceptibility is observed in 66.8% of forested areas at high and very high levels, while 76.9% show susceptibility to insect damage.

Table 3 Area (hectares and percentage) of susceptibility classes for disturbances in forested areas

Full size table

In the current study, SHAP methodology was used to explain model predictions, providing insights into the contribution of each feature to susceptibility modeling. SHAP summary plots, shown in Fig. 10 for insect damage, Fig. 11 for wind damage, and Fig. 12 for forest fire damage, effectively combine feature importance with feature effect. In these plots, the x- and y-axes represent the Shapley value and the value of the feature, respectively, with colours indicating the feature's value from low to high. Features are ordered based on their importance, revealing road density as the most critical feature for susceptibility in each disturbance. According to the SHAP analysis, key features influencing model predictions for insect damage include road density, distance to roads, forest type, annual precipitation, elevation, maximum wind speed, and mean temperature of the warmest quarter. For wind damage, important features include forest type, annual precipitation, distance to roads, elevation, mean temperature of the warmest quarter, and maximum wind speed. Similarly, for fire damage, significant features are distance to roads, annual precipitation, maximum wind speed, elevation, mean temperature of the warmest quarter, and forest type.

Using the susceptibility maps for each disturbance, a potential forest disturbances map was generated through an overlapping method (Fig. 13). The extent of potential damage in hectares and as a percentage for each category is detailed in Table 4. Notably, 15.6% of the forested areas in Izmir RFD are projected to have no likely damage from the considered disturbances. At the same time, 15.1% could potentially experience damage from forest fires, and a significant portion, 54.2%, could be susceptible to all three disturbances. Specifically, a minimum 0.1% of forests may face possible damage exclusively from wind, while only 0.5% might experience damage solely from insects. The evaluation extends to different forest types, as depicted in Fig. 14, shows the percentage area of each type potentially affected by the disturbances, with colour gradations from green to red indicating the percentage within each row.

Table 4 Area (hectare and percentage) of potential forest disturbances

Full size table

Discussion

Susceptibility assessment employing machine learning methods and GIS mapping in forestry studies, particularly for disturbances like forest fires, has become more prevalent (Dolui 2023; Kantarcioglu et al 2023; Rihan et al 2023). However, most studies have focused on a single forest disturbance, with the exception being Stritih et al. (2021) who considered multiple disturbances. They used the random forest (RF) algorithm, whereas our study used the AutoML framework comparing 14 machine learning algorithms and ranking the best based on AUC values. The ET algorithm was selected for susceptibility assessment, with AUC values consistently exceeding 0.98 for each disturbance.

The ET algorithm, similar to the RF algorithm, is an ensemble machine learning method. The key distinction is that the ET algorithm uses the entire dataset, reducing model bias, and is computationally more efficient. The primary reason for using AutoML is its ability to automate the entire machine-learning process, including data pre-processing, model selection, and hyperparameter tuning (Hadi et al 2023). Although alternative AutoML frameworks exist, such as Auto-sklearn, AutoKeras, PyCaret, and TPOT (Kumar et al 2022), PyCaret has been predominantly used in the literature (Bruzón et al (2021).

Additionally, susceptibility assessment requires driving factors (or predictors). In this study, 15 driving factors were selected based on reported disturbance drivers in the literature. For example, topographic and stand features (excluding canopy height) were used, like the approach by Stritih et al (2021). Climate factors such as annual precipitation, mean temperature of the warmest quarter, and maximum wind speed also were incorporated. Importantly, all factors were applied to all disturbances for a comparative analysis of their effects across different disturbances.

The creation of a multiple forest disturbances susceptibility map involved a weighted overlay analysis of each forest disturbance susceptibility map. Different approaches in the literature, including those by Pourghasemi et al. (2020), Yanar et al. (2020), Rusk et al. (2022) and Ullah et al. (2022), can be considered for such analyses.

Moreover, a potential disturbances exposure map was generated for the first time in this study, incorporating wind, insect, and forest fire damage. This map illustrates forested areas that could potentially be damaged by the disturbances considered. In the literature, such maps were predominantly created for multiple hazards. Pourghasemi et al (2020) generated a multi-hazard map that included forest fires as a disturbance but did not include wind and insect damage. The potential forest disturbances map was overlapped with the stand type map to estimate the extent to which areas of each could potentially be damaged by the disturbances. According to the results of the overlay analysis, Turkish pine species and black pine species, either purely or in mixed form, face the greatest damage from all three disturbances. For instance, only 4.3% of the forest type “Pure Turkish pine species,” one of the main species in the Izmir RFD, could potentially escape damage from the disturbances, while more than 70% may face potential damage from all the disturbances. Forest fire damage could be observed in approximately 95% of the area covered by Pure Turkish pine species. Küçük and Bilgili (2008) noted that Turkish pine and black pine are highly sensitive to fires. Regarding insect damage, it has been reported that Turkish pine species in particular, are susceptible to insects such as Orthotomicus erosus (Sarıkaya et al 2013).

Forest types such as “Mixed broadleaved species,” “Broadleaved species dominated coniferous species,” and “Other coniferous dominated broadleaved species” have the largest percentage area classified as "no damage." As Jactel et al (2017) highlighted, mixed forests are more resistant to natural disturbances that are relatively small-scale and selective in their effect. Interestingly, this study revealed that the forest type “Pure broadleaved species,” as a pure forest stand type, also has the largest percentage area of potentially no damage by disturbance. Additionally, open forested areas (i.e., gaps in stands) exhibited the highest percentage of no damage, as expected. Furthermore, wind and insect disturbances had in a lower percentage than forest fires. This is likely due to these areas often covered with grasses and small shrubs and lacking trees or containing a few scattered individuals.

The SHAP method was used to assess feature importance on model predictions and explain non-linear relationships between predictor variables and forest disturbances susceptibility. SHAP serves as a valuable tool for enhancing the interpretability of machine learning models, aiding in comprehending the relationships between input and output, and identifying the most influential features in driving predictions (Li 2022). However, as Li (2022) emphasized, the use of SHAP to provide reliable estimates requires accurate and robust machine learning models, and SHAP values should be interpreted with special attention.

Iban and Sekertekin (2022) compared SHAP outputs for different machine learning models, and their findings showed that the top three contributing factors for modeling fire susceptibility were elevation, temperature, and slope. In the present study, road density, distance to roads, and annual precipitation were identified as the most crucial factors. The rankings also highlighted the significance of maximum wind speed, elevation, and mean temperature of the warmest quarter. As expected, lower road densities and greater distances to roads decrease the susceptibility to forest fires. In addition, high-speed winds increase the susceptibility to fires. Despite the findings of Iban and Sekertekin (2022), who found lower temperatures increased the risk of wildfires, our study revealed a positive impact of higher temperatures on susceptibility to forest fires. Similarly, while distance to settlements was moderately important, in contrast, distance to roads emerged as the top-ranked feature. This discrepancy could be attributed to the exclusive consideration of highways and state roads in our analysis. In the susceptibility analysis for insect damage, road density and distance to roads emerged as the most important features. The third-ranked feature was forest type 1, i.e., pure Turkish pine species. As noted previously, Turkish pine is prone to insect attacks (Sarıkaya et al 2013). Another noteworthy feature is forest type 5 (forest gaps), identified as no insect damage was observed in these areas. Consequently, potential insect damage based on predicted susceptibility maps was observed in very limited areas of forest gaps. Regarding wind damage, maximum wind speeds showed the expected impact, with higher values increasing susceptibility. However, wind speed was ranked less important than other factors such as road density, annual precipitation, and distance to roads.

Conclusions

This study emphasizes the importance of integrating multiple forest disturbances into susceptibility assessments through machine learning, utilizing the AutoML framework. This approach facilitates the comparison of different algorithms, aiding in the selection of the best-performing one. The study reports on multiple forest disturbances susceptibility assessments, considering wind, insect, and forest fire damages, resulting in the creation of four forest disturbances susceptibility maps. This comprehensive analysis provides valuable insights and may be extended to other forested areas by incorporating additional disturbances and selecting an appropriate number of driving factors. The methodology proposed, encompassing topographic, climate, and stand features, allows for large-scale susceptibility assessments of forest disturbances. It provides a foundation for assessing different scenarios based on anticipated climate change. The potential forest disturbances exposure map, overlaid with forest types, offers an understanding of the interactions between different forest types and possible disturbances. Susceptibility assessment contributes to a better awareness of the factors driving forest disturbances, with novel methods like SHAP enhancing the relationship between input and output of models. This method, emphasizing the importance of specific factors, may be preferred in susceptibility analysis of forest disturbances using machine learning algorithms. PyCaret AutoML further streamlines the identification of the most important driving factors. All generated maps, including susceptibility maps, potential forest disturbance exposure maps, and maps highlighting important driving factors, serve as valuable base maps for forest managers in decision-making processes. These maps consolidate information about various disturbances within a specific area, providing essential insights to develop strategies for combating disturbances and contributing to sustainable forest management practices.

References

Akyuz YF, Kucukosmanoglu A (1997) Application of GIS for fire-damaged areas in Izmir. Proceedings of the XI World Forestry Congress, Turkey, 13–22 October, Vol. 1, pp.239.
Batar AK, Shibata H, Watanabe T (2021) A novel approach for forest fragmentation susceptibility mapping and assessment: a case study from the Indian Himalayan region. Remote Sens 13(20):4090. https://doi.org/10.3390/rs13204090
Article Google Scholar
Bebi PSEP, Seidl R, Motta R, Fuhr M, Firm D, Krumm F, Conedera M, Ginzler C, Wohlgemuth T, Kulakowski D (2017) Changes of forest cover and disturbance regimes in the mountain forests of the Alps. For Ecol Manag 388:43–56. https://doi.org/10.1016/j.foreco.2016.10.028
Article CAS Google Scholar
Bruzón AG, Arrogante-Funes P, Arrogante-Funes F, Martín-González F, Novillo CJ, Fernández RR, Vázquez-Jiménez R, Alarcón-Paredes A, Alonso-Silverio GA, Cantu-Ramirez CA, Ramos-Bernal RN (2021) Landslide susceptibility assessment using an AutoML framework. Int J Environ Res Public Health 18(20):10971. https://doi.org/10.3390/ijerph182010971
Article PubMed PubMed Central Google Scholar
Bugaj M, Wrobel K, Iwaniec J (2021) Model explainability using SHAP values for LightGBM predictions. In: 2021 IEEE XVIIth international conference on the perspective technologies and methods in MEMS design (MEMSTECH) (pp. 102–106). IEEE. https://doi.org/10.1109/MEMSTECH53091.2021.9468078
Chen X, Zhao W, Chen J, Qu Y, Wu D, Chen X (2021) Mapping large-scale forest disturbance types with multi-temporal CNN framework. Remote Sens 13(24):5177. https://doi.org/10.3390/rs13245177
Article Google Scholar
European Commission (2013) Communication from the commission to the European parliament, the council, the European economic and social committee and the committee of the regions: a new EU forest strategy: for forests and the forest-based sectors. https://eur-lex.europa.eu/resource.html?uri=cellar:21b27c38-21fb-11e3-8d1c-01aa75ed71a1.0022.01/DOC_1&format=PDF. Accessed 5 July 2023
Corominas J, van Westen C, Frattini P, Cascini L, Malet JP, Fotopoulou S, Catani F, Van Den Eeckhaut M, Mavrouli O, Agliardi F, Pitilakis K, Winter MG, Pastor M, Ferlisi S, Tofani V, Hervás J, Smith JT (2014) Recommendations for the quantitative analysis of landslide risk. Bull Eng Geol Environ 73:209–263. https://doi.org/10.1007/s10064-013-0538-8
Article Google Scholar
Dolui S (2023) Forest fire susceptibility mapping and risk assessment using integrated AHP and DEMATEL method for Purulia District, West Bengal (India). In: Water, land, and forest susceptibility and sustainability (pp 401–452). Elsevier
Fick SE, Hijmans RJ (2017) Worldclim 2: New 1-km spatial resolution climate surfaces for global land areas. Int J Climatol 37(12):4302–4315. https://doi.org/10.1002/joc.5086
Article Google Scholar
FRA (2020) Global forest resources assessment 2020: terms and definitions. Food and agriculture organization of the United Nations., pp 168, Rome, Italy
Garamszegi B, Jung C, Schindler D (2022) Multispectral spaceborne proxies of predisposing forest structure attributes to storm disturbance—a case study from Germany. Forests 13(12):2114. https://doi.org/10.3390/f13122114
Article Google Scholar
GDF (2023) Forestry statistics 2022. https://www.ogm.gov.tr/tr/e-kutuphane/resmi-istatistikler. Accessed 27 June 2023
Guerriero L, Di Napoli M, Novellino A, Martire DD, Rispoli C, Lee K, Bee E, Harrison A, Calcaterra D (2022) Multi-hazard susceptibility assessment using analytic hierarchy process: the Derwent Valley Mills UNESCO World Heritage Site case study (United Kingdom). J Cult Herit 55:339–345. https://doi.org/10.1016/j.culher.2022.04.009
Article Google Scholar
Hadi RH, Hady HN, Hasan AM, Al-Jodah A, Humaidi AJ (2023) Improved fault classification for predictive maintenance in industrial IoT based on AutoML: a case study of ball-bearing faults. Processes 11(5):1507. https://doi.org/10.3390/pr11051507
Article Google Scholar
Halvari T, Nurminen JK, Mikkonen T (2020) Testing the robustness of automl systems. arXiv preprint arXiv:2005.02649, https://doi.org/10.48550/arXiv.2005.02649
He HS, Mladenoff DJ (1999) Spatially explicit and stochastic simulation of forest-landscape fire disturbance and succession. Ecology 80(1):81–99. https://doi.org/10.1890/0012-9658(1999)080[0081:SEASSO]2.0.CO;2
Article Google Scholar
Hirschmugl M, Gallaun H, Dees M, Datta P, Deutscher J, Koutsias N, Schardt M (2017) Methods for mapping forest disturbance and degradation from optical earth observation data: a review. Curr for Rep 3:32–45. https://doi.org/10.1007/s40725-017-0047-2
Article Google Scholar
Hutter F, Kotthoff L, Vanschoren J (eds) (2019) Automated machine learning: methods, systems, challenges. Springer International Publishing, Cham
Google Scholar
Iban MC, Sekertekin A (2022) Machine learning based wildfire susceptibility mapping using remotely sensed fire data and GIS: a case study of Adana and Mersin provinces. Turkey Ecol Inform 69:101647. https://doi.org/10.1016/j.ecoinf.2022.101647
Article Google Scholar
Jactel H, Bauhus J, Boberg J, Bonal D, Castagneyrol B, Gardiner B, Gonzalez-Olabarria JR, Koricheva J, Meurisse N, Brockerhoff EG (2017) Tree diversity drives forest stand resistance to natural disturbances. Curr for Rep 3:223–243. https://doi.org/10.1007/s40725-017-0064-1
Article Google Scholar
Kantarcioglu O, Schindler K, Kocaman S (2023) Forest fire susceptibility assessment with machine learning methods in north-east Turkiye. Int Arch Photogramm Remote Sens Spat Inf Sci 48:161–167. https://doi.org/10.5194/isprs-archives-XLVIII-M-1-2023-161-2023
Article Google Scholar
Kucuk O, Bilgili E (2008) Crown fuel characteristics and fuel load estimates in young Calabrian pine (Pinus brutia Ten.) stands in Northwestern Turkey. Fresenius Environ Bull 17(12b):2226–2231
CAS Google Scholar
Kucukosmanoglu A, Uzmez I (2019) Forest fires in Izmir regional directorate of forestry. Appl Ecol Environ Res 17(3):6989–7001. https://doi.org/10.15666/aeer/1703_69897001
Article Google Scholar
Kumar V, Faheem M, Lee KW (2022) A decade of machine learning-based predictive models for human pharmacokinetics: Advances and challenges. Drug Discov Today 27(2):529–537. https://doi.org/10.1016/j.drudis.2021.09.013
Article CAS PubMed Google Scholar
Li Z (2022) Extracting spatial effects from machine learning model using local interpretation method: an example of SHAP and XGBoost. Comput Environ Urban Syst 96:101845. https://doi.org/10.1016/j.compenvurbsys.2022.101845
Article Google Scholar
Li L, Lan H, Guo C, Zhang Y, Li Q, Wu Y (2017) A modified frequency ratio method for landslide susceptibility assessment. Landslides 14:727–741. https://doi.org/10.1007/s10346-016-0771-x
Article Google Scholar
Li R, Shinde A, Liu A, Glaser S, Lyou Y, Yuh B, Wong J, Amini A (2020) Machine learning–based interpretation and visualization of nonlinear interactions in prostate cancer survival. JCO Clin Cancer Inform 4:637–646. https://doi.org/10.1200/CCI.20.00002
Article PubMed Google Scholar
Lindenmayer DB, Foster DR, Franklin JF, Hunter ML, Noss RF, Schmiegelow FA, Perry D (2004) Salvage harvesting policies after natural disturbance. Science 303(5662):1303–1303. https://doi.org/10.1126/science.1093438
Article CAS PubMed Google Scholar
Lundberg SM, Lee SI (2017) A unified approach to interpreting model predictions. Adv Neural Inf Process Syst 30, New York, NY, USA: Curran Associates, Inc, pp. 4765–4774, 2017, [online] Available: http://papers.nips.cc/paper/7062-a-unified-approach-to-interpreting-model-predictions.pdf
Nepstad DC, Stickler CM, Filho BS, Merry F (2008) Interactions among Amazon land use, forests and climate: prospects for a near-term forest tipping point. Philos Trans R Soc Lond Biol Sci 363(1498):1737–1746. https://doi.org/10.1098/rstb.2007.0036
Article Google Scholar
Özdemir Ş, Örslü S (2019) New perspective on machine learning process: AutoML (Original in Turkish). J Manag Inf Syst 1(1):23–30
Google Scholar
Pourghasemi HR, Kariminejad N, Amiri M, Edalat M, Zarafshar M, Blaschke T, Cerda A (2020) Assessing and mapping multi-hazard risk susceptibility using a machine learning technique. Sci Rep 10(1):3203. https://doi.org/10.1038/s41598-020-60191-3
Article CAS PubMed PubMed Central Google Scholar
Reichenbach P, Rossi M, Malamud BD, Mihir M, Guzzetti F (2018) A review of statistically-based landslide susceptibility models. Earth-Sci Rev 180:60–91. https://doi.org/10.1016/j.earscirev.2018.03.001
Article Google Scholar
Ren H, Zhang L, Yan M, Chen B, Yang Z, Ruan L (2022) Spatiotemporal assessment of forest fire vulnerability in China using automated machine learning. Remote Sens 14(23):5965. https://doi.org/10.3390/rs14235965
Article Google Scholar
Renza D, Cárdenas EA, Jaramillo CM, Weber SS, Martinez E (2021) Landslide susceptibility model by means of remote sensing images and AutoML. In: Figueroa-García JC, Díaz-Gutierrez Y, Gaona-García EE, Orjuela-Cañón AD (eds) Applied computer sciences in engineering: 8th workshop on engineering applications, WEA 2021, Medellín, Colombia, October 6–8, 2021, Proceedings. Springer International Publishing, Cham, pp 25–37. https://doi.org/10.1007/978-3-030-86702-7_3
Chapter Google Scholar
Rihan M, Bindajam AA, Talukdar S et al (2023) Forest fire susceptibility mapping with sensitivity and uncertainty analysis using machine learning and deep learning algorithms. Adv Space Res 72(2):426–443. https://doi.org/10.1016/j.asr.2023.03.026
Article Google Scholar
Rusk J, Maharjan A, Tiwari P, Chen TK, Shneiderman S, Turin M, Seto KC (2022) Multi-hazard susceptibility and exposure assessment of the Hindu Kush Himalaya. Sci Total Environ 804:150039. https://doi.org/10.1016/j.scitotenv.2021.150039
Article CAS PubMed Google Scholar
Sahana M, Hong H, Sajjad H, Liu J, Zhu AX (2018) Assessing deforestation susceptibility to forest ecosystem in Rudraprayag district, India using fragmentation approach and frequency ratio model. Sci Total Environ 627:1264–1275
Article CAS PubMed Google Scholar
Sarıkaya O, Ibis HM, Toprak Ö (2013) The flight activity and population density of Orthotomicus erosus (Wollaston, 1857) in the Brutian pine (Pinus brutia Ten.) forests of Izmir Province, Turkey. Int J Sci Basic Appl 12:208–221
Google Scholar
Senf C, Seidl R (2021) Mapping the forest disturbance regimes of Europe. Nat Sustain 4(1):63–70. https://doi.org/10.1016/j.scitotenv.2018.01.290
Article CAS Google Scholar
Shabani S, Pourghasemi HR, Blaschke T (2020) Forest stand susceptibility mapping during harvesting using logistic regression and boosted regression tree machine learning models. Glob Ecol Conserv 22:e00974. https://doi.org/10.1016/j.gecco.2020.e00974
Article Google Scholar
Sivrikaya F, Özcan GE, Enez K (2023) Predicting the susceptibility to Pityokteines curvidens using GIS with analytical hierarchy process and, maximum entropy models in fir forests. In: Analytic hierarchy process-models, methods, concepts, and applications. IntechOpen. https://doi.org/10.5992/intechopen.1001074
Stritih A, Senf C, Seidl R, Grêt-Regamey A, Bebi P (2021) The impact of land-use legacies and recent management on natural disturbance susceptibility in mountain forests. For Ecol Manag 484:118950. https://doi.org/10.1016/j.foreco.2021.118950
Article Google Scholar
Tien Bui D, Pradhan B, Lofman O, Revhaug I (2012) Landslide susceptibility assessment in Vietnam using support vector machines, decision tree, and Naive Bayes models. Math Probl Eng 2012:974638. https://doi.org/10.1155/2012/974638
Article Google Scholar
Turner MG (2010) Disturbance and landscape dynamics in a changing world. Ecology 91(10):2833–2849. https://doi.org/10.1890/10-0097.1
Article PubMed Google Scholar
Ullah K, Wang Y, Fang Z, Wang L, Rahman M (2022) Multi-hazard susceptibility mapping based on convolutional neural networks. Geosci Front 13(5):101425. https://doi.org/10.1016/j.gsf.2022.101425
Article Google Scholar
URL (2023) https://www.automl.org/automl/. Accessed 7 July 2023
Vaccaro L, Sansonetti G, Micarelli A (2021) An empirical review of automated machine learning. Computers 10(1):11. https://doi.org/10.3390/computers10010011
Article Google Scholar
Wang H, Zhang L, Yin K, Luo H, Li J (2021) Landslide identification using machine learning. Geosci Front 12(1):351–364. https://doi.org/10.1016/j.gsf.2020.02.012
Article Google Scholar
Waring J, Lindvall C, Umeton R (2020) Automated machine learning: review of the state-of-the-art and opportunities for healthcare. Artif Intell Med 104:101822. https://doi.org/10.1016/j.artmed.2020.101822
Article PubMed Google Scholar
Yanar T, Kocaman S, Gokceoglu C (2020) Use of Mamdani fuzzy algorithm for multi-hazard susceptibility assessment in a developing urban settlement (Mamak, Ankara, Turkey). ISPRS Int J Geoinf 9(2):114. https://doi.org/10.3390/ijgi9020114
Article Google Scholar

Download references

Funding

No Funding is available.

Author information

Authors and Affiliations

Department of Forest Engineering, Faculty of Forestry, Izmir Katip Çelebi University, 35620, Izmir, Turkey
Remzi Eker
Graduate School of Natural and Applied Sciences, Izmir Katip Çelebi University, 35620, Izmir, Turkey
Kamber Can Alkiş
Department of Forest Engineering, Faculty of Forestry, Düzce University, 81620, Düzce, Turkey
Abdurrahim Aydın

Authors

Remzi Eker
View author publications
You can also search for this author in PubMed Google Scholar
Kamber Can Alkiş
View author publications
You can also search for this author in PubMed Google Scholar
Abdurrahim Aydın
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

RE: Conceptualization, Methodology, Software, Data curation, Writing—Original draft preparation, Supervision, Writing—Review & Editing. KCA: Data curation, Writing-Original draft. AA: Sup-ervision, Writing—Review & Editing.

Corresponding author

Correspondence to Remzi Eker.

Ethics declarations

Conflict of interest

The authors declare they have no known financial interests or personal relationships that influenced the work in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The online version is available at http://www.springerlink.com.

Corresponding editor: Yu Lei.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 2815 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Eker, R., Alkiş, K.C. & Aydın, A. Assessment of large-scale multiple forest disturbance susceptibilities with AutoML framework: an Izmir Regional Forest Directorate case. J. For. Res. 35, 65 (2024). https://doi.org/10.1007/s11676-024-01723-9

Download citation

Received: 05 October 2023
Accepted: 19 December 2023
Published: 02 April 2024
DOI: https://doi.org/10.1007/s11676-024-01723-9

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Assessment of large-scale multiple forest disturbance susceptibilities with AutoML framework: an Izmir Regional Forest Directorate case

Abstract

Similar content being viewed by others

Urban flood susceptibility mapping using frequency ratio and multiple decision tree-based machine learning models

Strengthening Forest Landscape Restoration through Understanding Land Use Dynamics: Case Study of Tchamba Prefecture (Togo)

In temperate Europe, fire is already here: The case of The Netherlands

Introduction