Improving the probabilistic drought prediction with soil moisture information under the ensemble streamflow prediction framework

Kim, Gi Joo; Kim, Dae Ho; Kim, Young-Oh

doi:10.1007/s00477-024-02710-6

Improving the probabilistic drought prediction with soil moisture information under the ensemble streamflow prediction framework

ORIGINAL PAPER
Open access
Published: 04 April 2024

(2024)
Cite this article

Download PDF

You have full access to this open access article

Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Improving the probabilistic drought prediction with soil moisture information under the ensemble streamflow prediction framework

Download PDF

279 Accesses
Explore all metrics

Abstract

Reliable drought prediction should be preceded to prevent damage from potential droughts. In this context, this study developed a hydrological drought prediction method, namely ensemble drought prediction (EDP) to reflect drought-related information under the ensemble streamflow prediction framework. After generating an ensemble of standardized runoff index by converting the ensemble of generated streamflow, the results were adopted as the prior distribution. Then, precipitation forecast and soil moisture were used to update the prior EDP. The EDP + A model included the precipitation forecast with the PDF-ratio method, and the observed soil moisture index was reflected in the former EDP and EDP + A via Bayes’ theorem, resulting in the EDP + S and EDP + AS models. Eight basins in Korea with more than 30 years of observation data were applied with the proposed methodology. As a result, the overall performance of the four EDP models yielded improved results than the climatological prediction. Moreover, reflecting soil moisture yielded improved evaluation metrics during short-term drought predictions, and in basins with larger drainage areas. Finally, the methodology presented in this study was more effective during periods with less intertemporal variabilities.

Climate Change and Drought: a Perspective on Drought Indices

Article 23 April 2018

Climate Change and Drought: From Past to Future

Article 12 May 2018

Drought severity across Africa: a comparative analysis of multi-source precipitation datasets

Article 27 April 2024

1 Introduction

It is estimated that about two-thirds of the world’s population has experienced severe water scarcity (Mekonnen and Hoekstra 2016), and the dry will get even more drier as global warming has accelerated the hydrological cycle resulting in more extremes (Seager et al. 2010; Trenberth et al. 2014). Although droughts are complex catastrophes that are difficult to classify into a single category, review studies on droughts often classify droughts into the following four categories: meteorological, agricultural, hydrological, and socio-economic (Ndayiragije and Li 2022). Whereas meteorological and hydrological droughts are associated with physical phenomena (i.e., rainfall, surface water, and groundwater), the other two classifications focus more on other aspects of droughts, such as soil moisture and societal perspectives.

According to Korean National Drought Information-analysis Center, droughts have also been occurring locally in Korea for almost every year since 2006, and a record-breaking multi-year drought from 2014 through 2019 due to the lack of precipitation on the western coast of the Korean Peninsula (Kim et al. 2019, 2021) have heavily affected the lives of the public. In response, multiple efforts for probabilistic prediction in drought conditions have been conducted to quantify the uncertainty and derive results that can help better decision-making. Novel probabilistic drought forecasting models, such as the Bayesian approach (Madadgar and Moradkhani 2013, 2014) were often used to forecast future droughts preserving spatial–temporal variations. More recently, utilizing diverse types of data such as remotely sensed data and satellite data for drought monitoring and impact assessments (Gavahi et al. 2020; Li et al. 2019; Park et al. 2018) were proven to enhance drought predictions in various time scales.

A case in point is the ensemble streamflow prediction (ESP) framework, which is one of the most widely utilized and studied tools in the hydrology field over decades (Day 1985). ESP generates the ensemble of streamflow forecasts with the previous historical meteorological observation data given identical initial conditions. The purpose of the ESP is not just to represent an error bar around the predicted values, but to bundle up possible scenarios in the future to estimate probabilities of events. Branković et al. (1990) proved that the probabilistic prediction via ESP has better accuracy than a single-valued prediction. However, since the traditional ESP are not able to reflect the actual hydrometeorological conditions at the time of interest, multiple approaches have applied weather predictions generated by dynamic models of climate and ocean system to the ESP (Saha et al. 2014; Yuan et al. 2014) Harrigan et al. (2018) found that the skill of the ESP decreases exponentially with increasing lead times even using a variety of climate ensembles because the uncertainties in initial hydrologic conditions (e.g., soil moisture) still exist. Recent novel approaches to the application of ESP also include snow data assimilation to enhance predictions (DeChant and Moradkhani 2011) and utilizing machine learning algorithms to improve prediction reliability (Tyralis et al. 2021; Liu et al. 2022).

As the traditional ESP framework was adopted to predict streamflow, the conversion of streamflow prediction ensembles into an appropriate drought measure such as a drought index, should be preceded to utilize ESP during probabilistic drought forecasts. Consequentially, few studies endeavored to convert the precipitation ensemble into the standardized precipitation index (SPI), which is one the most widely used drought indices, to predict meteorological drought (Yoon et al. 2012; Mo and Lyon 2015). In addition, Son and Bae (2015) tested a model that produces standardized runoff index (SRI) ensembles by converting ESP simulation into the SRI for hydrological drought prediction and later improved the skill of the model using climate dynamic models and Bayesian methods (Bae et al. 2017). This Bayesian method combines empirical knowledge with numerical analysis results through Bayesian inferences for more accurate predictions. This approach has also been used to improve the simulation performance of hydrologic conditions (Luo et al. 2007). However, estimating hydrological droughts with only streamflow forecast ensembles may yield inaccurate results since many other factors substantially affect the hydrological process, as highlighted in previous literature.

As soil moisture is often considered one of the most important factors during the hydrological processes, multiple studies have been carried out continuously to make accurate estimates of soil moisture and to analyze its effects on agricultural droughts (Wood and Lettenmaier 2008; Li et al. 2009; Shukla and Lettenmaier 2011; Koster et al. 2014; Yuan et al. 2016). Moreover, it was also found that precipitation forcing has a significant impact on the uncertainties of predictions of hydrological drought and soil moisture (Mo et al. 2012). Although it has been verified that the relationships between hydrological processes and the conditions of climate and soil moisture are significantly important in drought assessment in multiple previous studies, few efforts exist that reflect the information from the drought-related factors (soil moisture and climatic information) during the hydrological drought predictions.

Thus, this study proposes an ensemble drought prediction (EDP) framework, which converts the streamflow simulation ensembles from the conventional ESP framework into a representative drought index for hydrological drought outlook. Then, the probabilistic precipitation forecasts and the information regarding soil moisture were used to update the previous forecasts via Bayes’ Theorem to improve prediction accuracy. The proposed methodology was then applied to eight major watersheds in South Korea, and the results were evaluated in terms of prediction accuracy with both deterministic and probabilistic metrics.

2 Methodology

2.1 Ensemble streamflow prediction

With the assumption that the historical records may potentially represent future conditions (Day 1985), ESP is a scenario-based prediction technique that is often adopted in operational hydrology for streamflow forecasts. In brief, ESP generates streamflow ensemble predictions by inputting climate forcing data sampled from the observation data to a hydrologic model given the same initial condition at the time of the forecast and thus is sometimes referred to as ‘conditional Monte Carlo simulation’ (Day 1985). The initial conditions in the ESP framework which can have a substantial effect on the prediction outcomes, such as soil moisture, are often estimated through observations just before the time of forecast.

By far, ESP has been adopted in the field of water resources engineering widely for various purposes. A popular approach for utilizing ESP is in reducing different types of uncertainties inherent in most hydrologic modeling (e.g., input uncertainty, parameter estimation uncertainty, etc.) (Hwang et al. 2011; Tongal & Booij 2017). Furthermore, ESP is also adequate when coupled with the optimization models for deriving optimal release decisions in reservoir operations during the formulation. In this context, multiple studies have proved that incorporating ESP may improve system performance under various spatial–temporal scales (Kim et al. 2007; Jafarzadegan et al. 2014). Other than applying ESP for streamflow prediction, the use of ESP may be extended to improving the prediction of other variables (Day 1985) such as hydrological drought, as presented in this study. Therefore, this study aims to present a methodology that utilizes the concept of ESP to conduct drought prediction, namely EDP. Further details regarding the procedures behind EDP are thoroughly explained in the following subsections.

2.2 Standardized runoff index

The proposed methodology of this study, EDP, requires the output from the ESP model (i.e., streamflow prediction ensembles) to be converted to an index that can quantify drought severity. Among multiple candidate indices, this study selected the SRI, which is one of the most commonly used indices to represent hydrological drought through the streamflow level. SRI was first developed by Shukla and Wood (2008) to consider various kinds of streamflow inflow such as surface runoff and subsurface runoff for hydrological drought analysis. SRI is appropriate to analyze both short- and long-term hydrological droughts since it is able to analyze droughts of various durations through time scale accumulation (e.g., 1, 3, and 12 months).

A simple calculation procedure of the SRI adopted in this study is as follows. The first step is calculating the cumulative streamflow over a given time scale $k$ by accumulation. Next, the cumulative probability values of the cumulative streamflow are estimated with the appropriate probability density function the cumulated streamflow is assumed to follow (normal distribution in this study). Finally, the SRI is derived by converting the cumulative probability into the variable which follows the standard normal distribution. Let $g(\cdot )$ be the function that calculates the SRI described from the above procedure. Then the one month ahead ensemble drought prediction (${EDP}_{t+1}$) can be converted through Eq. (1) where ${q}_{t}$ represents the streamflow at the time interval $t$, the subscript $t$ indicates the time interval month, $k$ is the given time scale for SRI accumulation, and ${\mu }_{0}$ and ${\sigma }_{0}^{2}$ are the mean and standard deviation of the EDP, respectively.

$${EDP}_{t+1}={\text{g}}\left({q}_{t-k+2},\dots ,{q}_{t},{q}_{t+1}\right)\sim {\text{N}}({\mu }_{0},{\sigma }_{0}^{2})$$

(1)

2.3 Bayes’ theorem

Bayesian inference enables the update of prior distribution with the introduction of novel information through observation with a predefined likelihood function. Mathematically, the equation for the Bayes’ theorem consists of the following three key components: prior distribution, likelihood function, and posterior distribution, as shown in Eq. (2).

$$p(D|X)=\frac{p(X|D)p(D)}{p(X)}$$

(2)

Where $D$ is the random variable of interest (i.e., SRI in this study), $X$ is the new information for the random variable of interest (i.e., soil moisture in this study), $p(D)$ is the prior distribution, $p(X|D)$ is the likelihood function, $p(X)$ is the marginal distribution of $X$, and $p(D|X)$ is the posterior distribution to be updated. As the conventional ESP framework is prone to how the initial conditions before the simulation are set, this study aimed to improve the EDP by reflecting the soil moisture information via the Bayes’ theorem. More detailed explanations regarding each component in Eq. (2) are in the following subsubsections.

2.3.1 Prior distribution

In this study, the prior distribution $p(D)$ is estimated from the distribution of the EDP, calculated from the SRI values of different time scales. Since the EDP is an ensemble of drought indices that is assumed to follow normal distribution, the representative statistics of $D$ can be expressed as Eq. (3), where ${\mu }_{0}$ and ${\sigma }_{0}^{2}$ are the mean and variance of the EDP, respectively.

$${\text{p}}(D)\sim {\text{N}}({\mu }_{0},{\sigma }_{0}^{2})$$

(3)

2.3.2 Likelihood function

The likelihood function is the conditional probability of $X$ given $D$, where the random variable $X$ is the soil moisture index (SMI) which is the satellite observation data being provided by APEC Climate Center (APCC) since 2001. In other study cases, the likelihood function was often estimated from the past performance of the ensemble model (Luo et al. 2007; Seo et al. 2019). However, in this study, the likelihood function was obtained from the linear regression model between $D$ (SRI) and $X$ (SMI) to update the prediction results with soil moisture information, as shown in Eq. (4).

$${X}_{t}={b}_{0}+{b}_{1}{D}_{t+1}+\varepsilon$$

(4)

Where the subscript $t$ is the time interval month, and $b$ and $\varepsilon$ are the regression parameters and residuals, respectively. The residual $\varepsilon$ follows a normal distribution with zero mean and standard deviation ${\sigma }_{\varepsilon }$. As a result, the likelihood function adopted in this study can be expressed as Eq. (5).

$${\text{p}}({X}_{t}|{D}_{t+1})\sim {\text{N}}({b}_{0}+{b}_{1}{D}_{t+1},{\sigma }_{\varepsilon }^{2})$$

(5)

2.3.3 Posterior distribution

Based on Bayes’ theorem, the posterior distribution can be estimated using normal distribution as Eq. (6) when both the prior distribution and the likelihood function follow a normal distribution as well (Lee 1997). The parameters of the posterior distribution can also be easily derived through equations Eq. (7–8), which is a kind of variance-weighted average of the prior distribution and the likelihood function.

$${\text{p}}({D}_{t+1}|{X}_{t})\sim {\text{N}}({\mu }_{p},{\sigma }_{p}^{2})$$

(6)

$$\frac{1}{{\sigma }_{p}^{2}}=\frac{1}{{\sigma }_{0}^{2}}+\frac{{b}_{1}^{2}}{{\sigma }_{\varepsilon }^{2}}$$

(7)

$$\frac{{\mu }_{p}}{{\sigma }_{p}^{2}}=\frac{{\mu }_{0}}{{\sigma }_{0}^{2}}+\frac{{b}_{1}^{2}}{{\sigma }_{\varepsilon }^{2}}\left(\frac{{{X}_{t}-b}_{0}}{{b}_{1}}\right)$$

(8)

Where ${\mu }_{p}$ and ${\sigma }_{p}^{2}$ are the mean and standard deviation of the posterior distribution.

3 Case study

3.1 Study area and meteorological data transformation

The proposed methodology was applied to eight major basins located in South Korea (Fig. 1), all with more than 30 years of observation data sets since it is generally considered stable when the drought indices are generated with observations for more than 30 years. All eight dams are multi-purpose reservoirs that are responsible for more than one of the following objectives: hydroelectric power generation, agricultural water supply, industrial water supply, and environmental flows. As in Fig. 1 and Table 1, the eight dams are all distinctive in terms of their location, capacity, and scale. Thus, a comparative analysis of the dams regarding their characteristics can also be conducted during the evaluation process of this study.

Table 1 Specific information about the eight dams analyzed during this study

Full size table

For the eight dams, historical meteorological data sets including daily precipitation, temperature, and daily inflow were collected from either the Korea Meteorological Administration or the Korea Water Cooperation (K-water). The daily potential evapotranspiration was estimated by the FAO Penman–Monteith equation No. 56 method (Allen et al. 1998), and the meteorological data sets were converted into the areal mean values for each basin through the Thiessen polygon method.

For the rainfall-runoff model which is an essential component in the ESP framework, this study used the TANK model, a conceptual model that describes the rainfall-runoff process in a structure that consists four tanks (Sugawara 1995). To consider the snow accumulation–melting, the modified TANK model by McCabe and Markstrom (2007) was used, and the parameters of the TANK model were calibrated and validated using the observation inflow data of each basin from 1976 to 2015, as suggested in the previous study by Seo and Kim (2018), which used the shuffled complex evolution algorithm with the observation data. For more specifics regarding the steps for parameter estimation in the TANK model, the reader is referred to Seo and Kim (2018).

Finally, to evaluate the prediction results in terms of both short- and long-term droughts, this study selected the two-time scales of 3 and 12 for the monthly SRI values, since the SRI3 and SRI12 are known as the appropriate drought indices for the short- and long-term hydrological drought in South Korea (Son et al. 2011). Finally, the drought phases were categorized by the corresponding to the SRI values as shown in Table 2 to distinguish the severity of droughts during the evaluation process.

Table 2 Drought classification according to the SRI values

Full size table

3.2 EDP model development

Four types of EDP models (EDP, EDP + A, EDP + S, and EDP + AS) were generated using the methodological process described in the previous sections. First, the EDP model is a conventional model which only includes the information from SRI ensembles. The EDP + A model includes the information from the precipitation forecast from the APCC to update the former EDP model. During this process, the pdf-ratio method (Stedinger and Kim 2010) which revises the probability of each meteorological scenario (i.e., input ensemble) based on the probabilistic climate forecast, is used to reflect novel climate information (i.e., precipitation forecasts). To update the EDP model based on soil moisture, the lag-1 correlation coefficients between SRI and SMI were calculated and confirmed.

The next two models, EDP + S and EDP + AS models both update the prior distributions of either the EDP or EDP + A forecasts through the Bayes’ theorem with the information of SMI satellite observation data from the APCC as likelihood function. As previously mentioned, the likelihood function is estimated by the time series regression between two random variables, SMI and the distribution of EDP as shown in Eq. (4). The overall flow chart of this study adopted during the model formulation process is shown in Fig. 2.

3.3 Performance evaluation metrics

Among multiple candidates of evaluation metrics, this study selected three metrics for comparative evaluation among the prediction models: root mean squared error (RMSE), ranked probability skill score (RPSS), and the Brier skill score (BSS). Whereas the RMSE measures the accuracy of the prediction models in comparison with the observed data, the other two metrics, RPSS and BSS, are both score-based metrics that are often adopted when evaluating probabilistic predictions.

The first metric, RMSE, was used to evaluate the quantitative error between the true data and the single-valued prediction. In the RMSE, both bias and variability of predicted values are included. Ideally, zero RMSE across the prediction horizon is favorable since it indicates that the model predictions are able to forecast perfectly identical to the observed data. Specifically, RMSE is computed by taking the root of the mean squared error as written in Eq. (9) where $\overline{{P }_{i}}$ is the mean of the prediction ensemble at time step $i$, ${O}_{i}$ is the observed value at time step $i$, and $N$ is the total number of time steps.

$${\text{RMSE}}=\sqrt{\frac{1}{N}{\sum\nolimits_{i=1}^{N}\left(\overline{{P }_{i}}-{O}_{i}\right)}^{2}}$$

(9)

The ranked probability skill score (RPSS) and the Brier skill score (BSS) were used to evaluate the predictability of the drought occurrence either in multi-categorical or binary outcomes. They are both skill scores which are similar to the rank probability score (RPS), which penalizes the prediction results that are far from actual with a greater penalty and is often adopted in the field of climatology for forecast evaluation. RPS is calculated from the sum of the mean squared difference between the category of the forecast and the actual category the observation belongs to. Ideally, a zero RPS value is expected for a perfect forecast, and the greater the RPS value, the poorer the forecast. RPSS can then be computed from the RPS value according to Eq. (10), where ${RPS}_{cl}$ represents a reference forecast from the climatological predictions.

$${\text{RPSS}}=1-\frac{RPS}{{RPS}_{cl}}$$

(10)

Finally, the concept of BSS is derived from the Brier score (BS), which is sometimes referred to as the reduced version of the RPSS. Both the BS and the BSS are identical to the RPS and the RPSS in cases in which the forecasts can be categorized into exactly two categories (i.e., drought or non-drought) (Weigel et al. 2007). In this specific case study, BSS is used to evaluate whether the drought prediction model can predict drought occurrences above a certain phase. Ultimately, this study utilized the three metrics to evaluate the performance of the forecast results from various EDP models comprehensively.

4 Results and discussion

4.1 Determination of the EDP models

All four EDP models (EDP, EDP + S, EDP + A, and EDP + AS) were generated to forecast the 1-month lead droughts for the eight basins from 2008 to 2017. After fitting the prior distribution with the simulated ensembles from the TANK model, the applicability of implementing SMI into the SRI was validated using the linear regression coefficient between SRI and SMI. As in Table 3, the lag-1 correlation coefficients between SRI and SMI of all eight basins surpassed the critical value of 0.136 at a significance level of 95%. This result indicates that the inclusion of soil moisture information in terms of likelihood function is valid and applicable. The parameters of the likelihood function were then estimated with the average values among the four folds after going through the fourfold cross validation method. For more specific details regarding the parameter estimation procedures of the likelihood function, refer to Kim (2020).

Table 3 Lag-1 correlation coefficients between SRI and SMI

Full size table

Other than the EDP + S model, which updates the former EDP model with SMI, probabilistic precipitation forecasts were also included to create the two other models: EDP + A and EDP + AS. Whereas the EDP + A model utilized novel climate information via the pdf-ratio method from the distribution of the EDP model forecasts, the EDP + AS model includes both the information from the soil moisture and the precipitation forecast for improved prediction accuracy. For illustration, Fig. 3(a) shows an example of implementing four EDP models to predict short-term drought (i.e., using SRI3) at the Soyang basin in July 2014. The light blue colored bars represent the ensemble members of the SRI and the grey line indicates the likelihood function. Blue, red, black, and green line represents the predicted distributions of the EDP models: EDP, EDP + S, EDP + A, and EDP + AS, respectively. Figure 3(b) shows the time series plot during the entire prediction horizon of the four models, using the expected value of the distribution as the representative value. The framework used to determine the distributions of the four models was identically applied to the eight basins, and further analysis of the performance of each model are to be included in the following subsections.

4.2 Drought prediction performance evaluation

For further analysis, this study calculated the performance metrics in terms of length of drought (i.e., short- and long-term), and the period of prediction (i.e., irrigation and non-irrigation) as the annual precipitation pattern may vary in the location of the target basins as shown in many previous studies regarding droughts (Kim et al. 2019, 2021, 2022). Moreover, as the droughts are known to heavily affect the irrigation industry in Korea, this study distinguished the irrigation period (May through September) and non-irrigation period (October through April of the following year) accordingly. This study predicted short-term droughts since they are considered important due to the following reasons, especially in the study area: (1) the capacity inflow ratios of the reservoirs are close to 1 in many reservoirs in Korea, which makes the long-term planning and management challenging, and (2) the precipitation patterns in Korea are concentrated in the 3-month monsoonal period, which increases the need for short-term management based on short-term predictions even more important.

4.2.1 Short-term drought prediction

In terms of short-term drought prediction, the performance of the models was calculated from the SRI3 values. As in Table 4, the RMSE values of the prediction results mostly surpassed 0.5, except for the Sumjin basin indicating that short-term drought predictions are challenging. The RMSE values of all basins tended to increase especially during the irrigation period, in which the increased variability of precipitation hinders accurate predictions compared to the non-irrigation periods. Figure 4 shows the changes in RMSE values from the EDP model for short-term drought predictions across the basins as a heatmap. In Fig. 4, red colors indicate improvements from the EDP model (favorable outcomes), and the blue colors consequently indicate unfavorable outcomes. As a comprehensive result from Fig. 4, the two models that included the soil moisture information to update the conventional EDP model (EDP + S and EDP + AS models) resulted in an improvement in prediction accuracy measured in RMSE. Especially, the Chungju and Soyang basins (ranking first and third in terms of basin drainage area) showed the most significant improvements with 11.34%, and 13.16% increases in the EDP + S model, and 13.98%, 12.69% increases in the EDP + AS model, respectively. This indicates that in terms of short-term drought predictions, the inclusion of soil moisture is more effective in basins with larger drainage areas.

Table 4 Average RMSE values of one-month lead forecasts for short-term droughts

Full size table

In terms of probabilistic evaluations of short-term droughts, measured with RPSS and BSS values, the results are shown in Figs. 5 and 6. From Fig. 5, in which the RPSS values of the four EDP models are shown in terms of heatmap, it can be inferred that the predictions from all four models improve the forecast results using only climatological values. The improvements can be especially highlighted during the non-irrigation periods (Fig. 5c), in which the intertemporal variabilities of precipitation are relatively smaller than the other two periods. On the other hand, when the short-term drought predictions are measured with BSS with respect to D0 and D2 drought phases as in Fig. 6, all EDP models were not able to predict extreme droughts with accuracy, especially in Daecheong and Chungju basins. On average, the EDP + S model showed the most improvement from with a BSS score of 0.366, and the EDP + AS model followed the EDP + S model with a score of 0.353. This result implies that although extreme droughts are difficult to predict, the inclusion of soil moisture information during the predictions may enhance the prediction accuracy, especially during the non-irrigation periods.

4.2.2 Long-term drought prediction

Similarly from the short-term drought predictions, the long-term droughts were evaluated with the same metrics but with the SRI12 values. Table 5 and Fig. 7 show the results of the long-term drought predictions measured in terms of RMSE. Comparing the results from short-term droughts, predicting long-term droughts yielded more reliable accuracy. The primary reasons behind this outcome can be derived as the prominent trade-off relationship between volatility and prediction accuracy, as SRI12 is an accumulated index of summation of streamflow for 12 months. Furthermore, the prediction performance was more sensitive with respect to basins, rather than the models and the information included during model formulations. As in Fig. 7, the improvement of RMSE does not stand out as the short-term predictions, as the predictions from the conventional EDP model for long-term drought predictions yielded reliable outcomes in terms of RMSE.

Table 5 Average RMSE values of one-month lead forecasts for long-term droughts

Full size table

The probabilistic evaluation results of the long-term droughts are shown in Fig. 8 (RPSS) and 9 (BSS). In accordance with the results measured in terms of RMSE, long-term drought prediction evaluations yielded more reliable outcomes as well. As notable in Fig. 8c, Fig. 9c and f, the prediction of long-term droughts during the non-irrigation periods with smaller intertemporal variabilities resulted in the best prediction results in most of the basins. Among different EDP models, notable differences or improvements in terms of either RPSS or BSS were not observed. Thus, it can be concluded that the inclusion of additional information (i.e., either precipitation forecasts in the EDP + A models or soil moisture information in the EDP + S models) affects the prediction accuracy of short-term droughts more significantly. This implies that the applicability of the proposed methodology of various EDP frameworks is more suitable for short-term drought predictions.

5 Conclusion & discussions

This study developed various EDP models in order to perform probabilistic prediction of hydrological droughts using the key components of the conventional ESP framework. During the model formulation process, precipitation forecasts and soil moisture were included in the basic EDP model to analyze the effect of including additional hydrological variables in prediction accuracy. The proposed methodology was applied to eight basins located in Korea, all with more than 30 years of accumulated observation data. For evaluation, both deterministic and probabilistic metrics were utilized and the prediction results were analyzed under the cross-validation process with the observation data. The key findings from this study can be summarized as follows:

1.
The alternative models proposed in this study (EDP + A, EDP + S, EDP + AS) were proven to improve prediction accuracy, especially in terms of short-term drought predictions.
2.
The prediction of droughts with soil moisture (EDP + S and EDP + AS) notably improved in basins with larger drainage areas, which can lead to the necessity of the inclusion of soil moisture in basins with larger drainage areas.
3.
When predicting long-term droughts, the results yielded that predicting droughts with smaller intertemporal variabilities (i.e., non-irrigation periods) proved to be more effective.

Despite multiple notable findings and the improvements of the probabilistic drought prediction methodology proved in this study, few potential advances regarding both the methodological components and the data remain. First of all, expanding the spatial scope from local to a global scale is possible by using various ensemble forecasts from other agencies. For instance, using the datasets from the European Centre for Medium-Range Weather Forecasts and National Center for Atmospheric Research provide global forecasts for the streamflow ensemble and soil moisture will prove the applicability of the proposed framework to a global scale when appropriate downscaling and bias-correction procedures were precedented.

Another direction includes the use of robust machine learning techniques (i.e., long short-term memory models for streamflow prediction and Gradient Boosted Decision Trees for drought classifications) during the application process. Since many studies suggest the use of machine learning techniques for hydrologic predictions accompanied with physics-based hydrological models (Gurbuz et al. 2023), the framework proposed in this study holds promise when coupled with machine learning approaches.

References

Allen RG, Pereira LS, Raes D, Smith M (1998) Crop evapotranspiration-Guidelines for computing crop water requirements-FAO Irrigation and drainage paper 56. Fao, Rome 300(9):D05109
Google Scholar
Bae DH, Son KH, So JM (2017) Utilization of the Bayesian method to improve hydrological drought prediction accuracy. Water Resour Manage 31(11):3527–3541
Article Google Scholar
Branković Č, Palmer TN, Molteni F, Tibaldi S, Cubasch U (1990) Extended-range predictions with ECMWF models: Time-lagged ensemble forecasting. Q J R Meteorol Soc 116(494):867–912
Article Google Scholar
Day GN (1985) Extended streamflow forecasting using NWSRFS. J Water Resour Plan Manag 111(2):157–170
Article Google Scholar
DeChant CM, Moradkhani H (2011) Improving the characterization of initial condition for ensemble streamflow prediction using data assimilation. Hydrol Earth Syst Sci 15(11):3399–3410
Article Google Scholar
Gavahi K, Abbaszadeh P, Moradkhani H, Zhan X, Hain C (2020) Multivariate assimilation of remotely sensed soil moisture and evapotranspiration for drought monitoring. J Hydrometeorol 21(10):2293–2308
Article Google Scholar
Gurbuz F, Mudireddy A, Mantilla R, Xiao S (2023) Using a physics-based hydrological model and storm transposition to investigate machine-learning algorithms for streamflow prediction. J Hydrol 628:130504
Article Google Scholar
Harrigan S, Prudhomme C, Parry S, Smith K, Tanguy M (2018) Benchmarking ensemble streamflow prediction skill in the UK. Hydrol Earth Syst Sci 22(3):2023–2039
Article Google Scholar
Hwang Y, Clark MP, Rajagopalan B (2011) Use of daily precipitation uncertainties in streamflow simulation and forecast. Stoch Env Res Risk Assess 25(7):957–972
Article Google Scholar
Jafarzadegan K, Abed-Elmdoust A, Kerachian R (2014) A stochastic model for optimal operation of inter-basin water allocation systems: a case study. Stoch Env Res Risk Assess 28(6):1343–1358
Article Google Scholar
Kim Y-O, Eum HI, Lee EG, Ko IH (2007) Optimizing operational policies of a Korean multireservoir system using sampling stochastic dynamic programming with ensemble streamflow prediction. J Water Resour Plan Manag 133(1):4–14
Article Google Scholar
Kim GJ, Seo SB, Kim Y-O (2019) Elicitation of drought alternatives based on water policy council and the role of shared vision model. J Korea Water Res Ass 52(6):429–440
Google Scholar
Kim GJ, Kim Y-O, Reed PM (2021) Improving the robustness of reservoir operations with stochastic dynamic programming. J Water Resour Plan Manag 147(7):04021030
Article Google Scholar
Kim GJ, Seo SB, Kim Y-O (2022) Adaptive reservoir management by reforming the zone-based hedging rules against multi-year droughts. Water Resour Manage 36:3575–3590
Article Google Scholar
Kim DH (2020). Introduction to Probabilistic Drought Prediction to Korea. M.S. thesis, Seoul National University, Seoul, Republic of Korea.
Koster RD, Walker GK, Mahanama SP, Reichle RH (2014) Soil moisture initialization error and subgrid variability of precipitation in seasonal streamflow forecasting. J Hydrometeorol 15(1):69–88
Article Google Scholar
Li H, Luo L, Wood EF, Schaake J (2009) The role of initial conditions and forcing uncertainties in seasonal hydrologic forecasting. J Geophys Res Atmos 114:D04114
Google Scholar
Li B, Rodell M, Kumar S, Beaudoing HK, Getirana A, Zaitchik BF, ... Bettadpur S (2019) Global GRACE data assimilation for groundwater and drought monitoring: Advances and challenges. Water Resour Res 55(9):7564–7586
Article Google Scholar
Liu J, Yuan X, Zeng J, Jiao Y, Li Y, Zhong L, Yao L (2022) Ensemble streamflow forecasting over a cascade reservoir catchment with integrated hydrometeorological modeling and machine learning. Hydrol Earth Syst Sci 26(2):265–278
Article Google Scholar
Luo L, Wood EF, Pan M (2007) Bayesian merging of multiple climate model forecasts for seasonal hydrological predictions. J Geophys Res Atmos 112:D10102
Article Google Scholar
Madadgar S, Moradkhani H (2013) A Bayesian framework for probabilistic seasonal drought forecasting. J Hydrometeorol 14(6):1685–1705
Article Google Scholar
Madadgar S, Moradkhani H (2014) Spatio-temporal drought forecasting within Bayesian networks. J Hydrol 512:134–146
Article Google Scholar
McCabe GJ, Markstrom SL (2007) A monthly water-balance model driven by a graphical user interface. US Geological Survey Open-File report 2007-1088, pp 6
Google Scholar
Mekonnen MM, Hoekstra AY (2016) Four billion people facing severe water scarcity. Sci Adv 2(2):e1500323
Article Google Scholar
Mo KC, Lyon B (2015) Global meteorological drought prediction using the North American multi-model ensemble. J Hydrometeorol 16(3):1409–1424
Article Google Scholar
Mo KC, Chen LC, Shukla S, Bohn TJ, Lettenmaier DP (2012) Uncertainties in North American land data assimilation systems over the contiguous United States. J Hydrometeorol 13(3):996–1009
Article Google Scholar
Park SY, Sur C, Kim JS, Lee JH (2018) Evaluation of multi-sensor satellite data for monitoring different drought impacts. Stoch Env Res Risk Assess 32:2551–2563
Article Google Scholar
Saha S, Moorthi S, Wu X, Wang J, Nadiga S, Tripp P, ... & Becker E (2014). The NCEP climate forecast system version 2. Journal of climate, 27(6), 2185–2208.
Seager R, Naik N, Vecchi GA (2010) Thermodynamic and dynamic mechanisms for large-scale changes in the hydrological cycle in response to global warming. J Clim 23(17):4651–4668
Article Google Scholar
Seo SB, Kim Y-O (2018) Impact of spatial aggregation level of climate indicators on a national-level selection for representative climate change scenarios. Sustain 10(7):2409
Article Google Scholar
Seo SB, Kim Y-O, Kang SU, Chun GI (2019) Improvement in long-range streamflow forecasting accuracy using the Bayes’ theorem. Hydrol Res 50(2):616–632
Article Google Scholar
Shukla S, Wood AW (2008) Use of a standardized runoff index for characterizing hydrologic drought. Geophys Res Lett 35:L02405
Article Google Scholar
Shukla S, Lettenmaier DP (2011) Seasonal hydrologic prediction in the United States: understanding the role of initial hydrologic conditions and seasonal climate forecast skill. Hydrol Earth Syst Sci 15(11):3529–3538
Article Google Scholar
Son KH, Bae DH, Chung JS (2011) Drought analysis and assessment by using land surface model on South Korea. J Korea Water Res Ass 44(8):667–681
Article Google Scholar
Stedinger JR, Kim Y-O (2010) Probabilities for ensemble forecasts reflecting climate information. J Hydrol 391:9–23
Article Google Scholar
Sugawara M (1995) Tank model, Computer models of watershed hydrology. Singh VP Computer Models of Watershed Hydrology Colorado: Water Resources Publications. Highlands Ranch, pp 1130
Tongal H, Booij MJ (2017) Quantification of parametric uncertainty of ANN models with GLUE method for different streamflow dynamics. Stoch Env Res Risk Assess 31(4):993–1010
Article Google Scholar
Trenberth KE, Dai A, Van Der Schrier G, Jones PD, Barichivich J, Briffa KR, Sheffield J (2014) Global warming and changes in drought. Nat Clim Chang 4(1):17–22
Article Google Scholar
Tyralis H, Papacharalampous G, Langousis A (2021) Super ensemble learning for daily streamflow forecasting: Large-scale demonstration and comparison with multiple machine learning algorithms. Neural Comput Appl 33(8):3053–3068
Article Google Scholar
Weigel AP, Liniger MA, Appenzeller C (2007) The discrete Brier and ranked probability skill scores. Mon Weather Rev 135(1):118–124
Article Google Scholar
Wood AW, Lettenmaier DP (2008) An ensemble approach for attribution of hydrologic prediction uncertainty. Geophys Res Lett 35:L14401
Article Google Scholar
Yoon JH, Mo K, Wood EF (2012) Dynamic-model-based seasonal prediction of meteorological drought over the contiguous United States. J Hydrometeorol 13(2):463–482
Article Google Scholar
Yuan X, Wood EF, Liang M (2014) Integrating weather and climate prediction: Toward seamless hydrologic forecasting. Geophys Res Lett 41(16):5891–5896
Article Google Scholar
Yuan X, Ma F, Wang L, Zheng Z, Ma Z, Ye A, Peng S (2016) An experimental seasonal hydrological forecasting system over the Yellow River basin–Part 1: Understanding the role of initial hydrological conditions. Hydrol Earth Syst Sci 20(6):2437–2451
Article Google Scholar

Download references

Funding

Open Access funding enabled and organized by Seoul National University. This study received funding from the BK21 PLUS research program of the National Research Foundation of Korea. The authors also wish to thank the Institute of Construction Environmental Engineering (ICEE) at Seoul National University for providing research facilities for this work.

Author information

Gi Joo Kim and Dae Ho Kim contributed equally to this work.

Authors and Affiliations

Department of Civil & Environmental Engineering, Tufts University, Medford, MA, 02155, USA
Gi Joo Kim
60 Hertz Inc, Seoul, 04538, South Korea
Dae Ho Kim
Department of Civil & Environmental Engineering, Seoul National University, Seoul, 08826, South Korea
Young-Oh Kim

Authors

Gi Joo Kim
View author publications
You can also search for this author in PubMed Google Scholar
Dae Ho Kim
View author publications
You can also search for this author in PubMed Google Scholar
Young-Oh Kim
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Dae Ho Kim, and Young-Oh Kim contributed to the study conception and design. Material preparation, data collection and analysis were performed by Dae Ho Kim and Gi Joo Kim. The first draft of the manuscript was written by Gi Joo Kim and Dae Ho Kim. Gi Joo Kim and Young-Oh Kim commented on previous versions of the manuscript. Gi Joo Kim, Dae Ho Kim and Young-Oh Kim read and approved the final manuscript.

Corresponding author

Correspondence to Young-Oh Kim.

Ethics declarations

Competing interests

The authors have no conflicts of interest to declare that are relevant to the content of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kim, G.J., Kim, D.H. & Kim, YO. Improving the probabilistic drought prediction with soil moisture information under the ensemble streamflow prediction framework. Stoch Environ Res Risk Assess (2024). https://doi.org/10.1007/s00477-024-02710-6

Download citation

Accepted: 16 March 2024
Published: 04 April 2024
DOI: https://doi.org/10.1007/s00477-024-02710-6

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Improving the probabilistic drought prediction with soil moisture information under the ensemble streamflow prediction framework

Abstract

Similar content being viewed by others

Climate Change and Drought: a Perspective on Drought Indices

Climate Change and Drought: From Past to Future

Drought severity across Africa: a comparative analysis of multi-source precipitation datasets

1 Introduction