Abstract

Most of the approaches to retrieve surface soil moisture (SSM) by optical and thermal infrared (TIR) spectroscopies are purposed to calculate various characteristic bands/indices and then to establish the regression relationship between them in combination with the measurement data. However, due to the combined impact of many factors, the regression relationship often shows nonlinearity. Moreover, the relationship between the single temporal image and the measured data are not transplantable in time and space, which makes it difficult to construct a more general model for the remote sensing (RS) estimation of SSM. In order to solve this problem, the back propagation (BP) neural network (NN) with an excellent nonlinear mapping ability is introduced to determine the relationship between the characteristic band/index and the measurement data. In the BPNN model, the optical and TIR RS data in different periods were taken as the input parameters, and the in situ soil moisture data were treated as the output parameter. There are 12 schemes designed to retrieve SSM. The key findings of study were as follows: (1) the BPNN model could retrieve SSM with a high accuracy that indicates the correlation coefficient between the estimated and measured soil moisture as 0.9001 and (2) the SSM retrieval model based on the BPNN can be applied to estimate the SSM with different spatial resolution values.

1. Introduction

As an important parameter in meteorology, hydrology, and agricultural science, soil water content (SWC) has a significant impact on land surface evaporation, vegetation photosynthesis, and water displacement [1, 2]. It acts as the hub connecting the groundwater-surface water-atmospheric water cycle system [3, 4]. Therefore, the accurate monitoring of soil moisture (SM) is essential for the prediction of climate change [5, 6]. Though direct measurement is the most accurate method to monitor SM, in situ measurement data are ineffective in showing the spatial distribution of SM, nor can it be used to monitor the spatial and temporal changes in SM on a large scale [7, 8].

The development of a remote sensing (RS) data algorithm provides an effective solution to the large-scale monitoring of surface soil moisture (SSM) [9, 10]. Various RS methods have been developed in different regions of the electromagnetic spectrum. These methods may be classified into three major groups: microwave (MW), thermal infrared (TIR), and optical [11, 12]. Microwave is advantageous due to its stronger penetration ability, longer wavelength, and the robustness to adverse weather conditions. Therefore, MW technology shows a greater potential in monitoring global SM dynamics [13]. However, due to the relatively rough resolution of SM products provided by Soil Moisture Active Passive (SMAP) mission and Soil Moisture and Ocean Salinity (SMOS) mission, they are not suitable for small-scale regional applications. Apart from that, a challenge presented by using MW to monitor SSM is the need to input various physical parameters, some of which are still not well defined at present [14]. For example, surface roughness, which is easily changeable, must be measured in the field, which is time-consuming.

Compared with MW SM products, optical/TIR RS data usually have a higher spatial resolution, which can well meet the needs of SM data in various fields [15]. Optical methods apply the visible, near infrared (NIR), and shortwave infrared (SWIR) data for SM retrieval. Many scholars have used different vegetation indices to monitor SM and dryness [16]. However, there is a certain lag in vegetation growth for the degree of surface drought. Some other studies used spectral feature space to construct SSM retrieval models. For example, the perpendicular drought index (PDI) [17] is derived from the red and near infrared (NIR) band reflectance triangular space. This method needs to determine the soil line artificially to realize the model. However, the distribution of soil lines highly depends on some parameters, such as soil minerals, iron oxide content, particle size distribution, and organic matter [18]. Although TIR methods are developed based on physical principles, the methods are usually empirical and depend on local meteorological conditions, such as humidity, temperature, and wind speed, and thus, the measurement results vary temporally and with land cover types [19, 20].

The combination of optical and TIR RS data mostly results in better accuracy for SM retrieval. To some extent, land surface temperature (LST) can be taken as an indicator of the change in SM and vegetation. Therefore, the retrieval model based on LST and vegetation index (VI) can be used to estimate SM [21]. The temperature vegetation dryness index (TVDI) is parameterized based on the LST-VI space [22]. Over the past two decades, this simple model and those improved models in relation to it have already been applied to SSM estimation [2326]. Despite this, the spatial distribution of LST-VI remains possible not to conform to triangle or trapezoid, thus affecting the accuracy of SM estimation. The spatial shape is treated as a triangle or trapezoid in the context of human intervention, which can lead to uncertainty in the calculation results.

A major method of estimating SWC based on optical and thermal infrared data is to retrieve the SM index or drought index using satellite data [2730], so as to establish the regression relationship between the index and the actual SWC using the measurement data [31]. However, the variation of SWC can be affected by various factors (e.g., soil texture, surface roughness, topography, temperature, and vegetation coverage), which leads to the nonlinear regression relationship. Moreover, the relationship between the single temporal image and the measurement data is not transferable in time and space, which makes it challenging to establish a more general model of RS estimation for SM.

Compared with those traditional methods, the neural network (NN) model demonstrates various advantages, including the flexibility in combining information from different sources and mapping the input-output relationship of any data without needing an empirical formula [32]. The statistical relationship between satellite observation and SSM can be simulated on the basis of satellite data and measured SM estimation without explicitly establishing any physical connection. The excellent nonlinear mapping ability of the NN model makes it capable to minimize uncertainty and improve the accuracy of SM retrieval from RS data [3335].

Some scholars have constructed an NN model based on microwave data for estimating soil moisture [36, 37]. There is also a synergistic use of microwave and optical data to retrieve soil moisture using NN models [38, 39]. In these studies, NN models estimated soil moisture with a high degree of accuracy. Some studies have used the normalized difference vegetation index calculated by optical data and land surface temperature calculated from thermal infrared data as parameters into the neural network model to estimate soil moisture [40]. Optical and thermal infrared data have an exceptional potential for estimating soil moisture. However, there are few studies on SM estimation using optical and TIR data with the NN model.

The primary purpose of this paper is to build a highly universal SM retrieval model, which can overcome the influence of soil texture, vegetation cover, terrain, and other factors. As one variety of the ANN, the back propagation (BP) neural network (NN) is applied to construct the SM prediction model. The characteristic band/index with high resolution and multitemporal data is taken as the input data, while the measured SM is treated as the output data. Finally, the multiple spatial and temporal Landsat-8 and MODIS data were used to estimate SSM. The model constructed in this paper is universal and can be applied to estimate SM in both bare land and vegetated areas.

2. Study Area and Data Source

2.1. Study Area

As shown in Figure 1, the study area is located in the middle of the Tibetan Plateau. The map of China is downloaded from website https://bzdt.ch.mnr.gov.cn/index.html. The climate in the Naqu area is relatively warm from May to September each year, and this period refers to the season of vigorous vegetation growth. During this period, the land cover is dominated by plateau meadow. The other period includes the snow and soil freezing period, during which most areas are covered by snow and frozen soil. As impacted by low biomass, air quality, and humidity, the MW signal attenuation is weak, so this area acts as an ideal place to verify satellite and model simulation of SM products [41].

2.2. In Situ Measurements

The measured SM data used in this study originated from the SM observation network in the Naqu area. There were 57 stations available in the network, with SM and temperature data at four observation depths (i.e., 5, 10, 20, and 40 cm), and the sampling interval was 30 min. The mentioned measurements provided considerable valuable experimental data for SM retrieval in the Tibetan Plateau [42]. In the present study, the measured SM data from 2014 to 2016 were collected to assess the performance of the SSM retrieval model. The dataset is provided by the National Tibetan Plateau Data Center.

2.3. Remote Sensing Data

Landsat 8 carries Operational Land Imager (OLI) and Thermal Infrared Sensor (TIRS). The OLI provides nine optical bands, and the TIRS provides two thermal bands. Twelve Landsat-8 images from 17 May, 24 May, 18 June, and 27 July, 2014; 20 May, 23 July, 30 July, 31 August, 9 September, and 16 September, 2015; 25 July and 17 August, 2016, were acquired.

3. Research Method

By using the BPNN, the coastal, blue, green, red, NIR, shortwave infrared (SWIR) band, and LST were taken as the input data, the measured soil volumetric moisture was taken as the output data, and the SM retrieval model is established. The LST data were calculated using the TIRS1 and TIRS2 bands of Landsat-8. In addition, the observed SM data are used to test the results of model estimation and evaluate retrieval accuracy.

3.1. BPNN Model

The BPNN refers to a type of the multilayer feed-forward network with a hidden layer. In the practical application of ANNs, the BP network has been extensively used in data compression, pattern recognition, as well as function approximation. The BP network consists of an input layer, hidden layer, and output layer. The BPNN comprises an input layer, a hidden layer, and an output layer. The structure with a single hidden layer of the BPNN is shown in Figure 2. All neurons were configured according to the model shown in Figure 3.

The training operation of the BPNN consists of signal forward propagation and error back propagation. Forward propagation means that the input signal was transmitted from the input layer via the hidden layer to the output layer. If the output layer yields the desired output (the actual output is consistent with the expected output), the learning algorithm ends; otherwise, it turns to back propagation. Back propagation aims to reduce the error by calculating the error according to the original connection path direction and adjusting the weight and threshold of each layer node based on the gradient descent method. After repeated learning and training, the network parameters (weight and threshold) are determined for the minimum error and the training is terminated. At this point, the trained NN is capable to process the input information of similar samples and to output the nonlinear converted information with the minimum error.

3.2. Input Parameter Scheme

The approach to monitoring SM with optical data is mainly based on the spatial shape of pixel distribution in Landsat-8 bands to estimate SM. The SWIR 1 and SWIR 2 bands of Landsat-8 data (1650 nm corresponding to band 6, 2210 nm corresponding to band 7) are sensitive to SWC, especially at 2210 nm [43]. In this study, the SWIR 2 is taken as the input parameters of the SSM estimation model.

There are twelve schemes that were constructed to analyze the effect of various parameters on the accuracy of SSM estimation (Table 1). Scheme 1 sets all optical bands as input parameters. Based on the formula expressing SM in Tasselled Cap transformation, the band used in the formula was taken as the input parameter of scheme 2. According to the previous models applied to calculate SM through optical data, it can be found out that there were only a small number of bands in red, NIR, and SWIR bands as used in these models, so that schemes 3–6 were designed. On the basis of schemes 1–6, the LST data were introduced, respectively, with schemes 7–12 set.

4. Results

4.1. SSM Estimation Results and Validation

In the experiment, a total of 252 sample data were extracted from the study area. In addition, the BPNN was set below: the training sample was 189 and the test sample was 63. In order to ensure the stability of the BPNN model, we trained the BPNN model 100 times to obtain the correlation coefficient of estimating and measuring soil moisture. Finally, the average value of 100 training results is selected as the estimation accuracy of the model. The in situ measured soil volumetric moisture was used as a reference to calculate the coefficient ®, root mean square error (RMSE), unbiased root mean square error (ubRMSE), and bias (Bias) of the estimated SSM. Figure 4 shows the scatter distribution of soil volumetric moisture estimated and measured by 12 schemes. Table 2 shows the accuracy of the estimated SSM in each scheme.

With only the optical bands as the input parameters of the SM estimation model (schemes 1–6), the estimated SM shows correlation with the measured value to a certain extent, and the correlation coefficient reaches up to 0.8682, which renders the accuracy of retrieval poor. In the scheme of estimating SM with optical data, all bands were treated as input to obtain the most accurate SM (scheme 1). In scheme 2, the r between the estimated and measured SM is 0.8163. After removing the coastal band, the estimation accuracy of SM does not change significantly. As for the input, only the red, NIR, and SWIR bands are retained (scheme 3). Whereas, the estimation accuracy of SM remains comparable to that of scheme 1, indicating that the red, NIR, and SWIR bands play a major role in SM estimation.

In schemes 7–12, the input parameters were set as optical bands and LST, and the estimated SSM was closer to the observation. The SM estimated by scheme 7 had the highest accuracy. It showed a RMSE of 0.0017 and r of 0.9001. The normalized difference vegetation index (NDVI) and normalized difference water index (NDWI) were calculated by red, NIR, and SWIR bands. According to the LST-VI SM estimation model, schemes 10–11 of red, NIR, SWIR, and LST are designed as input parameters. Scheme 11 shows a higher estimation accuracy of SSM than that of scheme 10, which suggests that NDWI is more suitable to represent VI for SSM estimation than NDVI. With red, SWIR band, and LST as input parameters (scheme 12), the SM estimation accuracy was higher than in schemes 10 and 11.

When using the BPNN model to retrieve soil moisture, the soil moisture with the highest accuracy can be estimated by selecting scheme 7. In case of lack of TIR data, scheme 1 is preferred to retrieve soil moisture.

4.2. Land Surface Feature Influence on SSM Estimation Results

Zhang et al. [44] used the random forest model, which had land surface feature parameters and in situ soil moisture data to predict SSM. It would be good to test the same experiment on a different environment to check the consistency of the results. To analyse the influence of land surface feature factors on SSM estimation results, the elevation, slope, aspect, and vegetation coverage (normalized difference vegetation index, NDVI) factors were added into BPNN model. The soil texture of all observation stations in the study area is the same type, so this factor was not considered. On the basis of scheme 7, each influence factor was added, respectively. The estimated soil moisture were grouped according to the value of the influence factors. Also, the r between each group of data and the corresponding measured soil moisture were discussed (Table 3).

With the increase of elevation and slope, the r between estimated and measured SSM gradually decreases. The r of the north aspect was obviously lower than that of other aspects. The terrain of Naqu study area fluctuates violently, and it was easy to have shadow in areas with large elevation, steep slope, and north aspect. In the shadow areas, the surface reflectance is very low, and the satellite data in this area will greatly reduce the accuracy of estimated SSM. It is worth noting that the mountain shadow in the southern hemisphere appears on the south aspect. In areas with different NDVI values, the r values of estimated and measured SSM were similar. Vegetation coverage will not significantly affect SSM estimated by the BPNN model.

BPNN is not affected by the underlying surface condition and can be applied to areas with different vegetation coverage. However, when using optical satellite data as input parameters, the BPNN model is more suitable for the area with flat terrain and gentle slope. Also, BPNN cannot’ construct the empirical formula to express the relationship between soil moisture and various parameters.

4.3. Application of the SSM Estimation Model

Since the grid size of Landsat data was 30 m × 30 m, the small-scale observation network of the Naqu SM observation network was taken as the retrieval area of SSM. Figure 5 shows the surface soil moisture map retrieved from scheme 7 on May 20, 2015. The soil volumetric moisture of the 9 stations estimated in the area ranged from 0.122 to 0.357 cm3/cm3 in accordance with the in situ data (0.145–0.395 cm3/cm3). The scatter plot between measured and estimated soil volumetric moisture is shown in Figure 5©. It showed a RMSE of 0.0429 and r of 0.8392. The result indicates that calibration of the BPNN model with in situ data generally leads to reasonable SM estimates.

5. Discussion

The PDI and TVDI models were used to retrieve soil moisture, respectively. The PDI and TVDI models were used to estimate SSM in the same area of Figure 5. The red and NIR bands shall be applied to calculate PDI (corresponding to scheme 4), while the red, NIR, and LST shall be applied to calculate TVDI (corresponding to scheme 10). The scatter plot of SM estimated and measured by PDI and TVDI is shown in Figure 6. The r between the estimated and measured SM are 0.4786 and 0.5047, respectively. Comparing the SSM calculated by PDI and TVDI, the accuracy of SM calculated by the BPNN has significantly improved.

We also brought MODIS data into the BPNN model to estimate soil moisture. There is a slight difference between the band range of MODIS and Landsat-8 data (Table 4), and there is no coastal band in MODIS data. The LST data were obtained from MOD11A1 products. There are ten schemes (schemes 2–6 and schemes 7–12 in Table 1) to estimate SM using the BPNN.

We collected 742 MODIS data, 594 data were used as training samples, and the remaining 148 data were used as test samples. The retrieved SSM had the highest accuracy when the input parameters were green, blue, red, NIR, SWIR band, and LST data. The scatter plot between measured and estimated soil volumetric moisture is shown in Figure 7. It showed a RMSE of 0.0573 and r of 0.8164. The SSM estimation accuracy is similar to Landsat-8 data (scheme 8).

It is worth noting that the more the input sample data, the higher the accuracy of SSM retrieved by the BPNN. However, this method is not applicable in cloud and shadow-covered areas. Since the input parameters in the model are all derived from RS raster data, the images with different spatial resolutions can be used for the SSM estimation model based on the BPNN. Sentinel-2 is regarded as another ideal RS data due to its high temporal and spatial resolution.

The BPNN model is not limited by RS data. The RS images such as Landsat-8, MODIS, and Sentinel-2 can be used as input parameters to retrieve soil moisture. Since the BPNN model directly uses the surface spectral reflectance data to retrieve soil moisture, the model is not limited by regional conditions. The BPNN model can simultaneously retrieve soil moisture of bare soil and vegetated areas. However, when optical and TIR data are input into the NN model, it is necessary to exclude raster data that are obscured by clouds or affected by the atmosphere.

6. Conclusion

Combined with optical-thermal infrared RS data and measured soil volumetric moisture data, a retrieval model of SSM is established on the basis of the BPNN. It can be found that the BPNN is suitable for establishing the relationship between the characteristic band/index and the measurement data, thus improving the accuracy of the SM estimation model. However, the accuracy of SSM estimated only by optical data is poor. By introducing TIR data, the retrieval accuracy of SSM can be improved significantly. It is worth noting that when the optical satellite data are used as the input parameters, the BPNN model is more suitable for the area with flat terrain and gentle slope.

When the quality of RS image varies, it is possible that the spatial distribution of the LST-VI model is unable to form a complete triangle or trapezoid, thus resulting in a significant difference between the estimated SM and the measured value. The model proposed in this study addresses the abovementioned problems while estimating SSM in case of a small sample size. The BPNN model provides a powerful tool for estimating SSM in complex/heterogeneous media (e.g., rough vegetated surfaces). In this paper, the soil moisture retrieved by Landsat-8 and MODIS data has high accuracy. The SSM retrieval model based on the BPNN can be applied to estimate the SSM with different spatial resolution.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

The authors would like to thank for the soil moisture observation data provided by National Tibetan Plateau Data Center and the Landsat-8 data provided by the USGS.