1. Introduction
Snow is an important indicator for global climate change, which has significant positive and noteworthy negative feedbacks on climate system, water resources and ecological environment [
1,
2,
3,
4]. On the positive side, due to the high surface albedo and large-scale cooling effect, snow cover (SC) participates in energy exchange between the atmosphere and Earth’s land, thus regulating regional and global climate. Snowmelt can provide water resources for ecosystem and participate in regional water cycles, which is beneficial for arid and semi-arid region. Meanwhile, snow water equivalent (SWE) is an important driving factor for meteorological, climate and hydrological models, which can enhance the understanding of regional water resources environment and improve the accuracy of a weather forecast. From a negative perspective, heavy snow, sustained low temperatures and prolonged SC will result in snow disasters that affect animal husbandry, and sudden excessive snow melting will also cause soil erosion and flood risk [
5].
As the most sensitive and active response to climate change, snow has become an important field of research. Currently, snow cover mapping using optical remote sensing has developed many mature datasets, such as Landsat [
6], Systeme Probatoire d’Observation de la Terre (SPOT) [
7], Advanced Very High Resolution Radiometer (AVHRR) [
8] and Moderate-resolution Imaging Spectroradiometer (MODIS) snow cover datasets [
9,
10,
11,
12,
13]. MODIS carried by Terra and Aqua satellites, is the most deeply studied and widely applied. The high spectral and spatial resolution characteristics of the sensor provide favorable conditions for the exploration and research of snow cover extraction. However, due to the similar reflectance spectrum characteristics of snow and cloud, optical remote-sensing monitoring of snow cover is greatly limited by weather conditions. Simultaneously, snow depth is hardly detected by optical sensors. Passive microwave can penetrate cloud layers for all-weather work, and achieve the parameter of snow depth through the surface based on the sensitivity of different snow frequency.
Passive microwave remote-sensing data are mainly used for monitoring snow depth and SWE at a global or hemispherical scale. To date, the existing space-borne passive microwave sensors are the Scanning Multichannel Microwave Radiometer (SMMR) on Nimbus-7, Special Sensor Microwave Imager (SSM/I) [
14] and Special Sensor Microwave Imager/Sounder (SSMI/S) on the Defense Meteorological Satellite Program (DMSP), the Advanced Microwave Scanning Radiometer-EOS (AMSR-E) on the Aqua satellite, the Advanced Microwave Scanning Radiometer 2 (AMSR-2) on the Global Change Observation Mission–Water (GCOM-W1) satellite, and the Microwave Radiation Imager (MWRI) on Fengyun-3 (FY-3) series satellites. Many methods for retrieving snow depth and SWE using passive microwave brightness temperature (BT) data have been proposed and developed [
15].
The widely used Chang’s algorithm [
16] uses radiation transfer model simulation results and ground station observation data for linear regression analysis, and finally obtains the linear relationship between snow depth and brightness temperature difference (BTD) of 18 GHz and 36 GHz horizontal polarization. Similarly, the spectral polarization difference (SPD) algorithm proposed by Aschbacher [
17] is based on BTD between horizontal and vertical polarization at 19 GHz and 37 GHz. Given the difference between snow characteristics in China and North America, Che [
18] revised the retrieval coefficients applicable to China based on Chang’s algorithm. Due to the influence of forest, water body, elevation and topography on snow attributes, many researchers introduced parameters such as forest coverage fraction and land-cover types to modify the coefficients of snow depth retrieval algorithm to improve the accuracy [
2,
19,
20]. Based on the abovementioned algorithms, many scholars have improved the algorithm by considering the influence of snow physical properties on microwave emission characteristics. The temperature gradient index (TGI) is used to describe the cumulative effect of temperature gradient on snow layer, and then calculate snow depth [
21]. However, the result shows that the method has high accuracy only in the North American plain. Kelly used Dense media radial transfer (DMRT) to simulate the change of snow particle size and density and integrates the influence of snow attribute changes on BT in the retrieval process [
22]. However, these algorithms have poor applicability. The radiative transfer model of snow cover can simulate the radiative transfer process of microwave in the snow layer and detect the parameters of snow [
23,
24]. However, these models simplify the radiative transfer of snow, and the detailed parameters in the models are difficult to obtain. The static factors that affect the distribution of snow and dynamic factors affecting the snow melting have been considered and calculated, and the retrieval model of snow depth has been improved continuously.
Machine-learning and deep-learning technology power many aspects of remote sensing: from target recognition [
25,
26,
27,
28,
29] to semantic segmentation [
30,
31] to spatial-temporal prediction [
32,
33], and they are increasingly present in snow parameter estimation and retrieval [
34,
35,
36]. Tedesco [
37] constructed a neural network to retrieve the snow depth and SWE, which showed the highest accuracy compared with the other four retrieval algorithms. Tabari [
38] achieved the similar conclusion by using a neural network to estimate snow depth and SWE in the Samsami Basin, Iran. Theoretically speaking, microwave BTD enhances with the increases of snow depth, but when the snow depth exceeds a certain threshold (50 cm) there will be a large error in the estimation results. The neural network could overcome various complex problems existing in large-scale retrievals, such as non-linear modeling, classification and association by learning and summarizing a large number of data and does not rely on the understanding of physical process when modeling. The support vector machine (SVM) is also widely used to solve the nonlinear problems in geosciences. Xue [
34] reveals that SVM is more sensitive than the neural network in estimating snow parameters. It has been found that the results based on the SVM algorithm have better accuracy no matter in deep or shallow snow cover, forest coverage area, snow accumulation and melting period. Xiao [
39] used the support vector regression (SVR) algorithm to establish a snow depth retrieval model based on different vegetation types and different snow periods, showing better accuracy and reducing "snow saturation effect". Machine learning and deep learning technology can describe the non-linear relationship between BTD and snow parameters, and overcome the limitations of a linear algorithm in different areas [
40,
41]. Although the machine learning technology has a wide range of applications and high accuracy, there is no detailed snow physical model involved in the retrieval process [
42], thus the interpretability of the results is poor.
The model of retrieving snow depth from microwave data has developed from simple linear regression to non-linear models considering complex meteorological and geographical factors. However, the snow depth with spatial resolution from 10 km to 25 km is not suitable for arid and semi-arid alpine region where the vertical drop is extremely significant. In practical application, snow depth mapping with higher spatial resolution is more useful for regional snow disaster evaluation and hydrological model establishment. Recent studies have demonstrated that higher snow depth mapping can be developed based on high-resolution geographic and meteorological data, together with downscaling microwave data. Wang developed a multifactor power snow depth downscaling model and greatly improved accuracy of snow depth retrieval [
43]. Walters [
44] describes and tests a linear model to downscale MOD10A1 fractional snow cover area (fSCA) (500 m) data to higher-resolution (30 m) spatially explicit binary snow cover area (SCA) estimates. Mhawej [
45] downscaled Advanced Microwave Scanning Radiometer – Earth Observing System snow water equivalent (AMSR_E SWE) datasets based on the cooperative observation of MODIS Terra-Aqua data. For the arid and semi-arid alpine regions such as Northern Xinjiang (NX) with complex topography and violent vertical variation, high-resolution snow depth mapping can more accurately describe the accumulation and melting of snow in the region, and provide precise input parameters for hydrological process and snow disaster early warning. Thus, in this manuscript, we propose a deep learning approach to downscale and evaluate the snow depth in NX.
This manuscript is organized as follows. After introducing the study area and data preprocessing in
Section 2, the methodology for snow identification and downscaling snow depth retrieval is expounded in
Section 3. Results of accuracy verification and spatial comparison will be presented in
Section 4, and more details will be discussed in
Section 5. Ultimately, a summary of the research will be described in
Section 6.
3. Methodology
Due to the different characteristics of an optical sensor and passive microwave sensor, MODIS data of the optical sensor and MWRI data of the passive microwave radiometer have advantages and limitations in snow parameter retrieval. Generally speaking, MODIS data has a high spatial resolution. The visible and shortwave infrared bands can be used to identify snow cover effectively. However, large-scale snowfall was often accompanied by cloudy and bad weather. The strong reflection of visible and near-infrared at the cloud top will shield the ground information, resulting in the lack of snow parameters under the cloud. Also, cirrus clouds have similar spectral characteristics to snow cover in the visible band, which often leads to the misjudgment of snow cover. MWRI data can penetrate through cloud and fog, not affected by cloud layer and other factors. MWRI data has the ability to observe the ground snow information all day long and penetrate the snow layer to detect the snow depth and SWE. However, there is a large deviation in regional scale snow recognition due to the coarse spatial resolution of MWRI data.
To realize the complementary advantages of the two types of data, improve the accuracy of snow identification, maxing the snow information, and obtain daily continuous snow monitoring data under all-weather conditions with and without clouds, it is imperative to effectively combine the two types of data to retrieve snow parameters.
A collaborative retrieval of snow parameters based on MODIS and MWRI data is designed in this manuscript, and the technical flow chart is shown in
Figure 3. The cloud will move rapidly with the wind, and its spatial position and coverage in the morning and afternoon will often change rapidly, while the snow coverage and the spatial position of the ground features are relatively fixed. Therefore, the complementary information and synthesis of the snow extent in the morning and afternoon can effectively remove the influence of cloud and restore the snow under the cloud layer. Meanwhile, the high-resolution information of the optical sensor is retained to the maximum extent. The maximum daily snow cover with 500 m resolution under all weather conditions is obtained by using optical and microwave data, high-resolution auxiliary data (DEM, Geodata, etc.) is added to the microwave data to map the snow depth. The concrete steps are as follows:
Step 1: Using the remote-sensing data MOD10A1 and MYD10A1, the daily snow cover MXD10A1 is extracted cooperatively, and the cloud interference is removed based on the dynamic of clouds;
Step 2: Extraction of snow cover under cloud-covered using MWRI data based on decision tree algorithm;
Step 3: The snow cover retrieved from MODIS and MWRI single sensor is combined to further remove the influence of cloud, MXD10A1 was adopted in cloud free conditions, while MWRI derived snow cover was adopted in cloud covered conditions, the complete snow cover information without cloud interference is thus combined and obtained;
Step 4: Preparation of input variables: Resampling and spatiotemporal matching of snow cover and auxiliary data;
Step 5: Ground observations are used as target labels to train the model and minimize errors.
3.1. Snow Cover Identification from MWRI
Nowadays, snow cover datasets widely used in investigations of climate, hydrology, and glaciology are likely derived from MODIS data, which cover the whole Earth at a near-half daily frequency through the cooperative observation of Terra and Aqua [
3,
4]. The combination of MOD10A1 and MYD10A1 can effectively reduce cloud interference and obtain more accurate snow cover distribution. However, the cloud layer limits the optical sensor to detect the ground snow information. In winter, the cloud layer will cover more than 50% of NX [
6]. Microwave data were used to retrieve snow cover distribution under clouds.
There is a large difference of polarization between snow cover and the bare surface, and the horizontal polarization is much smaller than the radiation BT of vertical polarization. With the increase of snow depth, the polarization effect of microwave radiation enhanced, the dual polarization BT of different frequencies tends to be consistent and there is free snow cover. In arid and semi-arid regions, water, frozen soil and cold desert will be misjudged as snow cover due to the similar microwave radiation characteristics. Snow cover should be accurately distinguished from other land covers before retrieving snow depth. The decision tree method has been widely used to distinguish snow from other scattering signatures [
7,
8,
9] and is applied in this study. The detailed algorithm flow is shown in
Figure 4.
3.2. Downscaling Snow Depth Retrieval Model
As stated in
Section 1, previous studies have focused on the linear relationship between snow depth and microwave BT to retrieve snow depth. However, a nonparametric statistical approach has greater potential to explore the non-linear relationship between snow depth and BTD in a complex ground environment. A neural network, as a classic non-parametric method, has been proved to be effective in the retrieval and estimation of geological parameters. In this study, a three hidden layer backpropagation neural network, with 20, 20 and 10 neurons in the 1st, 2nd and 3rd hidden layer was constructed by the Pytorch framework.
As illustrated in
Figure 5, the BTD, topographic data and meteorological data were resampled to the same spatial resolution of 500 m, and then the pixel values of specified longitude and latitude are extracted as input data to match the target task, that is, ground snow depth observation. The computation between neurons in each layer is illustrated in Formula (1):
where is the amount of layers,
represents that the variables belong to the
layer,
is the weight matrix and
is the bias of the layer; is the sigmoid function, a non-linear activation function, is used by the neurons in the third hidden layer, which can capture non-linear relationships between various inputs and snow depth. The backpropagation is a method that repeatedly adjusts weights to minimize the disparity between actual and expected outputs. Mean squared error (MSE) cost function and stochastic gradient descent (SGD) optimization function were used in the proposed deep neural network to minimize a sum of the mean squared error, and SGD is described in Formula (2):
where
is the cost function,
is the learning rate, set to 0.001,
and
are the weight matrix and bias vector in the layer
.
3.3. Model Evaluation and Accuracy Metrics
The measured snow depth from ground stations was used as the truth value to evaluate the performance of the downscaling model and the accuracy of retrieval results. The specific evaluation criteria included the mean coefficient of determination (R
2), root mean squared error (RMSE), positive mean error (PME), negative mean error (NME), average absolute error (MAE) and bias (BIAS). The calculation Formulas (3)–(8) of the five indexes are as follows:
In the aforementioned formulas, is the measured snow depth from ground station; is the retrieved snow depth based on microwave data; and are the number of samples whose measured snow depth are greater and less than the retrieved snow depth, respectively; and is the total number of tested samples.
4. Results
4.1. Corrections of Factors
The correlation between snow depth and BTD is described quantitatively in this section. Four typical stations (Altay: 88.08°E, 47.73°N; Tuscaloosa: 83.00°E, 46.73°N; Yining: 81.33°E, 43.95°N; UlanWusu: 85.82°E, 44.28°N) were chosen for factor evaluation. Altay station is located in the covered area of Altai Mountain vein. The mean temperature in cold winter is −23 °C, the mean snow depth in winter is 48 cm, and the annual snow cover days is more than 6 months with a 98.8% coverage rate. In winter, dry snow is dominated and well-distributed in Altay area. The underlying ground is farmland and alpine grassland. Tuscaloosa station is located in Tuscaloosa basin, with cold climate and mean snow depth of 32 cm in winter. As in Altay region, well-distributed dry snow dominates Tuscaloosa basin and the underlying surface is farmland or sparse vegetation, however, the terrain is flatter. The Ili River valley where Yining is located is a bell mouth terrain ringed on three sides by mountains. The water vapor from the Caspian Sea and the Black Sea accumulates here, making the region rich in precipitation and humid climate. The snow with wet feature dominates the hills and mountains, covering the underlying surface of farmland and vegetation heterogeneously. Ulan Wusu station is located in the Economic Zone on the north slope of Tianshan Mountain. The mean snow depth in this area is about 23 cm, and the underlying surface of snow is more complex, including farmland, shrub vegetation and shelterbelt. The above contents are the climate, topography and underlying surface characteristics of four typical stations in NX.
Four representative regions were selected to assess the correlation between BTD and snow depth. The BT polarization difference in Altay, Tuscaloosa, Yining and Ulan Wusu from 1 November 2014 to 31 March 2015, as well as the snow depth and ground surface temperature recorded by meteorological stations on the corresponding date. It can be seen that there are obvious changes in the polarization difference of MWRI BT corresponding to the snow cover, especially the brightness temperature difference between the low-frequency channel and the high-frequency channel (
Figure 6). The specific features are as follows: (1) BTD18h36h and BTD18v36v increased with the accumulation of new snow and the increase of snow depth; (2) BTD decreases with the increase of the ground surface temperature, even the result is negative. Therefore, the air temperature factor should be considered in the passive microwave snow depth retrieval model.
Detailed coefficient of determination between snow depth and BTD at four typical stations were described in
Table 2. Generally speaking, except for the BTD18h36h and BTD18v36v data of Yining station, the other BTD data have a high coefficient of determination with snow depth. Altay, where dry snow is evenly distributed, is the best place for satellite synchronous observation and ground verification in snow remote-sensing research. The coefficient of determination between BTD and snow depth shows the best result and reaches 0.4819. Moreover, the coefficient of determination between BTD18h36h, BTD18v36v and snow depth, which is widely used in the classical Chang algorithm, is as high as 0.6037 and 0.6243. The coefficient of determination at Yining station is relatively poor, which is related to the humid climate in the valley and the characteristic of wet snow. Only from the perspective of BTD, the mean R
2 of the six BTDs and snow depth at each station are relatively close, except that the poor coefficient of determination of BTD18v36v in Yining station reduces the mean value. Therefore, the six BTDs can be used to retrieve the snow depth in NX based on the contents described in the above figures and table.
The radiation BT of snow cover in the high-frequency channel decrease obviously. Thus, the BTD between high and low frequency channels is used as scattering index, its change and threshold value are used as a characteristic index in retrieval algorithm. In the actual snow depth microwave radiation modeling, the influence of surface roughness (high mountain, hilly, flat, etc.) and land cover parameters (glacier, vegetation, forest, desert, desert, etc.) should be considered. Moreover, microwave radiation can penetrate snow and detect radiation from the underlying surface, which indicates that the detection results are affected by the volume scattering caused by the difference of soil physical properties in vertical direction. Therefore, the natural environmental differences of snowfields must be considered comprehensively in snow depth retrieval.
4.2. Case Study of Spatial Distribution
The comparison of snow depth mapping was used to show the consistency of the spatial distribution among the downscaled snow depth and existing snow depth datasets, and try to present more detailed information, especially in alpine regions and river valleys. The snow depth mapping for 1 December 2014, as a typical example, is shown in
Figure 7.
The downscaling snow depth map (
Figure 7g), the corresponding WESTDC snow depth datasets (
Figure 7e), and the official ascending (
Figure 7a) and descending (
Figure 7c) snow depth datasets via Jiang’s [
63] algorithm for the same day are shown. Due to orbital scanning, both official ascending and descending snow depth maps have large strip gaps across NX. Data only based on a single orbit will lead to missing snow depth information. WESTDC snow depth map reflects the snow distribution covering the entire NX (
Figure 7e). However, the proposed downscaling approach can derive snowpack information and provide details on snow depth variations at 500 m spatial resolution. As shown in
Figure 7, on 1 December 2014, the snow cover was primarily distributed in Altai Mountains, Tianshan Mountains and Western Tianshan Mountains. There is also a few shallow snow distributed over Hami area in the eastern of NX and Tuscaloosa in the western of NX. Tianshan Mountains and the Altay area are the two high-value regions of snow depth and Tianshan Mountains were selected for detailed display. In
Figure 7h enlarged views, the snow depth distribution varies with the altitude of the mountains, showing obvious texture characteristics, that are consistent with the actual situation. WESTDC snow depth datasets, official ascending and descending snow depth datasets also described the regional snow depth information with coarse resolution, especially WESTDC snow depth datasets has good consistency with the snow depth map generated by the downscaling approach in spatial distribution. However, extensive heterogeneity of snow depth is only shown in the downscaled snow depth mapping.
4.3. Accuracy Validation
Snow depth observations from 37 national meteorological stations in NX (from 1 November 2014 to 31 March 2015) were selected to validate and evaluate the accuracy of downscaled snow depth and to compare the accuracy of MWRI_A, MWRI_D, WESTDC_SD. The detailed RMSE, PME, NME, MAE and BIAS are described in
Table 3. MWRI_D_SD shows the highest RMSE of 10.55 cm, highest NME of −11.16 cm, highest MAE of 6.13 cm, and highest BIAS of −6.28 cm, followed by MWRI_A_SD with an RMSE of 9.45 cm, PME of 4.42 cm, NME of −10.11 cm, MAE of 5.49 cm and BIAS of −3.73 cm. WESTDC_SD, a widely reported and approved snow depth dataset [
2,
15,
62], is more accurate than MWRI_A_SD and MWRI_D_SD in NX, with the lowest PME of 2.95 cm. PME values are lower than NME of all SD datasets. There are significant errors of the snow depth retrieved by microwave data compared with ground observations in NX, but the difference between PME and NME for downscaled SD is minimal. Most importantly, the downscaled snow depth has the best accuracy compared to the other three snow depth datasets, with the lowest RMSE of 8.16 cm, MAE of 4.73 cm and BIAS of −2.71 cm.
In the overall accuracy evaluation, downscaled snow depth is dominant compared to snow depth datasets, while detailed results based on snow comparison at different depths are described in
Table 4. Similar to the results in
Table 3, downscaled snow depth performs best at different snow depth levels, followed by WESTDC_SD and MWRI_A_SD, and MWRI_D_SD has the poorest accuracy performance.