There are three major components in the retrieval algorithm development in this study, including the retrieval scheme design, improvement of cloud screening, and bias-calibration scheme development. The retrieval is based on inter-satellite calibrated HIRS longwave channel data. Details on the data used in the study and each of the development components are described below.
2.1. Data
The temperature profiles are derived from HIRS longwave channel measurements. Channel brightness temperatures are limb-corrected using a linear multivariate regression algorithm based on multiple HIRS channels [
11]. Due to the difference in individual HIRS instrumentss channel spectral response functions, along with other factors, there are differences in observations from different satellites. Inter-satellite calibrated HIRS measurements provide an essential dataset for climate studies. Simultaneous nadir overpass (SNO) observations are used to obtain inter-satellite differences between overlapping satellite pairs [
12,
13,
14]. Measurement over the equatorial Western Pacific region is used to assess inter-satellite differences in the warm range of observations. For the majority of the HIRS longwave channels, the inter-satellite differences vary with channel radiances [
14]. Inter-satellite differences of these channels are derived from overlapping satellites as a function of brightness temperatures. These values are then applied to individual satellite’s HIRS channel measurements to inter-calibrate data to the reference satellite, for which Metop-A is designated.
For channel 4, the values of inter-satellite differences do not correlate with the values of scene brightness temperatures. When there is a difference in channel weighting functions of different satellites, the channels essentially sense temperatures in different heights. For most of the channels, the scene temperatures are correlated with the lapse rate and therefore the inter-satellite differences are also correlated with scene temperatures. However, channel 4 senses temperatures in the upper troposphere near the tropopause. The inter-satellite difference is largely dependent on the heights of the weighting function peaks in relation to the tropopause. It is found that instead of being correlated to the scene temperature, the inter-satellite difference of channel 4 is highly correlated with the temperature lapse rate represented by the differences of vertically adjacent HIRS sounding channels [
15]. Therefore, the inter-satellite calibration of channel 4 uses a method different from other longwave channels. Using SNO observations, calibrations based on linear regression are developed between channel 4 inter-satellite differences and the lapse rate factors below and above channel 4 (represented by the sounding channel brightness temperature difference of channels 5 and 4, and between the difference of channels 4 and 3, respectively). Using the regressions, measurements of channel 4 are calibrated to the same reference satellite, Metop-A.
GPS RO measurements from Constellation Observing System for Meteorology Ionosphere and Climate (COSMIC) and radiosonde observations are incorporated as part of the retrieval process. The re-processed version of GPS RO derived profiles, COSMIC2013, is obtained from University Corporation for Atmospheric Research COSMIC Data Analysis and Archive Center (CDAAC) (
http://cosmicio.cosmic.ucar.edu/cdaac/index.html) to use in the project. Various studies have shown that GPS RO missions provide a unique opportunity to measure stratospheric temperatures in high accuracy [
16,
17,
18]. The GPS RO data does not contain mission-dependent biases. This makes them a good candidate as a climate benchmark [
19]. The GPS RO dataset is incorporated to calibrate the HIRS stratospheric temperature retrievals.
However, the GPS RO derivation of temperature and humidity for the troposphere relies on the reanalysis data as input and, therefore, the derived profiles are not considered direct observations. One alternative is to incorporate radiosonde observations that are homogeneous both spatially and temporally. Though using radiosonde observations may bring some biases toward profiles over land surfaces, options are very limited for other choices. None of other data sources has a more extensive global dataset of direct observations of temperature and humidity in the troposphere. There are hundreds of radiosonde stations globally with different types of radiosonde systems. The accuracy of radiosonde temperature and humidity measurements can vary significantly for different sensor types [
20]. To serve as a calibration database for remotely-sensed data, it is necessary that the selected radiosonde measurements have self-consistency with good quality. Among radiosonde types, measurements from RS92 radiosondes have been shown to provide homogeneous observations in global locations [
21] and, thus, they are used in this study for the bias calibration process of the global tropospheric temperature and humidity retrievals.
2.2. Carbon Dioxide Effect
Observations at Mauna Loa show significant rise of CO
2 in the past several decades [
22]. Since HIRS observation began in 1978, the atmospheric CO
2 concentration has increased from below 335 ppmv to over 400 ppmv. To examine the CO
2 effect, a radiative transfer model, RTTOV [
23], is used to simulate HIRS brightness temperatures in an increasing CO
2 environment. The use of a model facilitates the separation of CO
2 effect from any other changes. A diverse sample of global profiles analyzed by the European Center for Medium-range Weather Forecasts (ECMWF) system [
24] is used to represent the atmospheric conditions. For the model simulations, the input profiles are kept unchanged except for the CO
2 concentrations, and RTTOV is run for the CO
2 concentrations of 330 and 410 ppmv, respectively. The averaged brightness temperature of the global results for each HIRS channel is calculated. The differences of channel brightness temperatures between the two CO
2 concentration values are displayed in
Figure 1.
The weighting functions of HIRS CO
2 absorption channels vary with the amount of CO
2. With the increase in CO
2, the atmosphere becomes more opaque to the HIRS CO
2 sounding channels, therefore the peaks of these channels’ weighting functions go up. As the mean temperature lapse rate in the troposphere decreases with height and,
vice versa, in the stratosphere, there is a negative and positive impact in the troposphere and stratosphere, respectively. For easy reference, the absorbents of HIRS longwave channels and the approximate broad layers where the channels sense are listed in
Table 1. The simulation shows that if CO
2 concentration were the only change in the atmosphere, when CO
2 increases from 330 ppmv to 410 ppmv, the measurements from HIRS would appear to decrease in channels 4–7. The largest impact occurs in channels 5 and 6, with a value as large as −2.0 K. The decreases of brightness temperatures for channels 4, and 7 are −1.6 and −1.0 K, respectively. For channels sensing the stratosphere (channels 1 and 2), the measurements from HIRS observation would appear larger in an increased CO
2 environment. The increases in these two channels are 0.8 and 0.6 K, respectively. Channel 3 senses temperatures in the lower stratosphere near the tropopause where the lapse rate is small and, thus, there is not much effect from the CO
2 change. There are no impacts on channels 8–12 because these are window channel, ozone channel, and water vapor channels.
Figure 1 shows that there are large effects from CO
2 increase on several temperature sounding channels. If these effects are not considered in the temperature profile retrieval, it can lead to an underestimation of tropospheric temperatures and an overestimation of stratospheric temperatures in the HIRS measurements in an increasing CO
2 environment.
2.3. Retrieval
The analysis above shows that it is a requirement for the retrieval algorithm to be able to account for CO
2 effect on HIRS observations. This can be accomplished by the use of a radiative transfer model. As the RTTOV model is developed to simulate satellite sounder measurements with HIRS as one of the main sounders in the design, the model is chosen in the present study as a tool to build a retrieval training dataset. Selections of the ECMWF sampled profiles [
24] provide the input database to RTTOV. The profile dataset was formed by carefully analyzing reanalysis data to extract a subset that has a global representation. Millions of global profiles from two years of the ECMWF analysis fields were divided into seven groups according to total precipitable water vapor contents at the interval of 0.5 kg·m
−2. About the same number of samples from each group was extracted, except for the group with the smallest total precipitable water vapor content, where twice as many profiles were extracted. Only the clear-sky profiles from the ECMWF sample profiles are used for the present study, which are comprised of 6891 profiles covering all latitudes and longitudes. The HIRS channel brightness temperatures for the reference satellite of the inter-calibrated dataset, Metop-A, are simulated by the radiative transfer model RTTOV.
Tests are carried out with the use of RTTOV on the sensitivity of surface emissivity errors on retrievals. Results show that surface emissivity has a significant impact on the surface and near-surface temperatures. A variation of 0.2 in emissivity can have an impact of 1.1 °C on the surface skin temperature and 0.7 °C on surface air temperature. Therefore, surface emissivity is included as an input for the retrieval of surface and near-surface variables. The impact is reduced to less than 0.3 °C at levels 850 hPa and up. The impact is small on specific humidity. With an emissivity variation of 0.2, the impact is 0.17 g/kg near the surface and smaller toward upper levels.
Among the HIRS channels, the input to the upper air temperature retrievals consists of channels 2–12, and for the upper air specific humidity retrievals the input channels include the tropospheric channels 4–8 and 10–12. For the surface retrievals, only channels with sensitivity to the surface are used, which include channels 7, 8, and 10. Monthly mean values of CO
2 observation from Mauna Loa are included in the input. The surface emissivity values are taken from the ISCCP dataset [
7]. Outputs from the three neural networks are temperature profiles, including surface skin temperature, air temperature at the reference height of 2 m, and temperature at standard pressure levels from 1000 or the lowest pressure level above the surface to 50 hPa, and humidity profiles including specific humidity at 2 m and at standard pressures from 1000 hPa or the lowest pressure level above the surface to 300 hPa.
As atmospheric profiles over high surface elevations can differ significantly from those over low elevations, the clear-sky training dataset is divided into three groups according to the surface elevation in terms of surface pressure (Ps) calculated using the hydrostatic equation. The three groups are defined as Ps < 700 hPa, 700 hPa ≤ Ps < 850 hPa, and Ps ≥ 850 hPa.
Figure 2 shows a global map of where the groups are located. The vast majority of areas are in the group with Ps > 850 hPa. From each surface elevation group, collocated data are randomly divided into three sub-groups at 60%, 20% and 20%. The 60% sub-group is assigned to be the training dataset. A testing dataset is formed by one 20% sub-group to use during iterations of neural network development. The remaining 20% is set aside for assessing the performance of the retrieval.
A neural network approach is applied to connect temperature and humidity profiles with the HIRS longwave channel measurements. The use of the neural network technique enables the establishment of the non-linear relationship between the retrieved variables and channel radiance observations. Running retrievals with a neural network scheme is fast. Processing 35 years of HIRS retrievals using a Linux computer platform that has a moderate specification takes about a week to complete, and processing in parallel reduces this even further.
Different neural networks are built for the retrievals of the upper air temperature, upper air specific humidity, and surface temperature and humidity. Neural networks have been used in past studies to retrieve atmospheric temperature and humidity along with other variables. For example, a three-layer backpropagation neural network was applied to derive atmospheric temperature profiles from AMSU-A measurement [
25]. The retrieved variables included surface air temperature, temperatures at 26 pressure levels from 1000 to 10 hPa, and the tropopause temperature, height, and pressure over both land and ocean surfaces. Multi-layer backpropagation approaches were used to derive water vapor, cloud liquid water path, surface temperature, and emissivity over land from satellite microwave observations [
26] and derive temperature, water vapor, and ozone profiles from IASI observations [
27]. Three-layer feed-forward neural networks were also used to retrieve near-surface atmospheric variables over the ocean surfaces including SST, Ta, Qa, and wind speed [
28]. In the present study, three-layer backpropagation networks, with one input layer, one hidden layer, and one output layer, are constructed for the retrievals of temperature and humidity.
As described in the previous AMSU-A retrieval study [
25], in a backpropagation network, each layer is fully connected to the layers below and above. When the network is given an input, the updating of activation values propagates forward from the input layer through the internal layer to the output layer. Each neuron in the output layer produces an output, which is compared to the target output defined in the training dataset. An error value is calculated for each neuron in the output layer. The network corrects its parameters to lessen the errors. The correction mechanism starts with the output neurons and propagates backward through the internal layer to the input layer. The iteration continues until pre-set convergence criteria are met.
The converged neural network parameters are applied to the data that were set aside and were not used in the construction of the neural network to evaluate the network performance. The RMSEs of temperature and specific humidity are calculated assuming the outputs in the validation database as truth data. For temperature, the RMSEs are approximately 2.7 °C for the temperature at the lowest standard pressure levels, 1.3–1.5 °C in the mid-troposphere, and 1.5–2.6 °C around the tropopause and in the lower stratosphere. For the specific humidity, the RMSE is 2.0 g/kg at 1000 hPa, and it steadily decreases to 1.1 g/kg at 700 hPa, and less than 0.4 g/kg above 500 hPa. The RMSEs for temperature and humidity profiles at standard pressure levels calculated from data not used during network calibration are plotted in
Figure 3. These error estimates represent the uncertainty error structures from the model simulation and the neural network scheme.
One of the error sources may come from channel observation noises. The NOAA Polar Orbiter Data User’s Guide [
29] and NOAA KLM User’s Guide [
30] show that values of the noise equivalent differential radiance (NEΔN) for HIRS longwave channels are in the range of 0.10–0.65 mW/(m
2·sr·cm
−1) for channels 2–12, and 3.00 mW/(m
2·sr·cm
−1) for channel 1. Channel 1 is not used in the retrieval scheme because of the concern on the high NEΔN. Using the developed neural networks, a sensitivity test is performed by adding noise to each of channels 2–12 in the training dataset, one channel at a time, to examine how much difference it produces in the retrieval results. The result for the pressure level that corresponds to the mean weighting function peak of an individual channel from an added noise value of 0.5 K is displayed in
Table 2 and
Table 3. The channel NEΔN values are included in the tables to provide typical values of channel noises. For the temperature sounding channels, NEΔN values of 0.10–0.65 mW/(m
2·sr·cm
−1) approximately correspond to brightness temperature noises of 0.1–0.5 K. Different noise values are tested. A value of −0.5 K produces about the same impact as that from a value of 0.5 K, but with the opposite sign. Halving the noise value approximately halves the impact.
2.4. Cloud Screening
Retrieving atmospheric temperature and humidity profiles from an infrared instrument requires clear-sky conditions. When clouds are present, an infrared sounder, such as HIRS, can only sense to the top of clouds. The development of temperature and humidity profile retrievals is, therefore, based on clear-sky HIRS pixels. The clearing of cloudy pixels employs a two-tiered approach. The first cloud filtering is based on a simplified cloud detection procedure [
31] as in ISCCP [
32]. Cloudy pixels are first identified by comparisons of brightness temperature differences both spatially and temporally, among neighboring pixels in days before and after. Though the majority of clouds are cleared by this approach, a small portion of cloudy pixels having small spatial and temporal variations can be misclassified as clear pixels in the process. Often these are semi-permanent stratiform clouds. A second approach using cloud products derived from the Advanced Very High Resolution Radiometer (AVHRR) on board the same satellites, is added to further screen the clear-sky pixels identified in the first approach.
The cloud products are part of the AVHRR Pathfinder Atmospheres-Extended (PATMOS-x) CDR dataset acquired from NOAA’s National Centers for Environmental Information [
33]. PATMOS-x generates mapped products with a spatial resolution of 0.1 degrees on a global latitude-longitude grid. Two products, the cloud fraction and cloud probability, from the PATMOS-x dataset are chosen to screen the likelihood of cloud contamination in the HIRS pixels identified in the first step.
An optimization scheme is used to find the optimal thresholds for cloud fraction and cloud probability to identify HIRS pixels that have high likelihood of being cloudy and, therefore, should not be used to derive clear-sky profiles. The HIRS temperature retrievals are compared to co-located RS92 observations in the lower atmosphere at 850 hPa. The optimization scheme finds the thresholds for cloud fraction and cloud probability that result in maximum correlation between HIRS retrievals and RS92 observations and minimum standard deviation of their differences, with the maximum amount of HIRS data retained. The optimal thresholds are found to be 0.5812 for cloud fraction and 0.9638 for cloud probability. The HIRS pixels having either higher values of cloud fraction or cloud probability as determined from PATMOS-x products are considered likely cloud-contaminated and should not be used, as is indicated with associated quality flags. Associated with each observation in the final dataset, a quality flag value is set indicating: (0) clear, (1) possibility of partially cloudiness, (2) likely cloudy, and (3) no cloud fraction/probability information available.
2.5. Retrieval Calibration
The aim of the retrieval algorithm development is to have outputs that are consistent with conventional global observations in terms of minimized systematic differences. Multiple factors may contribute to the systematic differences. The use of model simulations may have model-related biases, and a retrieval scheme may carry retrieval biases. In this study the upper air outputs from model-simulation-based neural networks are calibrated to radiosonde observations from RS92 and GPS RO profiles to achieve a systematically consistent dataset with the conventional measurements. For the upper air temperature and humidity retrievals in the troposphere, the calibration database is comprised of RS92 radiosonde observations. For the temperature outputs in the stratosphere, the retrievals are calibrated to GPS RO profiles.
Pixels with cloud quality flags 0 and 1 (as defined above) were used to co-locate with RS92 and COSMIC2013 data for the calibration scheme development. The co-location criteria are within 0.1 latitude/longitude degree and 1 h at each pressure level. For each pressure level, and the northern and southern hemispheres separately, a multiple linear regression was performed. Additionally, for temperature, individual regressions were done for each 10 degree temperature bin; four bins of specific humidity values, each having approximately the same number of matchups, were created for each pressure level from the calibration dataset. Then, setting:
where
THIRS is HIRS retrieval at a standard pressure level, and
Tind is COSMIC2013 or RS92 we regress to find a, b, and c such that:
where
LHIRS is the latitude of the
THIRS observation. From here, the data was corrected as:
An analogous method was used for specific humidity.
To avoid artifacts along the equator which may arise from the hemisphere-specific regressions, smoothing was applied to all data located between −10 and 10 degrees latitude. The associated latitude value was used as the linear interpolant for the smoothing. After bias calibrations were applied to the specific humidity data, a final quality control check was done to ensure that all specific humidity values were nonnegative and monotonically decreasing with increasing altitude. If a specific humidity value is found not monotonically decreasing with altitude, the value above is adjusted to the value below. No vertical adjustment is applied to temperature profiles. In many places, persistent temperature inversions are well captured in the retrievals. Temperature and specific humidity retrievals over southern high latitudes are not calibrated due to the very limited radiosonde measurements in the region for which no meaningful statistics can be generated. Over the Antarctic, more surface in situ long-term observations are required to establish a climate calibration benchmark.
The left panels in
Figure 4 and
Figure 5 show the remaining errors in terms of RMSEs of the retrievals globally and for different latitude bands after the calibration is applied. The standard deviation (STD) values for both HIRS retrievals (solid lines) and radiosonde or GPS RO observations (dashed lines) are included in the right panels to provide a context of variability at the standard pressure levels. There are about 3500 HIRS and radiosonde match-ups and about 4000 HIRS and GPS RO match-ups for pressure levels 850 hPa and above. The number of match-ups is less for 1000 hPa (about 1600), as the elevations of some areas are higher and thus there are no measurements available. The STD values of radiosondes and GPS RO profiles are consistently larger than those of HIRS retrievals. This indicates the difference between the footprint area measurement of HIRS and the point measurement from radiosondes or GPS RO, among other factors contributing to the differences. Globally, the RMSEs for the temperatures are in the range of 2–3 °C at the lower tropospheric levels, 1.4–1.6 °C at the mid- and upper-tropospheric levels, and 2.1–2.3 °C at the stratospheric levels. The temperature RMSEs in the low latitudes are, generally, smaller than the global RMSEs. The temperature RMSEs in the troposphere increase toward higher latitude bands as there is larger variability of temperatures in mid- to high-latitudes. For specific humidity, the global RMSEs are 2.4 g/kg at 1000 hPa, and steadily decrease to 1.6 g/kg at 700 hPa, 0.5 g/kg at 500 hPa, and 0.05 g/kg at 300 hPa. The largest RMSEs are from the low latitude band proportional to the high humidity values in the tropics.
The mean bias errors (MBEs) after the calibration procedure between the retrievals and 2011–2012 radiosonde and GPS RO profiles are shown in
Figure 6. The biases are defined as HIRS values minus radiosonde or GPS RO values. Globally, the MBEs at all levels are zero for both temperature and specific humidity. This shows that the calibration has accomplished the goal of minimizing biases in a global mean. Very small biases remain in differentiated latitude bands. For temperature, the biases are mostly less than ±0.1 °C for the mid-tropospheric to stratospheric levels. For the rest of the pressure levels the biases are less than ±0.6 °C. For the specific humidity, the tropical band has the smallest biases with values less than ±0.04 g/kg, and over the mid-latitude bands, the biases are less than ±0.15 g/kg. Over high latitudes, the co-located data are noisier and have a smaller number of matches; thus, the resulting biases are larger.
The calibration of the surface air temperature was performed separately using U.S. Climate Reference Network (USCRN) observations. The USCRN network consists of more than 200 stations in the contiguous U.S. and more than twenty in Alaska and Hawaii. One important goal of USCRN is to provide data for satellite calibration and validation [
34]. Each USCRN site includes triple redundancy for the primary air temperature and precipitation variables and for soil moisture/temperature. Instrumentation is regularly calibrated to National Institute for Standards and Technology standards. HIRS-derived surface temperatures were co-located with a full year of USCRN observations in the development of a linear regression equation for calibrating the retrievals to USCRN observations [
35].