3.1. Comparison of Fusion Results
This section presents the fusion results for the proposed algorithm and compares them to those of conventional image fusion algorithms. First, our proposed fusion process is performed step-by-step by using experimental images, which are shown in
Figure 2 and
Figure 3. To achieve reasonable computation time, only a portion of the image was extracted. Site 1 was selected as 700 × 700 pixels and Site 2 was selected as 600 × 600 pixels. As mentioned above, only the RGB bands of the reference Landsat-8 OLI image are extracted and classification is then performed. In this experiment, the number of classes is selected as a minimum of six. For the KOMPSAT-5 image, features other than intensity and intensity are extracted. Afterward, co-registration is performed, invariant pixels are extracted to be used as training data, and RF regression is obtained for each class to obtain a fused KOMPSAT-5 image. Then our proposed method is compared with conventional image fusion methods. To ensure a fair comparison, pixel-based image fusion algorithms are compared and the standard techniques, modified intensity-hue-saturation (IHS), principal component analysis (PCA), Gram–Schmidt (GS), and Ehlers fusion are selected. A modified IHS method proposed in Reference [
47] is a vast improvement over traditional IHS, which converts a color image from RGB to IHS color space and intensity band is replaced by the PAN image. The technique works by assessing the spectral overlap between each MS and PAN images and weighting the merge based on these relative wavelengths [
48]. The PCA method transforms a multivariate dataset of correlated variables into a dataset of new uncorrelated linear combinations of the original variables [
49]. It is assumed that the first PC band contains the most amount of information of the original image and replace it with the PAN image. Then an inverse PCA transform is performed to obtain the fused image. The GS method simulates a PAN image from the MS image, which is achieved by averaging the MS image. As the next step, GS transformation is performed for the simulated PAN image and the MS image with the simulated PAN image is employed as the first band. Then, the original PAN image replaces the first GS band. Lastly, an inverse GS transform is applied to create the fused image [
9]. The Ehlers method is based on the IHS transformation coupled with Fourier domain filtering [
38]. In other words, the first three bands of MS image are transformed to an IHS image and then a two dimensional Fast Fourier transform (FFT) is used to transform the intensity component and the PAN image into the frequency domain. The intensity spectrum is filtered with the low passage filter while the spectrum of the PAN image is filtered with an inverse high pass filter. After filtering, an inverse FFT is performed and added together to form a fused intensity component with the low-frequency information from the MS image and the high-frequency information from the PAN image. As the last step, an inverse IHS transformation produces a fused image. At this time, since the conventional image fusion methods work by using similar-time images, image fusion between the KOMPSAT-5 image and the Landsat-8 OLI image (called the criterion Landsat-8 OLI image and used for the performance evaluation) acquired on 19 September, 2014 (
Figure 4a and
Figure 5a) is performed. As mentioned above, the results of the proposed method and image fusion are compared by visual inspection and statistical evaluation and the results of Sites 1 and 2 are shown in
Figure 4 and
Figure 5, respectively.
Visual inspection shows that the proposed method and the image fusion methods contain more information than the original single image. All of them include not only the characteristics of the original KOMPSAT-5 image but also color information of the Landsat-8 OLI image. However, on all sites, in the case of modified IHS, PCA, and GS, a massive color distortion is shown. The difference between the imaging mechanism of the KOMPSAT-5 and Landsat-8 OLI images is not considered at all. In addition, spectral preservation is not performed well when compared to the criterion Landsat-8 OLI image. With the Ehlers method, spectral preservation is achieved when compared to other image fusions, but blurriness is present, which results in some loss of detail especially on linear features. On the other hand, our proposed method shows that there is less color distortion and the spectral characteristics of the Landsat-8 OLI image are well preserved. It also provides surface roughness characteristics of the KOMPSAT-5 image, which fully combines complementary and supplementary information of the original images. In other words, the visual inspection shows that the proposed method produces better results than the other image fusion methods.
Although the visual inspection is easy and direct, it is highly subjective and cannot be used to accurately evaluate the practical effects of the algorithms. Therefore, the performance of each method is further analyzed quantitatively based on UIQI, CC, entropy, and CPBD, which is mentioned above. The calculated values of UIQI, CC, entropy, and CPBD are presented in
Table 2,
Table 3,
Table 4 and
Table 5. The UIQI in
Table 2 indicates that the proposed method is significantly better than the image fusion methods. Compared to the modified IHS, PCA, GS, and Ehlers, the proposed method improves, on average, by 0.4843, 0.3688, 0.2704, and 0.0907 in Site 1 and 0.4194, 0.3964, 0.2882, and 0.1440 in Site 2, respectively. The higher UIQI of the proposed algorithm indicates that it is similar to the criterion Landsat-8 OLI image.
Table 3 presents the CC, which is also higher than with image fusion. The improvements are similar to UIQI, which improve, on average, 0.3916, 0.3086, 0.2996, and 0.1524 in Site 1 and 0.4086, 0.3852, 0.2902, and 0.1457 in Site 2, respectively. The higher CC indicates that the proposed method provides better spectral preservation. In other words, spectral qualities are improved significantly when compared with the image fusion methods. In addition, the spatial quality, entropy, and CPBD improved significantly at all sites, which is shown in
Table 4 and
Table 5. This means that the fused image obtained by the proposed method contains more average information and less perceptually blurry distortions. In other words, it can be confirmed that the proposed method from the viewpoint of spectral and spatial quality is remarkably superior to the conventional image fusion methods.
Furthermore, comparing the performance between Site 1 and Site 2, it is confirmed that Site 1 is higher than Site 2 regardless of the spectral and spatial qualities. Based on the reference Landsat-8 OLI, Site 1 is mostly made up of the built-up area and includes more forest area than Site 2. On the other hand, Site 2 contains a lot of water and barren areas and somewhat less build-up and forest areas compared to Site 1. Generally, in the SAR images, build-up and forest areas have relatively high backscattered intensities while the water area has the lowest backscatter intensities and barren area also has low backscattered intensities but slightly higher than the water area due to involvement of soil roughness. In other words, it is confirmed that the backscattered intensities of the SAR image have an effect on the fusion results locally, which represent that build-up and forest areas with relatively high backscattered intensities are able to retrieve more information from the reference MS image than barren and water areas with low backscattered intensities.
In addition, a comparison was made with the results of fusion performed through other ensemble approaches, stochastic gradient boosting (SGB) regression, and adaptive boosting (AdaBoost) regression for further verification, which is shown in
Figure 4g,h and
Figure 5g,h , respectively. For the results of SGB regression compared with the results obtained by RF regression, it is not possible to retrieve the color of the ground well, but the colors of the other areas are well captured. On the other hand, the results of AdaBoost regression can be confirmed to be in somewhat reddish tones highlighted in both sites. In addition, the portions containing the SAR characteristics include noise of a green color. In other words, when visually analyzed, the results of SGB regression are somewhat worse than those obtained by RF regression, but they are fairly similar while the results of AdaBoost regression contain significantly different color information.
Both results are also evaluated statistically and the results are shown in
Table 6,
Table 7,
Table 8 and
Table 9. The spectral qualities, UIQI, and CC of the results obtained through RF regression are the highest regardless of site and band. In other words, when obtaining the results through RF regression, the most similar spectral information is included and the spectral preservation is performed well. On the other hand, spatial quality, entropy, and CPBD showed somewhat different results depending on the site and the band. In the case of entropy, Site 1 has the highest results obtained through RF regression regardless of the band while Site 2 shows different results by the band. For CPBD, except for the G band of Site 1, it can be confirmed that the results obtained through AdaBoost are the highest. However, although the spatial qualities vary according to site and band, it can be confirmed that the color information of the results obtained by RF regression is abundant and more stable as a whole.
3.3. Comparison of Change Detection Results
To investigate the applicability of the fused KOMPSAT-5 image in the change detection, change detection with the reference Landsat-8 OLI image is performed and compared with the result between the reference and criterion Landsat-8 OLI and KOMPSAT-5 images. The KOMPSAT-5 images with similar time periods as the reference Landsat-8 OLI are shown in
Figure 8. As mentioned above, the change detection of this study consists of two steps, which are pixel-based detection and object-based recognition. The final change detection results are shown in
Figure 9b–d and
Figure 10b–d , respectively. The black areas indicate the unchanged areas and the white areas indicate the changed areas. In order to observe the difference more clearly, several regions are selected and marked with red rectangles. Three rectangles are selected in Site 1 and two rectangles are selected in Site 2.
For rectangle 1 of Site 1, the change detection result of Landsat-8 OLI captures the exact change shape and the change detection result using the fused KOMPSAT-5 image shows a somewhat overestimated change shape but extracts fairly accurate changes. On the other hand, for the change detection result of KOMPSAT-5, the change is in the form of salt-and-pepper noise even though object recognition is performed. In rectangle 2, only the result using the fused KOMPAT-5 is correct. The change detection result between Landsat-8 OLI images is underestimated and the change detection result between KOMPSAT-5 images does not capture the change at all. Rectangle 3 is a relatively small change area, which detects only the change detection result of Landsat-8 OLI correctly. In Site 2, rectangle 1 is the region where only the fused KOMPSAT-5 image correctly detects the change, which shows that the change detection results between Landsat-8 OLI and KOMPSAT-5 are considerably underestimated. Rectangle 2, which is a relatively small change area compared with rectangle 1, extracts the correct change region only in the change detection result of Landsat-8 OLI. In other words, when using the fused KOMPSAT-5 image, it is judged that there is a limitation to detecting changes in a narrow area while detecting changes in a relatively large area is accurate.
Precision, recall, and the F-measure are then obtained to quantitatively evaluate the results of each change detection procedure based on the ground-truth data (
Figure 9a and
Figure 10a) and these are shown in
Table 10. At this time, precision and recall have an inverse proportion and there are limitations to the evaluation through two accuracy indices. Therefore, the accuracies are assessed using the F-measure combined with precision and recall. In Sites 1 and 2, the F-measure using the fused KOMPSAT-5 image shows 76.97% and 81.14%. The F-measure between Landsat-8 OLI images shows 78.68% and 81.45% and the F-measure between the KOMPSAT-5 images shows 59.29% and 57.02%. In all sites, using the fused KOMPSAT-5 image is similar to the change detection results of Landsat-8 OLI and is significantly improved compared to the change detection results of KOMPSAT-5 images. Lastly, the applicability of the fused KOMPSAT-5 image in the change detection is identified.