Geographical knowledge-driven representation learning for remote sensing images

W Li, K Chen, H Chen, Z Shi - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
IEEE Transactions on Geoscience and Remote Sensing, 2021ieeexplore.ieee.org
The proliferation of remote sensing satellites has resulted in a massive amount of remote
sensing images. However, due to human and material resource constraints, the vast majority
of remote sensing images remain unlabeled. As a result, it cannot be applied to currently
available deep learning methods. To fully utilize the remaining unlabeled images, we
propose a Geographical Knowledge-driven Representation (GeoKR) learning method for
remote sensing images, improving network performance and reduce the demand for …
The proliferation of remote sensing satellites has resulted in a massive amount of remote sensing images. However, due to human and material resource constraints, the vast majority of remote sensing images remain unlabeled. As a result, it cannot be applied to currently available deep learning methods. To fully utilize the remaining unlabeled images, we propose a Geographical Knowledge-driven Representation (GeoKR) learning method for remote sensing images, improving network performance and reduce the demand for annotated data. The global land cover products and geographical location associated with each remote sensing image are regarded as geographical knowledge to provide supervision for representation learning and network pretraining. An efficient pretraining framework is proposed to eliminate the supervision noises caused by imaging times and resolutions difference between remote sensing images and geographical knowledge. A large-scale pretraining dataset Levir-KR is constructed to support network pretraining. It contains 1431950 remote sensing images from Gaofen series satellites with various resolutions. Experimental results demonstrate that our proposed method outperforms ImageNet pretraining and self-supervised representation learning methods and significantly reduces the burden of data annotation on downstream tasks, such as scene classification, semantic segmentation, object detection, and cloud/snow detection. It demonstrates that our proposed method can be used as a novel paradigm for pretraining neural networks. Codes will be available on https://github.com/flyakon/Geographical-Knowledge-driven-Representaion-Learning .
ieeexplore.ieee.org