Self-Supervised Representation Learning for Remote Sensing Image Change Detection Based on Temporal Prediction

Traditional change detection (CD) methods operate in the simple image domain or hand-crafted features, which has less robustness to the inconsistencies (e.g., brightness and noise distribution, etc.) between bitemporal satellite images. Recently, deep learning techniques have reported compelling per...

Full description

Bibliographic Details
Main Authors: Huihui Dong, Wenping Ma, Yue Wu, Jun Zhang, Licheng Jiao
Format: Article
Language:English
Published: MDPI AG 2020-06-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/12/11/1868
id doaj-50be74128b6c421b88b31b20ebd2d5f5
record_format Article
spelling doaj-50be74128b6c421b88b31b20ebd2d5f52020-11-25T03:32:55ZengMDPI AGRemote Sensing2072-42922020-06-01121868186810.3390/rs12111868Self-Supervised Representation Learning for Remote Sensing Image Change Detection Based on Temporal PredictionHuihui Dong0Wenping Ma1Yue Wu2Jun Zhang3Licheng Jiao4School of Artificial Intelligence, The Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education, Xidian University, Xi’an 710071, ChinaSchool of Artificial Intelligence, The Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education, Xidian University, Xi’an 710071, ChinaSchool of Computer Science and Technology, The Xi’an Key Laboratory of Big Data and Intelligent Vision, Xidian University, Xi’an 710071, ChinaSchool of Artificial Intelligence, The Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education, Xidian University, Xi’an 710071, ChinaSchool of Artificial Intelligence, The Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education, Xidian University, Xi’an 710071, ChinaTraditional change detection (CD) methods operate in the simple image domain or hand-crafted features, which has less robustness to the inconsistencies (e.g., brightness and noise distribution, etc.) between bitemporal satellite images. Recently, deep learning techniques have reported compelling performance on robust feature learning. However, generating accurate semantic supervision that reveals real change information in satellite images still remains challenging, especially for manual annotation. To solve this problem, we propose a novel self-supervised representation learning method based on temporal prediction for remote sensing image CD. The main idea of our algorithm is to transform two satellite images into more consistent feature representations through a self-supervised mechanism without semantic supervision and any additional computations. Based on the transformed feature representations, a better difference image (DI) can be obtained, which reduces the propagated error of DI on the final detection result. In the self-supervised mechanism, the network is asked to identify different sample patches between two temporal images, namely, temporal prediction. By designing the network for the temporal prediction task to imitate the discriminator of generative adversarial networks, the distribution-aware feature representations are automatically captured and the result with powerful robustness can be acquired. Experimental results on real remote sensing data sets show the effectiveness and superiority of our method, improving the detection precision up to 0.94–35.49%.https://www.mdpi.com/2072-4292/12/11/1868unsupervised change detectiongenerative adversarial networksdeep belief networksself-supervised representation learningremote sensing images
collection DOAJ
language English
format Article
sources DOAJ
author Huihui Dong
Wenping Ma
Yue Wu
Jun Zhang
Licheng Jiao
spellingShingle Huihui Dong
Wenping Ma
Yue Wu
Jun Zhang
Licheng Jiao
Self-Supervised Representation Learning for Remote Sensing Image Change Detection Based on Temporal Prediction
Remote Sensing
unsupervised change detection
generative adversarial networks
deep belief networks
self-supervised representation learning
remote sensing images
author_facet Huihui Dong
Wenping Ma
Yue Wu
Jun Zhang
Licheng Jiao
author_sort Huihui Dong
title Self-Supervised Representation Learning for Remote Sensing Image Change Detection Based on Temporal Prediction
title_short Self-Supervised Representation Learning for Remote Sensing Image Change Detection Based on Temporal Prediction
title_full Self-Supervised Representation Learning for Remote Sensing Image Change Detection Based on Temporal Prediction
title_fullStr Self-Supervised Representation Learning for Remote Sensing Image Change Detection Based on Temporal Prediction
title_full_unstemmed Self-Supervised Representation Learning for Remote Sensing Image Change Detection Based on Temporal Prediction
title_sort self-supervised representation learning for remote sensing image change detection based on temporal prediction
publisher MDPI AG
series Remote Sensing
issn 2072-4292
publishDate 2020-06-01
description Traditional change detection (CD) methods operate in the simple image domain or hand-crafted features, which has less robustness to the inconsistencies (e.g., brightness and noise distribution, etc.) between bitemporal satellite images. Recently, deep learning techniques have reported compelling performance on robust feature learning. However, generating accurate semantic supervision that reveals real change information in satellite images still remains challenging, especially for manual annotation. To solve this problem, we propose a novel self-supervised representation learning method based on temporal prediction for remote sensing image CD. The main idea of our algorithm is to transform two satellite images into more consistent feature representations through a self-supervised mechanism without semantic supervision and any additional computations. Based on the transformed feature representations, a better difference image (DI) can be obtained, which reduces the propagated error of DI on the final detection result. In the self-supervised mechanism, the network is asked to identify different sample patches between two temporal images, namely, temporal prediction. By designing the network for the temporal prediction task to imitate the discriminator of generative adversarial networks, the distribution-aware feature representations are automatically captured and the result with powerful robustness can be acquired. Experimental results on real remote sensing data sets show the effectiveness and superiority of our method, improving the detection precision up to 0.94–35.49%.
topic unsupervised change detection
generative adversarial networks
deep belief networks
self-supervised representation learning
remote sensing images
url https://www.mdpi.com/2072-4292/12/11/1868
work_keys_str_mv AT huihuidong selfsupervisedrepresentationlearningforremotesensingimagechangedetectionbasedontemporalprediction
AT wenpingma selfsupervisedrepresentationlearningforremotesensingimagechangedetectionbasedontemporalprediction
AT yuewu selfsupervisedrepresentationlearningforremotesensingimagechangedetectionbasedontemporalprediction
AT junzhang selfsupervisedrepresentationlearningforremotesensingimagechangedetectionbasedontemporalprediction
AT lichengjiao selfsupervisedrepresentationlearningforremotesensingimagechangedetectionbasedontemporalprediction
_version_ 1724565918936203264