A Two-Branch CNN Architecture for Land Cover Classification of PAN and MS Imagery

The use of Very High Spatial Resolution (VHSR) imagery in remote sensing applications is nowadays a current practice whenever fine-scale monitoring of the earth’s surface is concerned. VHSR Land Cover classification, in particular, is currently a well-established tool to support decisions...

Full description

Bibliographic Details
Main Authors: Raffaele Gaetano, Dino Ienco, Kenji Ose, Remi Cresson
Format: Article
Language:English
Published: MDPI AG 2018-11-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/10/11/1746
Description
Summary:The use of Very High Spatial Resolution (VHSR) imagery in remote sensing applications is nowadays a current practice whenever fine-scale monitoring of the earth&#8217;s surface is concerned. VHSR Land Cover classification, in particular, is currently a well-established tool to support decisions in several domains, including urban monitoring, agriculture, biodiversity, and environmental assessment. Additionally, land cover classification can be employed to annotate VHSR imagery with the aim of retrieving spatial statistics or areas with similar land cover. Modern VHSR sensors provide data at multiple spatial and spectral resolutions, most commonly as a couple of a higher-resolution single-band panchromatic (PAN) and a coarser multispectral (MS) imagery. In the typical land cover classification workflow, the multi-resolution input is preprocessed to generate a single multispectral image at the highest resolution available by means of a pan-sharpening process. Recently, deep learning approaches have shown the advantages of avoiding data preprocessing by letting machine learning algorithms automatically transform input data to best fit the classification task. Following this rationale, we here propose a new deep learning architecture to jointly use PAN and MS imagery for a direct classification without any prior image sharpening or resampling process. Our method, namely <inline-formula> <math display="inline"> <semantics> <mrow> <mi>M</mi> <mi>u</mi> <mi>l</mi> <mi>t</mi> <mi>i</mi> <mi>R</mi> <mi>e</mi> <mi>s</mi> <mi>o</mi> <mi>L</mi> <mi>C</mi> <mi>C</mi> </mrow> </semantics> </math> </inline-formula>, consists of a two-branch end-to-end network which extracts features from each source at their native resolution and lately combine them to perform land cover classification at the PAN resolution. Experiments are carried out on two real-world scenarios over large areas with contrasted land cover characteristics. The experimental results underline the quality of our method while the characteristics of the proposed scenarios underline the applicability and the generality of our strategy in operational settings.
ISSN:2072-4292