Optimized Sample Selection in SVM Classification by Combining with DMSP-OLS, Landsat NDVI and GlobeLand30 Products for Extracting Urban Built-Up Areas

The accuracy of training samples used for data classification methods, such as support vector machines (SVMs), has had a considerable positive impact on the results of urban area extractions. To improve the accuracy of urban built-up area extractions, this paper presents a sample-optimized approach...

Bibliographic Details
Main Authors: Xiaolong Ma, Xiaohua Tong, Sicong Liu, Xin Luo, Huan Xie, Chengming Li
Format: Article
Language: English
Published: MDPI AG 2017-03-01
Series: Remote Sensing
Online Access: http://www.mdpi.com/2072-4292/9/3/236
id doaj-4e98b57cdeb64f4584ec9b167038a055
record_format Article
spelling Remote Sensing, vol. 9, no. 3, article 236 (2017-03-01); ISSN 2072-4292; DOI 10.3390/rs9030236
affiliation Xiaolong Ma, Xiaohua Tong, Sicong Liu, Xin Luo, Huan Xie: College of Surveying and Geo-Informatics, Tongji University, Shanghai 200092, China
Chengming Li: Institute of Cartography and Geographic Information System, Chinese Academy of Surveying and Mapping, Beijing 100830, China
collection DOAJ
language English
format Article
sources DOAJ
author Xiaolong Ma
Xiaohua Tong
Sicong Liu
Xin Luo
Huan Xie
Chengming Li
title Optimized Sample Selection in SVM Classification by Combining with DMSP-OLS, Landsat NDVI and GlobeLand30 Products for Extracting Urban Built-Up Areas
publisher MDPI AG
series Remote Sensing
issn 2072-4292
publishDate 2017-03-01
description The accuracy of the training samples used by data classification methods, such as support vector machines (SVMs), has a considerable impact on the results of urban area extraction. To improve the accuracy of urban built-up area extraction, this paper presents a sample-optimized approach for classifying urban area data that combines nighttime light data from the Defense Meteorological Satellite Program-Operational Linescan System (DMSP-OLS), Landsat images, and GlobeLand30, a 30-m global land cover data product. The proposed approach consists of three main components: (1) initial sample generation and data classification into built-up and non-built-up areas based on the maximum and minimum intervals of digital numbers from the DMSP-OLS data, respectively; (2) refined sample selection and optimization by the probability threshold of each pixel based on vegetation cover, using the Landsat-derived normalized difference vegetation index (NDVI) and artificial surfaces extracted from the GlobeLand30 product as the constraints; (3) iterative classification and urban built-up area extraction using the relationship between these three data sources together with the training sets. Experiments were conducted for several cities in western China using the proposed approach. The extracted built-up areas were assessed against urban construction statistical yearbooks and Landsat images and were compared with results from traditional methods, such as the threshold dichotomy method and the improved neighborhood focal statistics method.
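The first two components described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the cutoffs `low_max`, `high_min`, and `ndvi_max` are hypothetical values chosen for the example, the function names are invented here, and a plain NDVI cutoff stands in for the per-pixel probability threshold mentioned in the abstract. The only fixed fact used is that DMSP-OLS digital numbers range from 0 to 63.

```python
from typing import List

UNLABELED, NON_BUILT_UP, BUILT_UP = -1, 0, 1


def initial_samples(dn: List[int], low_max: int = 5, high_min: int = 55) -> List[int]:
    """Step 1: label pixels from DMSP-OLS digital numbers (0-63).

    Pixels in the top interval become built-up candidates, pixels in the
    bottom interval become non-built-up candidates, the rest stay unlabeled.
    low_max and high_min are hypothetical cutoffs, not values from the paper.
    """
    labels = []
    for value in dn:
        if value >= high_min:
            labels.append(BUILT_UP)
        elif value <= low_max:
            labels.append(NON_BUILT_UP)
        else:
            labels.append(UNLABELED)
    return labels


def refine_samples(labels: List[int], ndvi: List[float],
                   artificial: List[bool], ndvi_max: float = 0.3) -> List[int]:
    """Step 2: drop candidates that contradict the NDVI or GlobeLand30 constraints.

    artificial[i] is True where GlobeLand30 marks the pixel as artificial surface.
    ndvi_max is a hypothetical cutoff standing in for the probability threshold.
    """
    refined = []
    for label, value, is_artificial in zip(labels, ndvi, artificial):
        if label == BUILT_UP and (value > ndvi_max or not is_artificial):
            refined.append(UNLABELED)  # vegetated or not artificial: unreliable
        elif label == NON_BUILT_UP and is_artificial:
            refined.append(UNLABELED)  # dark pixel on artificial surface: unreliable
        else:
            refined.append(label)
    return refined


dn = [63, 60, 58, 2, 0, 30]
ndvi = [0.1, 0.6, 0.2, 0.7, 0.1, 0.4]
artificial = [True, True, False, False, True, False]
refined = refine_samples(initial_samples(dn), ndvi, artificial)
```

The pixels that keep a label after refinement would then serve as training samples for the classifier, and the unlabeled pixels would be classified in the iterative third step.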
An analysis of the empirical results indicated that (1) the sample training process was improved using the proposed method, and the overall accuracy (OA) increased from 89% to 96% for both the optimized and non-optimized sample selection; (2) the proposed method had a relative error of less than 10%, as calculated by an accuracy assessment; (3) the overall and individual class accuracies were higher for artificial surfaces in GlobeLand30; and (4) the average OA improved markedly, and the Kappa coefficient for Chengdu increased from 0.54 to 0.80. The experimental results therefore demonstrate that the proposed approach is a reliable solution for extracting urban built-up areas with a high degree of accuracy.
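For readers unfamiliar with the two accuracy measures reported above, the following sketch shows how overall accuracy and Cohen's Kappa coefficient are conventionally computed from a confusion matrix. The matrix values are invented for illustration and are not data from the article; the function names are likewise assumptions of this example.

```python
from typing import List


def overall_accuracy(cm: List[List[int]]) -> float:
    """Fraction of samples on the diagonal of the confusion matrix."""
    total = sum(sum(row) for row in cm)
    correct = sum(cm[i][i] for i in range(len(cm)))
    return correct / total


def cohen_kappa(cm: List[List[int]]) -> float:
    """Agreement beyond chance: (p_o - p_e) / (1 - p_e).

    p_o is the observed agreement (overall accuracy); p_e is the agreement
    expected by chance, computed from the row and column totals.
    Undefined when p_e equals 1 (all samples in a single class).
    """
    total = sum(sum(row) for row in cm)
    p_o = overall_accuracy(cm)
    p_e = sum(
        sum(cm[i]) * sum(row[i] for row in cm) for i in range(len(cm))
    ) / (total * total)
    return (p_o - p_e) / (1 - p_e)


# Illustrative 2x2 matrix: rows are reference classes, columns are predictions
# (built-up, non-built-up). These counts are made up for the example.
cm = [[40, 10],
      [5, 45]]
oa = overall_accuracy(cm)
kappa = cohen_kappa(cm)
```

Kappa discounts the agreement that would occur by chance, which is why it can be much lower than OA when one class dominates the scene.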
topic urban built-up areas
sample-optimized approach
SVM classification
DMSP-OLS
initial training samples
GlobeLand30
NDVI
probability threshold
iterative optimization process
url http://www.mdpi.com/2072-4292/9/3/236