Optimized Sample Selection in SVM Classification by Combining with DMSP-OLS, Landsat NDVI and GlobeLand30 Products for Extracting Urban Built-Up Areas
The accuracy of training samples used for data classification methods, such as support vector machines (SVMs), has had a considerable positive impact on the results of urban area extractions. To improve the accuracy of urban built-up area extractions, this paper presents a sample-optimized approach...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2017-03-01
|
Series: | Remote Sensing |
Subjects: | |
Online Access: | http://www.mdpi.com/2072-4292/9/3/236 |
id |
doaj-4e98b57cdeb64f4584ec9b167038a055 |
---|---|
record_format |
Article |
spelling |
doaj-4e98b57cdeb64f4584ec9b167038a0552020-11-25T01:13:29ZengMDPI AGRemote Sensing2072-42922017-03-019323610.3390/rs9030236rs9030236Optimized Sample Selection in SVM Classification by Combining with DMSP-OLS, Landsat NDVI and GlobeLand30 Products for Extracting Urban Built-Up AreasXiaolong Ma0Xiaohua Tong1Sicong Liu2Xin Luo3Huan Xie4Chengming Li5College of Surveying and Geo-Informatics, Tongji University, Shanghai 200092, ChinaCollege of Surveying and Geo-Informatics, Tongji University, Shanghai 200092, ChinaCollege of Surveying and Geo-Informatics, Tongji University, Shanghai 200092, ChinaCollege of Surveying and Geo-Informatics, Tongji University, Shanghai 200092, ChinaCollege of Surveying and Geo-Informatics, Tongji University, Shanghai 200092, ChinaInstitute of Cartography and Geographic Information System, Chinese Academy of Surveying and Mapping, Beijing 100830, ChinaThe accuracy of training samples used for data classification methods, such as support vector machines (SVMs), has had a considerable positive impact on the results of urban area extractions. To improve the accuracy of urban built-up area extractions, this paper presents a sample-optimized approach for classifying urban area data using a combination of the Defense Meteorological Satellite Program-Operational Linescan System (DMSP-OLS) for nighttime light data, Landsat images, and GlobeLand30, which is a 30-m global land cover data product. The proposed approach consists of three main components: (1) initial sample generation and data classification into built-up and non-urban built-up areas based on the maximum and minimum intervals of digital numbers from the DMSP-OLS data, respectively; (2) refined sample selection and optimization by the probability threshold of each pixel based on vegetation-cover, using the Landsat-derived normalized differential vegetation index (NDVI) and artificial surfaces extracted from the GlobeLand30 product as the constraints; (3) iterative classification and urban built-up area data extraction using the relationship between these three aspects of data collection together with the training sets. Experiments were conducted for several cities in western China using this proposed approach for the extraction of built-up areas, which were classified using urban construction statistical yearbooks and Landsat images and were compared with data obtained from traditional data collection methods, such as the threshold dichotomy method and the improved neighborhood focal statistics method. An analysis of the empirical results indicated that (1) the sample training process was improved using the proposed method, and the overall accuracy (OA) increased from 89% to 96% for both the optimized and non-optimized sample selection; (2) the proposed method had a relative error of less than 10%, as calculated by an accuracy assessment; (3) the overall and individual class accuracy were higher for artificial surfaces in GlobeLand30; and (4) the average OA obviously improved and the Kappa coefficient in the case of Chengdu increased from 0.54 to 0.80. Therefore, the experimental results demonstrated that our proposed approach is a reliable solution for extracting urban built-up areas with a high degree of accuracy.http://www.mdpi.com/2072-4292/9/3/236urban built-up areassample-optimized approachSVM classificationDMSP-OLSinitial training samplesGlobeLand30NDVIprobability thresholditerative optimization process |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xiaolong Ma Xiaohua Tong Sicong Liu Xin Luo Huan Xie Chengming Li |
spellingShingle |
Xiaolong Ma Xiaohua Tong Sicong Liu Xin Luo Huan Xie Chengming Li Optimized Sample Selection in SVM Classification by Combining with DMSP-OLS, Landsat NDVI and GlobeLand30 Products for Extracting Urban Built-Up Areas Remote Sensing urban built-up areas sample-optimized approach SVM classification DMSP-OLS initial training samples GlobeLand30 NDVI probability threshold iterative optimization process |
author_facet |
Xiaolong Ma Xiaohua Tong Sicong Liu Xin Luo Huan Xie Chengming Li |
author_sort |
Xiaolong Ma |
title |
Optimized Sample Selection in SVM Classification by Combining with DMSP-OLS, Landsat NDVI and GlobeLand30 Products for Extracting Urban Built-Up Areas |
title_short |
Optimized Sample Selection in SVM Classification by Combining with DMSP-OLS, Landsat NDVI and GlobeLand30 Products for Extracting Urban Built-Up Areas |
title_full |
Optimized Sample Selection in SVM Classification by Combining with DMSP-OLS, Landsat NDVI and GlobeLand30 Products for Extracting Urban Built-Up Areas |
title_fullStr |
Optimized Sample Selection in SVM Classification by Combining with DMSP-OLS, Landsat NDVI and GlobeLand30 Products for Extracting Urban Built-Up Areas |
title_full_unstemmed |
Optimized Sample Selection in SVM Classification by Combining with DMSP-OLS, Landsat NDVI and GlobeLand30 Products for Extracting Urban Built-Up Areas |
title_sort |
optimized sample selection in svm classification by combining with dmsp-ols, landsat ndvi and globeland30 products for extracting urban built-up areas |
publisher |
MDPI AG |
series |
Remote Sensing |
issn |
2072-4292 |
publishDate |
2017-03-01 |
description |
The accuracy of training samples used for data classification methods, such as support vector machines (SVMs), has had a considerable positive impact on the results of urban area extractions. To improve the accuracy of urban built-up area extractions, this paper presents a sample-optimized approach for classifying urban area data using a combination of the Defense Meteorological Satellite Program-Operational Linescan System (DMSP-OLS) for nighttime light data, Landsat images, and GlobeLand30, which is a 30-m global land cover data product. The proposed approach consists of three main components: (1) initial sample generation and data classification into built-up and non-urban built-up areas based on the maximum and minimum intervals of digital numbers from the DMSP-OLS data, respectively; (2) refined sample selection and optimization by the probability threshold of each pixel based on vegetation-cover, using the Landsat-derived normalized differential vegetation index (NDVI) and artificial surfaces extracted from the GlobeLand30 product as the constraints; (3) iterative classification and urban built-up area data extraction using the relationship between these three aspects of data collection together with the training sets. Experiments were conducted for several cities in western China using this proposed approach for the extraction of built-up areas, which were classified using urban construction statistical yearbooks and Landsat images and were compared with data obtained from traditional data collection methods, such as the threshold dichotomy method and the improved neighborhood focal statistics method. An analysis of the empirical results indicated that (1) the sample training process was improved using the proposed method, and the overall accuracy (OA) increased from 89% to 96% for both the optimized and non-optimized sample selection; (2) the proposed method had a relative error of less than 10%, as calculated by an accuracy assessment; (3) the overall and individual class accuracy were higher for artificial surfaces in GlobeLand30; and (4) the average OA obviously improved and the Kappa coefficient in the case of Chengdu increased from 0.54 to 0.80. Therefore, the experimental results demonstrated that our proposed approach is a reliable solution for extracting urban built-up areas with a high degree of accuracy. |
topic |
urban built-up areas sample-optimized approach SVM classification DMSP-OLS initial training samples GlobeLand30 NDVI probability threshold iterative optimization process |
url |
http://www.mdpi.com/2072-4292/9/3/236 |
work_keys_str_mv |
AT xiaolongma optimizedsampleselectioninsvmclassificationbycombiningwithdmspolslandsatndviandglobeland30productsforextractingurbanbuiltupareas AT xiaohuatong optimizedsampleselectioninsvmclassificationbycombiningwithdmspolslandsatndviandglobeland30productsforextractingurbanbuiltupareas AT sicongliu optimizedsampleselectioninsvmclassificationbycombiningwithdmspolslandsatndviandglobeland30productsforextractingurbanbuiltupareas AT xinluo optimizedsampleselectioninsvmclassificationbycombiningwithdmspolslandsatndviandglobeland30productsforextractingurbanbuiltupareas AT huanxie optimizedsampleselectioninsvmclassificationbycombiningwithdmspolslandsatndviandglobeland30productsforextractingurbanbuiltupareas AT chengmingli optimizedsampleselectioninsvmclassificationbycombiningwithdmspolslandsatndviandglobeland30productsforextractingurbanbuiltupareas |
_version_ |
1725162004516175872 |