Surface Heterogeneity-Involved Estimation of Sample Size for Accuracy Assessment of Land Cover Product from Satellite Imagery

Sample size estimation is a key issue for validating land cover products derived from satellite images. Based on the fact that present sample size estimation methods account for the characteristics of the Earth’s subsurface, this study developed a model for estimating sample size by consid...

Full description

Bibliographic Details
Main Authors: Huiqun Ren, Guoyin Cai, Mingyi Du
Format: Article
Language:English
Published: MDPI AG 2019-10-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/19/20/4430
id doaj-e89757d77365477d808bd76df762af80
record_format Article
spelling doaj-e89757d77365477d808bd76df762af802020-11-25T01:56:43ZengMDPI AGSensors1424-82202019-10-011920443010.3390/s19204430s19204430Surface Heterogeneity-Involved Estimation of Sample Size for Accuracy Assessment of Land Cover Product from Satellite ImageryHuiqun Ren0Guoyin Cai1Mingyi Du2School of Geomatics and Urban Spatial Informatics, Beijing University of Civil Engineering and Architecture, Beijing 100044, ChinaSchool of Geomatics and Urban Spatial Informatics, Beijing University of Civil Engineering and Architecture, Beijing 100044, ChinaSchool of Geomatics and Urban Spatial Informatics, Beijing University of Civil Engineering and Architecture, Beijing 100044, ChinaSample size estimation is a key issue for validating land cover products derived from satellite images. Based on the fact that present sample size estimation methods account for the characteristics of the Earth&#8217;s subsurface, this study developed a model for estimating sample size by considering the scale effect and surface heterogeneity. First, we introduced a watershed with different areas to indicate the scale effect on the sample size. Then, by employing an all-subsets regression feature selection method, three landscape indicators describing the aggregation and diversity of the land cover patches were selected (from 14 indicators) as the main factors for indicating the surface heterogeneity. Finally, we developed a multi-level linear model for sample size estimation using explanatory variables, including the estimated sample size (<i>n</i>) calculated from the traditional statistical model, size of the test region, and three landscape indicators. As reference data for developing this model, we employed a case study in the Jiangxi Province using a 30 m spatial resolution global land cover product (Globeland30) from 2010 as a classified map, and national 30 m land use/cover change (LUCC) data from 2010 in China. The results showed that the adjusted square coefficient of R<sup>2</sup> is 0.79, indicating that the joint explanatory ability of all predictive variables in the model to the sample size is 79%. This means that the predictability of this model is at a good level. By comparing the sample size <i>N<sub>S</sub></i> obtained by the developed multi-level linear model and <i>n</i> as calculated from the statistics model, we find that <i>N<sub>S</sub></i> is much smaller than <i>n</i>, which mainly contributes to the concerns regarding surface heterogeneity in this study. The validity of the established model is tested and is proven as effective in the Anhui Province. This indicates that the estimated sample size from considering the scale effect and spatial heterogeneity in this study achieved the same accuracy as that calculated from a probability statistical model, while simultaneously saving more time, labour, and money in the accuracy assessment of a land cover dataset.https://www.mdpi.com/1424-8220/19/20/4430sample size estimationland cover productssatellite imagessurface heterogeneityjiangxi province
collection DOAJ
language English
format Article
sources DOAJ
author Huiqun Ren
Guoyin Cai
Mingyi Du
spellingShingle Huiqun Ren
Guoyin Cai
Mingyi Du
Surface Heterogeneity-Involved Estimation of Sample Size for Accuracy Assessment of Land Cover Product from Satellite Imagery
Sensors
sample size estimation
land cover products
satellite images
surface heterogeneity
jiangxi province
author_facet Huiqun Ren
Guoyin Cai
Mingyi Du
author_sort Huiqun Ren
title Surface Heterogeneity-Involved Estimation of Sample Size for Accuracy Assessment of Land Cover Product from Satellite Imagery
title_short Surface Heterogeneity-Involved Estimation of Sample Size for Accuracy Assessment of Land Cover Product from Satellite Imagery
title_full Surface Heterogeneity-Involved Estimation of Sample Size for Accuracy Assessment of Land Cover Product from Satellite Imagery
title_fullStr Surface Heterogeneity-Involved Estimation of Sample Size for Accuracy Assessment of Land Cover Product from Satellite Imagery
title_full_unstemmed Surface Heterogeneity-Involved Estimation of Sample Size for Accuracy Assessment of Land Cover Product from Satellite Imagery
title_sort surface heterogeneity-involved estimation of sample size for accuracy assessment of land cover product from satellite imagery
publisher MDPI AG
series Sensors
issn 1424-8220
publishDate 2019-10-01
description Sample size estimation is a key issue for validating land cover products derived from satellite images. Based on the fact that present sample size estimation methods account for the characteristics of the Earth&#8217;s subsurface, this study developed a model for estimating sample size by considering the scale effect and surface heterogeneity. First, we introduced a watershed with different areas to indicate the scale effect on the sample size. Then, by employing an all-subsets regression feature selection method, three landscape indicators describing the aggregation and diversity of the land cover patches were selected (from 14 indicators) as the main factors for indicating the surface heterogeneity. Finally, we developed a multi-level linear model for sample size estimation using explanatory variables, including the estimated sample size (<i>n</i>) calculated from the traditional statistical model, size of the test region, and three landscape indicators. As reference data for developing this model, we employed a case study in the Jiangxi Province using a 30 m spatial resolution global land cover product (Globeland30) from 2010 as a classified map, and national 30 m land use/cover change (LUCC) data from 2010 in China. The results showed that the adjusted square coefficient of R<sup>2</sup> is 0.79, indicating that the joint explanatory ability of all predictive variables in the model to the sample size is 79%. This means that the predictability of this model is at a good level. By comparing the sample size <i>N<sub>S</sub></i> obtained by the developed multi-level linear model and <i>n</i> as calculated from the statistics model, we find that <i>N<sub>S</sub></i> is much smaller than <i>n</i>, which mainly contributes to the concerns regarding surface heterogeneity in this study. The validity of the established model is tested and is proven as effective in the Anhui Province. This indicates that the estimated sample size from considering the scale effect and spatial heterogeneity in this study achieved the same accuracy as that calculated from a probability statistical model, while simultaneously saving more time, labour, and money in the accuracy assessment of a land cover dataset.
topic sample size estimation
land cover products
satellite images
surface heterogeneity
jiangxi province
url https://www.mdpi.com/1424-8220/19/20/4430
work_keys_str_mv AT huiqunren surfaceheterogeneityinvolvedestimationofsamplesizeforaccuracyassessmentoflandcoverproductfromsatelliteimagery
AT guoyincai surfaceheterogeneityinvolvedestimationofsamplesizeforaccuracyassessmentoflandcoverproductfromsatelliteimagery
AT mingyidu surfaceheterogeneityinvolvedestimationofsamplesizeforaccuracyassessmentoflandcoverproductfromsatelliteimagery
_version_ 1724978355767345152