Disaggregating County-Level Census Data for Population Mapping Using Residential Geo-Objects With Multisource Geo-Spatial Data

Accurate spatialization of socioeconomic data is conducive to understand the spatial and temporal distribution of human social development status and, thus, effectively support future scientific decision-making. This study focuses on population mapping, which is a classical spatialization of macroec...

Full description

Bibliographic Details
Main Authors: Tianjun Wu, Jiancheng Luo, Wen Dong, Lijing Gao, Xiaodong Hu, Zhifeng Wu, Yingwei Sun, Jinsong Liu
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9042311/
id doaj-a389915b5a144ada823cc2f795a0be22
record_format Article
spelling doaj-a389915b5a144ada823cc2f795a0be222021-06-03T23:00:55ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352020-01-01131189120510.1109/JSTARS.2020.29748969042311Disaggregating County-Level Census Data for Population Mapping Using Residential Geo-Objects With Multisource Geo-Spatial DataTianjun Wu0https://orcid.org/0000-0003-0178-2342Jiancheng Luo1Wen Dong2Lijing Gao3Xiaodong Hu4Zhifeng Wu5Yingwei Sun6https://orcid.org/0000-0002-3051-3259Jinsong Liu7School of Geology Engineering and Geomatics, Chang'an University, Xi'an, ChinaState Key Laboratory of Remote Sensing Science of the Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaState Key Laboratory of Remote Sensing Science of the Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaState Key Laboratory of Remote Sensing Science of the Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaState Key Laboratory of Remote Sensing Science of the Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaSchool of Geographical Sciences, Guangzhou University in Guangzhou, Guangdong, ChinaState Key Laboratory of Remote Sensing Science of the Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaSchool of Resources and Environmental Sciences, Hebei Normal University, Hebei, ChinaAccurate spatialization of socioeconomic data is conducive to understand the spatial and temporal distribution of human social development status and, thus, effectively support future scientific decision-making. This study focuses on population mapping, which is a classical spatialization of macroeconomic data of the social economy. Traditional population mapping based on rough grids or administrative divisions such as townships often has deficiencies in the accuracy of spatial pattern and prediction. In this article, hence, we employ residential geo-objects as basic mapping units and formalize the problem as a spatial prediction process using machine-learning (ML) methods with high-spatial-resolution (HSR) satellite remote sensing images and multisource geospatial data. The indicators of population spatial density, including residential geo-objects' area, building existence index, terrain slope, night light intensity, density of point of interest (POI) and road network from Internet electronic maps, and locational factors such as the distances from road and river, are jointly applied to establish the relationship between these multivariable factors and quantitative index of population density using ML algorithms such as Random Forests and XGBoost. The predicated values of population density from the mined nonlinear regression relation are further used to calculate the weights of disaggregation of each unit, and then the population quantity distribution at the scale of residential geo-objects is obtained under the control of the total amount of population statistics. Experiments with a county area show that the methodology has the ability to achieve better results than the traditional deterministic methods by reproducing a more accurate and finer geographic population distribution pattern. Meanwhile, it is found that the optimization of mapping results may benefit from the multisources geospatial data, and thus the methodological framework can be recommended to be extended to other spatialization areas of socioeconomic data.https://ieeexplore.ieee.org/document/9042311/Census datamachine-learning (ML) algorithmsmultisource geospatial datapopulation mappingresidential geo-objectsspatialization
collection DOAJ
language English
format Article
sources DOAJ
author Tianjun Wu
Jiancheng Luo
Wen Dong
Lijing Gao
Xiaodong Hu
Zhifeng Wu
Yingwei Sun
Jinsong Liu
spellingShingle Tianjun Wu
Jiancheng Luo
Wen Dong
Lijing Gao
Xiaodong Hu
Zhifeng Wu
Yingwei Sun
Jinsong Liu
Disaggregating County-Level Census Data for Population Mapping Using Residential Geo-Objects With Multisource Geo-Spatial Data
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Census data
machine-learning (ML) algorithms
multisource geospatial data
population mapping
residential geo-objects
spatialization
author_facet Tianjun Wu
Jiancheng Luo
Wen Dong
Lijing Gao
Xiaodong Hu
Zhifeng Wu
Yingwei Sun
Jinsong Liu
author_sort Tianjun Wu
title Disaggregating County-Level Census Data for Population Mapping Using Residential Geo-Objects With Multisource Geo-Spatial Data
title_short Disaggregating County-Level Census Data for Population Mapping Using Residential Geo-Objects With Multisource Geo-Spatial Data
title_full Disaggregating County-Level Census Data for Population Mapping Using Residential Geo-Objects With Multisource Geo-Spatial Data
title_fullStr Disaggregating County-Level Census Data for Population Mapping Using Residential Geo-Objects With Multisource Geo-Spatial Data
title_full_unstemmed Disaggregating County-Level Census Data for Population Mapping Using Residential Geo-Objects With Multisource Geo-Spatial Data
title_sort disaggregating county-level census data for population mapping using residential geo-objects with multisource geo-spatial data
publisher IEEE
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
issn 2151-1535
publishDate 2020-01-01
description Accurate spatialization of socioeconomic data is conducive to understand the spatial and temporal distribution of human social development status and, thus, effectively support future scientific decision-making. This study focuses on population mapping, which is a classical spatialization of macroeconomic data of the social economy. Traditional population mapping based on rough grids or administrative divisions such as townships often has deficiencies in the accuracy of spatial pattern and prediction. In this article, hence, we employ residential geo-objects as basic mapping units and formalize the problem as a spatial prediction process using machine-learning (ML) methods with high-spatial-resolution (HSR) satellite remote sensing images and multisource geospatial data. The indicators of population spatial density, including residential geo-objects' area, building existence index, terrain slope, night light intensity, density of point of interest (POI) and road network from Internet electronic maps, and locational factors such as the distances from road and river, are jointly applied to establish the relationship between these multivariable factors and quantitative index of population density using ML algorithms such as Random Forests and XGBoost. The predicated values of population density from the mined nonlinear regression relation are further used to calculate the weights of disaggregation of each unit, and then the population quantity distribution at the scale of residential geo-objects is obtained under the control of the total amount of population statistics. Experiments with a county area show that the methodology has the ability to achieve better results than the traditional deterministic methods by reproducing a more accurate and finer geographic population distribution pattern. Meanwhile, it is found that the optimization of mapping results may benefit from the multisources geospatial data, and thus the methodological framework can be recommended to be extended to other spatialization areas of socioeconomic data.
topic Census data
machine-learning (ML) algorithms
multisource geospatial data
population mapping
residential geo-objects
spatialization
url https://ieeexplore.ieee.org/document/9042311/
work_keys_str_mv AT tianjunwu disaggregatingcountylevelcensusdataforpopulationmappingusingresidentialgeoobjectswithmultisourcegeospatialdata
AT jianchengluo disaggregatingcountylevelcensusdataforpopulationmappingusingresidentialgeoobjectswithmultisourcegeospatialdata
AT wendong disaggregatingcountylevelcensusdataforpopulationmappingusingresidentialgeoobjectswithmultisourcegeospatialdata
AT lijinggao disaggregatingcountylevelcensusdataforpopulationmappingusingresidentialgeoobjectswithmultisourcegeospatialdata
AT xiaodonghu disaggregatingcountylevelcensusdataforpopulationmappingusingresidentialgeoobjectswithmultisourcegeospatialdata
AT zhifengwu disaggregatingcountylevelcensusdataforpopulationmappingusingresidentialgeoobjectswithmultisourcegeospatialdata
AT yingweisun disaggregatingcountylevelcensusdataforpopulationmappingusingresidentialgeoobjectswithmultisourcegeospatialdata
AT jinsongliu disaggregatingcountylevelcensusdataforpopulationmappingusingresidentialgeoobjectswithmultisourcegeospatialdata
_version_ 1721398812848685056