Local Population Mapping Using a Random Forest Model Based on Remote and Social Sensing Data: A Case Study in Zhengzhou, China

High-resolution gridded population data are important for understanding and responding to many socioeconomic and environmental problems. Local estimates of the population allow officials and researchers to make a better local planning (e.g., optimizing public services and facilities). This study use...

Full description

Bibliographic Details
Main Authors: Ge Qiu, Yuhai Bao, Xuchao Yang, Chen Wang, Tingting Ye, Alfred Stein, Peng Jia
Format: Article
Language:English
Published: MDPI AG 2020-05-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/12/10/1618
id doaj-0c2aa7af8e3e4c099b95154be2d69c50
record_format Article
spelling doaj-0c2aa7af8e3e4c099b95154be2d69c502020-11-25T02:18:53ZengMDPI AGRemote Sensing2072-42922020-05-01121618161810.3390/rs12101618Local Population Mapping Using a Random Forest Model Based on Remote and Social Sensing Data: A Case Study in Zhengzhou, ChinaGe Qiu0Yuhai Bao1Xuchao Yang2Chen Wang3Tingting Ye4Alfred Stein5Peng Jia6College of Geographic Science, Inner Mongolia Normal University, Huhhot 010022, ChinaCollege of Geographic Science, Inner Mongolia Normal University, Huhhot 010022, ChinaOcean College, Zhejiang University, Zhoushan 316021, ChinaSatellite Application Center for Ecology and Environment, Ministry of Ecology and Environment, Beijing 100094, ChinaOcean College, Zhejiang University, Zhoushan 316021, ChinaFaculty of Geo-Information Science and Earth Observation (ITC), University of Twente, 7500 Enschede, The NetherlandsFaculty of Geo-Information Science and Earth Observation (ITC), University of Twente, 7500 Enschede, The NetherlandsHigh-resolution gridded population data are important for understanding and responding to many socioeconomic and environmental problems. Local estimates of the population allow officials and researchers to make a better local planning (e.g., optimizing public services and facilities). This study used a random forest algorithm, on the basis of remote sensing (i.e., satellite imagery) and social sensing data (i.e., point-of-interest and building footprint), to disaggregate census population data for the five municipal districts of Zhengzhou city, China, onto 100 × 100 m grid cells. We used a statistical tool to detect areas with an abnormal population density; e.g., areas containing many empty houses or houses rented by more people than allowed, and conducted field work to validate our findings. Results showed that some categories of points-of-interest, such as residential communities, parking lots, banks, and government buildings were the most important contributing elements in modeling the spatial distribution of the residential population in Zhengzhou City. The exclusion of areas with an abnormal population density from model training and dasymetric mapping increased the accuracy of population estimates in other areas with a more common population density. We compared our product with three widely used gridded population products: Worldpop, the Gridded Population of the World, and the 1-km Grid Population Dataset of China. The relative accuracy of our modeling approach was higher than that of those three products in the five municipal districts of Zhengzhou. This study demonstrated potential for the combination of remote and social sensing data to more accurately estimate the population density in urban areas, with minimum disturbance from the abnormal population density.https://www.mdpi.com/2072-4292/12/10/1618population distributionrandom forestremote sensingsocial sensingpoint-of-interestbuilding footprint
collection DOAJ
language English
format Article
sources DOAJ
author Ge Qiu
Yuhai Bao
Xuchao Yang
Chen Wang
Tingting Ye
Alfred Stein
Peng Jia
spellingShingle Ge Qiu
Yuhai Bao
Xuchao Yang
Chen Wang
Tingting Ye
Alfred Stein
Peng Jia
Local Population Mapping Using a Random Forest Model Based on Remote and Social Sensing Data: A Case Study in Zhengzhou, China
Remote Sensing
population distribution
random forest
remote sensing
social sensing
point-of-interest
building footprint
author_facet Ge Qiu
Yuhai Bao
Xuchao Yang
Chen Wang
Tingting Ye
Alfred Stein
Peng Jia
author_sort Ge Qiu
title Local Population Mapping Using a Random Forest Model Based on Remote and Social Sensing Data: A Case Study in Zhengzhou, China
title_short Local Population Mapping Using a Random Forest Model Based on Remote and Social Sensing Data: A Case Study in Zhengzhou, China
title_full Local Population Mapping Using a Random Forest Model Based on Remote and Social Sensing Data: A Case Study in Zhengzhou, China
title_fullStr Local Population Mapping Using a Random Forest Model Based on Remote and Social Sensing Data: A Case Study in Zhengzhou, China
title_full_unstemmed Local Population Mapping Using a Random Forest Model Based on Remote and Social Sensing Data: A Case Study in Zhengzhou, China
title_sort local population mapping using a random forest model based on remote and social sensing data: a case study in zhengzhou, china
publisher MDPI AG
series Remote Sensing
issn 2072-4292
publishDate 2020-05-01
description High-resolution gridded population data are important for understanding and responding to many socioeconomic and environmental problems. Local estimates of the population allow officials and researchers to make a better local planning (e.g., optimizing public services and facilities). This study used a random forest algorithm, on the basis of remote sensing (i.e., satellite imagery) and social sensing data (i.e., point-of-interest and building footprint), to disaggregate census population data for the five municipal districts of Zhengzhou city, China, onto 100 × 100 m grid cells. We used a statistical tool to detect areas with an abnormal population density; e.g., areas containing many empty houses or houses rented by more people than allowed, and conducted field work to validate our findings. Results showed that some categories of points-of-interest, such as residential communities, parking lots, banks, and government buildings were the most important contributing elements in modeling the spatial distribution of the residential population in Zhengzhou City. The exclusion of areas with an abnormal population density from model training and dasymetric mapping increased the accuracy of population estimates in other areas with a more common population density. We compared our product with three widely used gridded population products: Worldpop, the Gridded Population of the World, and the 1-km Grid Population Dataset of China. The relative accuracy of our modeling approach was higher than that of those three products in the five municipal districts of Zhengzhou. This study demonstrated potential for the combination of remote and social sensing data to more accurately estimate the population density in urban areas, with minimum disturbance from the abnormal population density.
topic population distribution
random forest
remote sensing
social sensing
point-of-interest
building footprint
url https://www.mdpi.com/2072-4292/12/10/1618
work_keys_str_mv AT geqiu localpopulationmappingusingarandomforestmodelbasedonremoteandsocialsensingdataacasestudyinzhengzhouchina
AT yuhaibao localpopulationmappingusingarandomforestmodelbasedonremoteandsocialsensingdataacasestudyinzhengzhouchina
AT xuchaoyang localpopulationmappingusingarandomforestmodelbasedonremoteandsocialsensingdataacasestudyinzhengzhouchina
AT chenwang localpopulationmappingusingarandomforestmodelbasedonremoteandsocialsensingdataacasestudyinzhengzhouchina
AT tingtingye localpopulationmappingusingarandomforestmodelbasedonremoteandsocialsensingdataacasestudyinzhengzhouchina
AT alfredstein localpopulationmappingusingarandomforestmodelbasedonremoteandsocialsensingdataacasestudyinzhengzhouchina
AT pengjia localpopulationmappingusingarandomforestmodelbasedonremoteandsocialsensingdataacasestudyinzhengzhouchina
_version_ 1724880104308342784