Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru.

Obtaining accurate small area estimates of population is essential for policy and health planning but is often difficult in countries with limited data. In lieu of available population data, small area estimate models draw information from previous time periods or from similar areas. This study focu...

Full description

Bibliographic Details
Main Authors: Weston Anderson, Seth Guikema, Ben Zaitchik, William Pan
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4081515?pdf=render
id doaj-3ab27644afa9457ab11677fc2d91ca35
record_format Article
spelling doaj-3ab27644afa9457ab11677fc2d91ca352020-11-25T02:04:18ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0197e10003710.1371/journal.pone.0100037Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru.Weston AndersonSeth GuikemaBen ZaitchikWilliam PanObtaining accurate small area estimates of population is essential for policy and health planning but is often difficult in countries with limited data. In lieu of available population data, small area estimate models draw information from previous time periods or from similar areas. This study focuses on model-based methods for estimating population when no direct samples are available in the area of interest. To explore the efficacy of tree-based models for estimating population density, we compare six different model structures including Random Forest and Bayesian Additive Regression Trees. Results demonstrate that without information from prior time periods, non-parametric tree-based models produced more accurate predictions than did conventional regression methods. Improving estimates of population density in non-sampled areas is important for regions with incomplete census data and has implications for economic, health and development policies.http://europepmc.org/articles/PMC4081515?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Weston Anderson
Seth Guikema
Ben Zaitchik
William Pan
spellingShingle Weston Anderson
Seth Guikema
Ben Zaitchik
William Pan
Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru.
PLoS ONE
author_facet Weston Anderson
Seth Guikema
Ben Zaitchik
William Pan
author_sort Weston Anderson
title Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru.
title_short Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru.
title_full Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru.
title_fullStr Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru.
title_full_unstemmed Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru.
title_sort methods for estimating population density in data-limited areas: evaluating regression and tree-based models in peru.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2014-01-01
description Obtaining accurate small area estimates of population is essential for policy and health planning but is often difficult in countries with limited data. In lieu of available population data, small area estimate models draw information from previous time periods or from similar areas. This study focuses on model-based methods for estimating population when no direct samples are available in the area of interest. To explore the efficacy of tree-based models for estimating population density, we compare six different model structures including Random Forest and Bayesian Additive Regression Trees. Results demonstrate that without information from prior time periods, non-parametric tree-based models produced more accurate predictions than did conventional regression methods. Improving estimates of population density in non-sampled areas is important for regions with incomplete census data and has implications for economic, health and development policies.
url http://europepmc.org/articles/PMC4081515?pdf=render
work_keys_str_mv AT westonanderson methodsforestimatingpopulationdensityindatalimitedareasevaluatingregressionandtreebasedmodelsinperu
AT sethguikema methodsforestimatingpopulationdensityindatalimitedareasevaluatingregressionandtreebasedmodelsinperu
AT benzaitchik methodsforestimatingpopulationdensityindatalimitedareasevaluatingregressionandtreebasedmodelsinperu
AT williampan methodsforestimatingpopulationdensityindatalimitedareasevaluatingregressionandtreebasedmodelsinperu
_version_ 1724943179641257984