Internal Migration Reconciliation for the Kintampo HDSS Core Datasets using Probabilistic Record Linkage Techniques

A research report submitted to the Faculty of Health Science in partial fulfillment of the requirements for the degree of Master of Science (MSc) in Epidemiology - Research Data Management March, 2018. === Internal migration reconciliation involves the tracking of internal migrants to link their...

Full description

Bibliographic Details
Main Author: Adda, Robert Awiah
Format: Others
Language:en
Published: 2018
Online Access:https://hdl.handle.net/10539/25302
id ndltd-netd.ac.za-oai-union.ndltd.org-wits-oai-wiredspace.wits.ac.za-10539-25302
record_format oai_dc
spelling ndltd-netd.ac.za-oai-union.ndltd.org-wits-oai-wiredspace.wits.ac.za-10539-253022019-05-11T03:41:20Z Internal Migration Reconciliation for the Kintampo HDSS Core Datasets using Probabilistic Record Linkage Techniques Adda, Robert Awiah A research report submitted to the Faculty of Health Science in partial fulfillment of the requirements for the degree of Master of Science (MSc) in Epidemiology - Research Data Management March, 2018. Internal migration reconciliation involves the tracking of internal migrants to link their places of origin and destination for each movement within a given Health and Demographic Surveillance Area (HDSS) site. This involves several related activities adopted to account for the time the person moves from one location to the other but, remaining under the HDSS surveillance. This poses a major challenge for longitudinal studies particularly Kintampo HDSS site, where manual data capture modality is still being used. For most HDSS sites, all data related operations rely much on an individual unique personal identifier which is issued to identify a resident member during registration. The identifier enables the data system to keep track of resident members over an extended period of time to enable an accurate estimation of the population under surveillance. Movement of resident members within the geographical boundaries of the surveillance areas must be tracked based on these personal identifiers. However, residents’ records may not be linked to each other in the event of multiple movements when the personal identifier cannot be recorded or is wrongly recorded. The effort in reconciling such cases of inconsistencies resulting from internal migration involves the printing of mismatched records for field supervisors to trace back to original locations to ascertain the identity of migrants. This process is very expensive as a full-time field supervisor is required and the reconciliation process is time demanding. In this project, we explored alternative automatic and cheaper method of reconciling all categories of discrepancies that exist in the internal migration datasets using the probabilistic record linking techniques.A theoretical foundation for probabilistic record linkage technology was provided and the sequential order was followed.The EM algorithm was used in the estimation of parameters. This research report demonstrate clearly that the probabilistic record linkage frame work by Fellegi and sunter is useful for HDSS internal migration reconciliation. The EM algorithm showed an improved performance in terms of linked records compare to the probabilistic framework. However, more work needs to done to explore other parameter estimation algorithm such as the Frequency-based EM algorithm. Such results can be compared to the results in this report. LG2018 2018-08-14T06:44:14Z 2018-08-14T06:44:14Z 2018 Thesis https://hdl.handle.net/10539/25302 en application/pdf
collection NDLTD
language en
format Others
sources NDLTD
description A research report submitted to the Faculty of Health Science in partial fulfillment of the requirements for the degree of Master of Science (MSc) in Epidemiology - Research Data Management March, 2018. === Internal migration reconciliation involves the tracking of internal migrants to link their places of origin and destination for each movement within a given Health and Demographic Surveillance Area (HDSS) site. This involves several related activities adopted to account for the time the person moves from one location to the other but, remaining under the HDSS surveillance. This poses a major challenge for longitudinal studies particularly Kintampo HDSS site, where manual data capture modality is still being used. For most HDSS sites, all data related operations rely much on an individual unique personal identifier which is issued to identify a resident member during registration. The identifier enables the data system to keep track of resident members over an extended period of time to enable an accurate estimation of the population under surveillance. Movement of resident members within the geographical boundaries of the surveillance areas must be tracked based on these personal identifiers. However, residents’ records may not be linked to each other in the event of multiple movements when the personal identifier cannot be recorded or is wrongly recorded. The effort in reconciling such cases of inconsistencies resulting from internal migration involves the printing of mismatched records for field supervisors to trace back to original locations to ascertain the identity of migrants. This process is very expensive as a full-time field supervisor is required and the reconciliation process is time demanding. In this project, we explored alternative automatic and cheaper method of reconciling all categories of discrepancies that exist in the internal migration datasets using the probabilistic record linking techniques.A theoretical foundation for probabilistic record linkage technology was provided and the sequential order was followed.The EM algorithm was used in the estimation of parameters. This research report demonstrate clearly that the probabilistic record linkage frame work by Fellegi and sunter is useful for HDSS internal migration reconciliation. The EM algorithm showed an improved performance in terms of linked records compare to the probabilistic framework. However, more work needs to done to explore other parameter estimation algorithm such as the Frequency-based EM algorithm. Such results can be compared to the results in this report. === LG2018
author Adda, Robert Awiah
spellingShingle Adda, Robert Awiah
Internal Migration Reconciliation for the Kintampo HDSS Core Datasets using Probabilistic Record Linkage Techniques
author_facet Adda, Robert Awiah
author_sort Adda, Robert Awiah
title Internal Migration Reconciliation for the Kintampo HDSS Core Datasets using Probabilistic Record Linkage Techniques
title_short Internal Migration Reconciliation for the Kintampo HDSS Core Datasets using Probabilistic Record Linkage Techniques
title_full Internal Migration Reconciliation for the Kintampo HDSS Core Datasets using Probabilistic Record Linkage Techniques
title_fullStr Internal Migration Reconciliation for the Kintampo HDSS Core Datasets using Probabilistic Record Linkage Techniques
title_full_unstemmed Internal Migration Reconciliation for the Kintampo HDSS Core Datasets using Probabilistic Record Linkage Techniques
title_sort internal migration reconciliation for the kintampo hdss core datasets using probabilistic record linkage techniques
publishDate 2018
url https://hdl.handle.net/10539/25302
work_keys_str_mv AT addarobertawiah internalmigrationreconciliationforthekintampohdsscoredatasetsusingprobabilisticrecordlinkagetechniques
_version_ 1719083639333978112