Defining Populations Within Linked Administrative Data

This presentation will outline methods developed by the ABS to define populations within linked administrative data. The methods developed use a combination of both direct and indirect signals (referred to as ‘signs of life’) to infer whether individuals are in a particular population at a given po...

Full description

Bibliographic Details
Main Author: Duncan Wijnberg
Format: Article
Language:English
Published: Swansea University 2020-12-01
Series:International Journal of Population Data Science
Online Access:https://ijpds.org/article/view/1533
id doaj-0556eac82f254117a66ad281e2fcbf73
record_format Article
spelling doaj-0556eac82f254117a66ad281e2fcbf732021-02-10T16:42:36ZengSwansea UniversityInternational Journal of Population Data Science2399-49082020-12-015510.23889/ijpds.v5i5.1533Defining Populations Within Linked Administrative DataDuncan Wijnberg0Australian Bureau of Statistics This presentation will outline methods developed by the ABS to define populations within linked administrative data. The methods developed use a combination of both direct and indirect signals (referred to as ‘signs of life’) to infer whether individuals are in a particular population at a given point in time. Data from multiple government sources are used. Introduction Linked administrative datasets hold significant potential to unlock new insights and understanding about different populations of interest. A key challenge is to create a research dataset that is properly representative of the population of interest at a given point in time. Without a representative population to serve as the basis of analysis, research outcomes are harder to interpret and compare against those from other populations. This has flow on consequences for any research findings derived from linked admin data. Objectives and Approach This project sought to develop methods to define a representative Australian population from the Multi Agency Data Integration Project (MADIP) data asset. The core of the asset is created through a three-way linkage between Australian Medicare, Social Security and Taxation datasets. Together, these datasets have very high coverage of the Australian population and enable high quality linkage of other datasets into the asset. A ‘signs of life’ approach was used that sought to distinguish a representative population at a given point in time from the MADIP asset. Results An experimental representative population has been developed from the linkage spine that closely approximates the national age distribution and other breakdowns of the ABS’s Estimated Resident Population. Conclusion / Implications This work demonstrates one approach that can be used to derive useful analytical populations from linked datasets with overly exhaustive scopes. https://ijpds.org/article/view/1533
collection DOAJ
language English
format Article
sources DOAJ
author Duncan Wijnberg
spellingShingle Duncan Wijnberg
Defining Populations Within Linked Administrative Data
International Journal of Population Data Science
author_facet Duncan Wijnberg
author_sort Duncan Wijnberg
title Defining Populations Within Linked Administrative Data
title_short Defining Populations Within Linked Administrative Data
title_full Defining Populations Within Linked Administrative Data
title_fullStr Defining Populations Within Linked Administrative Data
title_full_unstemmed Defining Populations Within Linked Administrative Data
title_sort defining populations within linked administrative data
publisher Swansea University
series International Journal of Population Data Science
issn 2399-4908
publishDate 2020-12-01
description This presentation will outline methods developed by the ABS to define populations within linked administrative data. The methods developed use a combination of both direct and indirect signals (referred to as ‘signs of life’) to infer whether individuals are in a particular population at a given point in time. Data from multiple government sources are used. Introduction Linked administrative datasets hold significant potential to unlock new insights and understanding about different populations of interest. A key challenge is to create a research dataset that is properly representative of the population of interest at a given point in time. Without a representative population to serve as the basis of analysis, research outcomes are harder to interpret and compare against those from other populations. This has flow on consequences for any research findings derived from linked admin data. Objectives and Approach This project sought to develop methods to define a representative Australian population from the Multi Agency Data Integration Project (MADIP) data asset. The core of the asset is created through a three-way linkage between Australian Medicare, Social Security and Taxation datasets. Together, these datasets have very high coverage of the Australian population and enable high quality linkage of other datasets into the asset. A ‘signs of life’ approach was used that sought to distinguish a representative population at a given point in time from the MADIP asset. Results An experimental representative population has been developed from the linkage spine that closely approximates the national age distribution and other breakdowns of the ABS’s Estimated Resident Population. Conclusion / Implications This work demonstrates one approach that can be used to derive useful analytical populations from linked datasets with overly exhaustive scopes.
url https://ijpds.org/article/view/1533
work_keys_str_mv AT duncanwijnberg definingpopulationswithinlinkedadministrativedata
_version_ 1724275137879998464