A Contamination Model for Selective Editing
The aim of selective editing is to identify observations affected by influential errors. A score function based on the impact of the potential error on target estimates is useful to prioritize observations for accurate reviewing. We assume a Gaussian model for true data and an “intermittent” error m...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Sciendo
2013-12-01
|
Series: | Journal of Official Statistics |
Subjects: | |
Online Access: | https://doi.org/10.2478/jos-2013-0039 |
id |
doaj-dadaa166b678405da8317cb5dccd2249 |
---|---|
record_format |
Article |
spelling |
doaj-dadaa166b678405da8317cb5dccd22492021-09-06T19:41:46ZengSciendoJournal of Official Statistics2001-73672013-12-0129453955510.2478/jos-2013-0039A Contamination Model for Selective EditingDi Zio Marco0Guarnera Ugo1ISTAT, Italian National Institute of Statistics, Via Cesare Balbo 16, 00184 Rome, Italy.ISTAT, Italian National Institute of Statistics, Via Cesare Balbo 16, 00184 Rome, ItalyThe aim of selective editing is to identify observations affected by influential errors. A score function based on the impact of the potential error on target estimates is useful to prioritize observations for accurate reviewing. We assume a Gaussian model for true data and an “intermittent” error mechanism such that a proportion of data is contaminated by an additive Gaussian error. In this setting, scores can be related to the expected value of errors affecting data. Consequently, a set of units can be selected such that the expected residual error in data is below a prefixed threshold. In the context of economic surveys when positive variables are analyzed, the method is more realistically applied to logarithms of data instead of data in their original scale. The method is illustrated through an experimental study on real business survey data where contamination is simulated according to error mechanisms frequently encountered in the practical context of economic surveys.https://doi.org/10.2478/jos-2013-0039statistical data editinginfluential errorsfinite mixture modelsscore function |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Di Zio Marco Guarnera Ugo |
spellingShingle |
Di Zio Marco Guarnera Ugo A Contamination Model for Selective Editing Journal of Official Statistics statistical data editing influential errors finite mixture models score function |
author_facet |
Di Zio Marco Guarnera Ugo |
author_sort |
Di Zio Marco |
title |
A Contamination Model for Selective Editing |
title_short |
A Contamination Model for Selective Editing |
title_full |
A Contamination Model for Selective Editing |
title_fullStr |
A Contamination Model for Selective Editing |
title_full_unstemmed |
A Contamination Model for Selective Editing |
title_sort |
contamination model for selective editing |
publisher |
Sciendo |
series |
Journal of Official Statistics |
issn |
2001-7367 |
publishDate |
2013-12-01 |
description |
The aim of selective editing is to identify observations affected by influential errors. A score function based on the impact of the potential error on target estimates is useful to prioritize observations for accurate reviewing. We assume a Gaussian model for true data and an “intermittent” error mechanism such that a proportion of data is contaminated by an additive Gaussian error. In this setting, scores can be related to the expected value of errors affecting data. Consequently, a set of units can be selected such that the expected residual error in data is below a prefixed threshold. In the context of economic surveys when positive variables are analyzed, the method is more realistically applied to logarithms of data instead of data in their original scale. The method is illustrated through an experimental study on real business survey data where contamination is simulated according to error mechanisms frequently encountered in the practical context of economic surveys. |
topic |
statistical data editing influential errors finite mixture models score function |
url |
https://doi.org/10.2478/jos-2013-0039 |
work_keys_str_mv |
AT diziomarco acontaminationmodelforselectiveediting AT guarneraugo acontaminationmodelforselectiveediting AT diziomarco contaminationmodelforselectiveediting AT guarneraugo contaminationmodelforselectiveediting |
_version_ |
1717765504949551104 |