A Contamination Model for Selective Editing

The aim of selective editing is to identify observations affected by influential errors. A score function based on the impact of the potential error on target estimates is useful to prioritize observations for accurate reviewing. We assume a Gaussian model for true data and an “intermittent” error m...

Full description

Bibliographic Details
Main Authors: Di Zio Marco, Guarnera Ugo
Format: Article
Language:English
Published: Sciendo 2013-12-01
Series:Journal of Official Statistics
Subjects:
Online Access:https://doi.org/10.2478/jos-2013-0039
id doaj-dadaa166b678405da8317cb5dccd2249
record_format Article
spelling doaj-dadaa166b678405da8317cb5dccd22492021-09-06T19:41:46ZengSciendoJournal of Official Statistics2001-73672013-12-0129453955510.2478/jos-2013-0039A Contamination Model for Selective EditingDi Zio Marco0Guarnera Ugo1ISTAT, Italian National Institute of Statistics, Via Cesare Balbo 16, 00184 Rome, Italy.ISTAT, Italian National Institute of Statistics, Via Cesare Balbo 16, 00184 Rome, ItalyThe aim of selective editing is to identify observations affected by influential errors. A score function based on the impact of the potential error on target estimates is useful to prioritize observations for accurate reviewing. We assume a Gaussian model for true data and an “intermittent” error mechanism such that a proportion of data is contaminated by an additive Gaussian error. In this setting, scores can be related to the expected value of errors affecting data. Consequently, a set of units can be selected such that the expected residual error in data is below a prefixed threshold. In the context of economic surveys when positive variables are analyzed, the method is more realistically applied to logarithms of data instead of data in their original scale. The method is illustrated through an experimental study on real business survey data where contamination is simulated according to error mechanisms frequently encountered in the practical context of economic surveys.https://doi.org/10.2478/jos-2013-0039statistical data editinginfluential errorsfinite mixture modelsscore function
collection DOAJ
language English
format Article
sources DOAJ
author Di Zio Marco
Guarnera Ugo
spellingShingle Di Zio Marco
Guarnera Ugo
A Contamination Model for Selective Editing
Journal of Official Statistics
statistical data editing
influential errors
finite mixture models
score function
author_facet Di Zio Marco
Guarnera Ugo
author_sort Di Zio Marco
title A Contamination Model for Selective Editing
title_short A Contamination Model for Selective Editing
title_full A Contamination Model for Selective Editing
title_fullStr A Contamination Model for Selective Editing
title_full_unstemmed A Contamination Model for Selective Editing
title_sort contamination model for selective editing
publisher Sciendo
series Journal of Official Statistics
issn 2001-7367
publishDate 2013-12-01
description The aim of selective editing is to identify observations affected by influential errors. A score function based on the impact of the potential error on target estimates is useful to prioritize observations for accurate reviewing. We assume a Gaussian model for true data and an “intermittent” error mechanism such that a proportion of data is contaminated by an additive Gaussian error. In this setting, scores can be related to the expected value of errors affecting data. Consequently, a set of units can be selected such that the expected residual error in data is below a prefixed threshold. In the context of economic surveys when positive variables are analyzed, the method is more realistically applied to logarithms of data instead of data in their original scale. The method is illustrated through an experimental study on real business survey data where contamination is simulated according to error mechanisms frequently encountered in the practical context of economic surveys.
topic statistical data editing
influential errors
finite mixture models
score function
url https://doi.org/10.2478/jos-2013-0039
work_keys_str_mv AT diziomarco acontaminationmodelforselectiveediting
AT guarneraugo acontaminationmodelforselectiveediting
AT diziomarco contaminationmodelforselectiveediting
AT guarneraugo contaminationmodelforselectiveediting
_version_ 1717765504949551104