Planned missing data designs and methods: Options for strengthening inference, increasing research efficiency and improving animal welfare in ecological and evolutionary research

Abstract Ecological and evolutionary research questions are increasingly requiring the integration of research fields along with larger data sets to address fundamental local‐ and global‐scale problems. Unfortunately, these agendas are often in conflict with limited funding and a need to balance ani...

Full description

Bibliographic Details
Main Authors: Daniel W. A. Noble, Shinichi Nakagawa
Format: Article
Language:English
Published: Wiley 2021-08-01
Series:Evolutionary Applications
Subjects:
Online Access:https://doi.org/10.1111/eva.13273
id doaj-26bc088cecd34b19a3241ba1f654c5c5
record_format Article
spelling doaj-26bc088cecd34b19a3241ba1f654c5c52021-08-18T13:02:00ZengWileyEvolutionary Applications1752-45712021-08-011481958196810.1111/eva.13273Planned missing data designs and methods: Options for strengthening inference, increasing research efficiency and improving animal welfare in ecological and evolutionary researchDaniel W. A. Noble0Shinichi Nakagawa1Division of Ecology and Evolution Research School of Biology The Australian National University Canberra ACT AustraliaEcology and Evolution Research Centre School of Biological, Earth and Environmental Sciences The University of New South Wales Sydney NSW AustraliaAbstract Ecological and evolutionary research questions are increasingly requiring the integration of research fields along with larger data sets to address fundamental local‐ and global‐scale problems. Unfortunately, these agendas are often in conflict with limited funding and a need to balance animal welfare concerns. Planned missing data design (PMDD), where data are randomly and deliberately missed during data collection, combined with missing data procedures, can be useful tools when working under greater research constraints. Here, we review how PMDD can be incorporated into existing experimental designs by discussing alternative design approaches and demonstrate with simulated data sets how missing data procedures work with incomplete data. PMDDs can provide researchers with a unique toolkit that can be applied during the experimental design stage. Planning and thinking about missing data early can (1) reduce research costs by allowing for the collection of less expensive measurement variables; (2) provide opportunities to distinguish predictions from alternative hypotheses by allowing more measurement variables to be collected; and (3) minimize distress caused by experimentation by reducing the reliance on invasive procedures or allowing data to be collected on fewer subjects (or less often on a given subject). PMDDs and missing data methods can even provide statistical benefits under certain situations by improving statistical power relative to a complete case design. The impacts of unplanned missing data, which can cause biases in parameter estimates and their uncertainty, can also be ameliorated using missing data procedures. PMDDs are still in their infancy. We discuss some of the difficulties in their implementation and provide tentative solutions. While PMDDs may not always be the best option, missing data procedures are becoming more sophisticated and more easily implemented and it is likely that PMDDs will be effective tools for a wide range of experimental designs, data types and problems in ecology and evolution.https://doi.org/10.1111/eva.13273data augmentationhierarchical modelsmixed effects modelsmultilevel modellingmultiple imputationmultiple working hypotheses
collection DOAJ
language English
format Article
sources DOAJ
author Daniel W. A. Noble
Shinichi Nakagawa
spellingShingle Daniel W. A. Noble
Shinichi Nakagawa
Planned missing data designs and methods: Options for strengthening inference, increasing research efficiency and improving animal welfare in ecological and evolutionary research
Evolutionary Applications
data augmentation
hierarchical models
mixed effects models
multilevel modelling
multiple imputation
multiple working hypotheses
author_facet Daniel W. A. Noble
Shinichi Nakagawa
author_sort Daniel W. A. Noble
title Planned missing data designs and methods: Options for strengthening inference, increasing research efficiency and improving animal welfare in ecological and evolutionary research
title_short Planned missing data designs and methods: Options for strengthening inference, increasing research efficiency and improving animal welfare in ecological and evolutionary research
title_full Planned missing data designs and methods: Options for strengthening inference, increasing research efficiency and improving animal welfare in ecological and evolutionary research
title_fullStr Planned missing data designs and methods: Options for strengthening inference, increasing research efficiency and improving animal welfare in ecological and evolutionary research
title_full_unstemmed Planned missing data designs and methods: Options for strengthening inference, increasing research efficiency and improving animal welfare in ecological and evolutionary research
title_sort planned missing data designs and methods: options for strengthening inference, increasing research efficiency and improving animal welfare in ecological and evolutionary research
publisher Wiley
series Evolutionary Applications
issn 1752-4571
publishDate 2021-08-01
description Abstract Ecological and evolutionary research questions are increasingly requiring the integration of research fields along with larger data sets to address fundamental local‐ and global‐scale problems. Unfortunately, these agendas are often in conflict with limited funding and a need to balance animal welfare concerns. Planned missing data design (PMDD), where data are randomly and deliberately missed during data collection, combined with missing data procedures, can be useful tools when working under greater research constraints. Here, we review how PMDD can be incorporated into existing experimental designs by discussing alternative design approaches and demonstrate with simulated data sets how missing data procedures work with incomplete data. PMDDs can provide researchers with a unique toolkit that can be applied during the experimental design stage. Planning and thinking about missing data early can (1) reduce research costs by allowing for the collection of less expensive measurement variables; (2) provide opportunities to distinguish predictions from alternative hypotheses by allowing more measurement variables to be collected; and (3) minimize distress caused by experimentation by reducing the reliance on invasive procedures or allowing data to be collected on fewer subjects (or less often on a given subject). PMDDs and missing data methods can even provide statistical benefits under certain situations by improving statistical power relative to a complete case design. The impacts of unplanned missing data, which can cause biases in parameter estimates and their uncertainty, can also be ameliorated using missing data procedures. PMDDs are still in their infancy. We discuss some of the difficulties in their implementation and provide tentative solutions. While PMDDs may not always be the best option, missing data procedures are becoming more sophisticated and more easily implemented and it is likely that PMDDs will be effective tools for a wide range of experimental designs, data types and problems in ecology and evolution.
topic data augmentation
hierarchical models
mixed effects models
multilevel modelling
multiple imputation
multiple working hypotheses
url https://doi.org/10.1111/eva.13273
work_keys_str_mv AT danielwanoble plannedmissingdatadesignsandmethodsoptionsforstrengtheninginferenceincreasingresearchefficiencyandimprovinganimalwelfareinecologicalandevolutionaryresearch
AT shinichinakagawa plannedmissingdatadesignsandmethodsoptionsforstrengtheninginferenceincreasingresearchefficiencyandimprovinganimalwelfareinecologicalandevolutionaryresearch
_version_ 1721202716948037632