Genomic Selection Accuracy using Historical Data Generated in a Wheat Breeding Program

Cross-validation (CRV) methods were designed to simulate genomic selection (GS) for yield in a wheat ( L.) breeding program with data of 318 genotypes grown over an 11-yr period at six locations in France. Two methods, CVSWO (cross-validation-specific without location as factor) and CVSW (cross-vali...

Full description

Bibliographic Details
Main Authors: Eric Storlie, Gilles Charmet
Format: Article
Language:English
Published: Wiley 2013-03-01
Series:The Plant Genome
Online Access:https://dl.sciencesocieties.org/publications/tpg/articles/6/1/plantgenome2013.01.0001
id doaj-e38cf4946cb944e2a3709a0505d7157a
record_format Article
spelling doaj-e38cf4946cb944e2a3709a0505d7157a2020-11-25T03:43:59ZengWileyThe Plant Genome1940-33722013-03-016110.3835/plantgenome2013.01.0001plantgenome2013.01.0001Genomic Selection Accuracy using Historical Data Generated in a Wheat Breeding ProgramEric StorlieGilles CharmetCross-validation (CRV) methods were designed to simulate genomic selection (GS) for yield in a wheat ( L.) breeding program with data of 318 genotypes grown over an 11-yr period at six locations in France. Two methods, CVSWO (cross-validation-specific without location as factor) and CVSW (cross-validation-specific with location as factor), included 11 folds, each comprising genotypes grown during a specific year and each representing target populations, while the remaining folds comprising genotypes grown during the other 10 yr represented training populations. These methods were compared with CVRWO (cross-validation-random without location as factor) and CVRW (cross-validation-random with location as factor), designed to simulate standard CRV while retaining the structure of the first two CRV methods; the same 318 genotypes were used to create 11 folds, each comprising randomly selected genotypes. Results suggest the accuracy of the CRV methods using specifically selected genotypes (correlation coefficient between (marker based) estimate of breeding value and observed phenotype [] = 0.20) based on years grown were significantly less than methods using randomly selected genotypes ( = 0.40–0.50). These results imply wheat yield is more difficult to predict for unknown, futuristic years than standard CRV methods suggest. An alternative measure of accuracy based on predicted genotypic ranks, termed predicted rank conversion (PRC), was implemented for the purpose of improving accuracies and reducing the differences between CRV methods.https://dl.sciencesocieties.org/publications/tpg/articles/6/1/plantgenome2013.01.0001
collection DOAJ
language English
format Article
sources DOAJ
author Eric Storlie
Gilles Charmet
spellingShingle Eric Storlie
Gilles Charmet
Genomic Selection Accuracy using Historical Data Generated in a Wheat Breeding Program
The Plant Genome
author_facet Eric Storlie
Gilles Charmet
author_sort Eric Storlie
title Genomic Selection Accuracy using Historical Data Generated in a Wheat Breeding Program
title_short Genomic Selection Accuracy using Historical Data Generated in a Wheat Breeding Program
title_full Genomic Selection Accuracy using Historical Data Generated in a Wheat Breeding Program
title_fullStr Genomic Selection Accuracy using Historical Data Generated in a Wheat Breeding Program
title_full_unstemmed Genomic Selection Accuracy using Historical Data Generated in a Wheat Breeding Program
title_sort genomic selection accuracy using historical data generated in a wheat breeding program
publisher Wiley
series The Plant Genome
issn 1940-3372
publishDate 2013-03-01
description Cross-validation (CRV) methods were designed to simulate genomic selection (GS) for yield in a wheat ( L.) breeding program with data of 318 genotypes grown over an 11-yr period at six locations in France. Two methods, CVSWO (cross-validation-specific without location as factor) and CVSW (cross-validation-specific with location as factor), included 11 folds, each comprising genotypes grown during a specific year and each representing target populations, while the remaining folds comprising genotypes grown during the other 10 yr represented training populations. These methods were compared with CVRWO (cross-validation-random without location as factor) and CVRW (cross-validation-random with location as factor), designed to simulate standard CRV while retaining the structure of the first two CRV methods; the same 318 genotypes were used to create 11 folds, each comprising randomly selected genotypes. Results suggest the accuracy of the CRV methods using specifically selected genotypes (correlation coefficient between (marker based) estimate of breeding value and observed phenotype [] = 0.20) based on years grown were significantly less than methods using randomly selected genotypes ( = 0.40–0.50). These results imply wheat yield is more difficult to predict for unknown, futuristic years than standard CRV methods suggest. An alternative measure of accuracy based on predicted genotypic ranks, termed predicted rank conversion (PRC), was implemented for the purpose of improving accuracies and reducing the differences between CRV methods.
url https://dl.sciencesocieties.org/publications/tpg/articles/6/1/plantgenome2013.01.0001
work_keys_str_mv AT ericstorlie genomicselectionaccuracyusinghistoricaldatageneratedinawheatbreedingprogram
AT gillescharmet genomicselectionaccuracyusinghistoricaldatageneratedinawheatbreedingprogram
_version_ 1724516948501331968