A Bioinformatics Pipeline to Identify a Subset of SNPs for Genomics-Assisted Potato Breeding

Modern potato breeding methods following a genomic-led approach provide means for shortening breeding cycles and increasing breeding efficiency across selection cycles. Acquiring genetic data for large breeding populations remains expensive. We present a pipeline to reduce the number of single nucle...

Full description

Bibliographic Details
Main Authors: Catja Selga, Alexander Koc, Aakash Chawade, Rodomiro Ortiz
Format: Article
Language:English
Published: MDPI AG 2021-12-01
Series:Plants
Subjects:
Online Access:https://www.mdpi.com/2223-7747/10/1/30
id doaj-3042512452364c1696a31ffef7d29b7a
record_format Article
spelling doaj-3042512452364c1696a31ffef7d29b7a2020-12-25T00:05:08ZengMDPI AGPlants2223-77472021-12-0110303010.3390/plants10010030A Bioinformatics Pipeline to Identify a Subset of SNPs for Genomics-Assisted Potato BreedingCatja Selga0Alexander Koc1Aakash Chawade2Rodomiro Ortiz3Department of Plant Breeding, Swedish University of Agricultural Sciences (SLU), Box 101, SE-230 53 Alnarp, SwedenDepartment of Plant Breeding, Swedish University of Agricultural Sciences (SLU), Box 101, SE-230 53 Alnarp, SwedenDepartment of Plant Breeding, Swedish University of Agricultural Sciences (SLU), Box 101, SE-230 53 Alnarp, SwedenDepartment of Plant Breeding, Swedish University of Agricultural Sciences (SLU), Box 101, SE-230 53 Alnarp, SwedenModern potato breeding methods following a genomic-led approach provide means for shortening breeding cycles and increasing breeding efficiency across selection cycles. Acquiring genetic data for large breeding populations remains expensive. We present a pipeline to reduce the number of single nucleotide polymorphisms (SNPs) to lower the cost of genotyping. First, we reduced the number of individuals to be genotyped with a high-throughput method according to the multi-trait variation as defined by principal component analysis of phenotypic characteristics. Next, we reduced the number of SNPs by pruning for linkage disequilibrium. By adjusting the square of the correlation coefficient between two adjacent loci, we obtained reduced subsets of SNPs. We subsequently tested these SNP subsets by two methods; (1) a genome-wide association study (GWAS) for marker identification, and (2) genomic selection (GS) to predict genomic estimated breeding values. The results indicate that both GWAS and GS can be done without loss of information after SNP reduction. The pipeline allows for creating custom SNP subsets to cover all variation found in any particular breeding population. Low-throughput genotyping will reduce the genotyping cost associated with large populations, thereby making genomic breeding methods applicable to large potato breeding populations by reducing genotyping costs.https://www.mdpi.com/2223-7747/10/1/30linkage disequilibrium pruninggenomic selectiongenotypingGWASpotato breeding
collection DOAJ
language English
format Article
sources DOAJ
author Catja Selga
Alexander Koc
Aakash Chawade
Rodomiro Ortiz
spellingShingle Catja Selga
Alexander Koc
Aakash Chawade
Rodomiro Ortiz
A Bioinformatics Pipeline to Identify a Subset of SNPs for Genomics-Assisted Potato Breeding
Plants
linkage disequilibrium pruning
genomic selection
genotyping
GWAS
potato breeding
author_facet Catja Selga
Alexander Koc
Aakash Chawade
Rodomiro Ortiz
author_sort Catja Selga
title A Bioinformatics Pipeline to Identify a Subset of SNPs for Genomics-Assisted Potato Breeding
title_short A Bioinformatics Pipeline to Identify a Subset of SNPs for Genomics-Assisted Potato Breeding
title_full A Bioinformatics Pipeline to Identify a Subset of SNPs for Genomics-Assisted Potato Breeding
title_fullStr A Bioinformatics Pipeline to Identify a Subset of SNPs for Genomics-Assisted Potato Breeding
title_full_unstemmed A Bioinformatics Pipeline to Identify a Subset of SNPs for Genomics-Assisted Potato Breeding
title_sort bioinformatics pipeline to identify a subset of snps for genomics-assisted potato breeding
publisher MDPI AG
series Plants
issn 2223-7747
publishDate 2021-12-01
description Modern potato breeding methods following a genomic-led approach provide means for shortening breeding cycles and increasing breeding efficiency across selection cycles. Acquiring genetic data for large breeding populations remains expensive. We present a pipeline to reduce the number of single nucleotide polymorphisms (SNPs) to lower the cost of genotyping. First, we reduced the number of individuals to be genotyped with a high-throughput method according to the multi-trait variation as defined by principal component analysis of phenotypic characteristics. Next, we reduced the number of SNPs by pruning for linkage disequilibrium. By adjusting the square of the correlation coefficient between two adjacent loci, we obtained reduced subsets of SNPs. We subsequently tested these SNP subsets by two methods; (1) a genome-wide association study (GWAS) for marker identification, and (2) genomic selection (GS) to predict genomic estimated breeding values. The results indicate that both GWAS and GS can be done without loss of information after SNP reduction. The pipeline allows for creating custom SNP subsets to cover all variation found in any particular breeding population. Low-throughput genotyping will reduce the genotyping cost associated with large populations, thereby making genomic breeding methods applicable to large potato breeding populations by reducing genotyping costs.
topic linkage disequilibrium pruning
genomic selection
genotyping
GWAS
potato breeding
url https://www.mdpi.com/2223-7747/10/1/30
work_keys_str_mv AT catjaselga abioinformaticspipelinetoidentifyasubsetofsnpsforgenomicsassistedpotatobreeding
AT alexanderkoc abioinformaticspipelinetoidentifyasubsetofsnpsforgenomicsassistedpotatobreeding
AT aakashchawade abioinformaticspipelinetoidentifyasubsetofsnpsforgenomicsassistedpotatobreeding
AT rodomiroortiz abioinformaticspipelinetoidentifyasubsetofsnpsforgenomicsassistedpotatobreeding
AT catjaselga bioinformaticspipelinetoidentifyasubsetofsnpsforgenomicsassistedpotatobreeding
AT alexanderkoc bioinformaticspipelinetoidentifyasubsetofsnpsforgenomicsassistedpotatobreeding
AT aakashchawade bioinformaticspipelinetoidentifyasubsetofsnpsforgenomicsassistedpotatobreeding
AT rodomiroortiz bioinformaticspipelinetoidentifyasubsetofsnpsforgenomicsassistedpotatobreeding
_version_ 1724371427886366720