Machine learning, transcriptome, and genotyping chip analyses provide insights into SNP markers identifying flower color in Platycodon grandiflorus
Abstract Bellflower is an edible ornamental gardening plant in Asia. For predicting the flower color in bellflower plants, a transcriptome-wide approach based on machine learning, transcriptome, and genotyping chip analyses was used to identify SNP markers. Six machine learning methods were deployed...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Publishing Group
2021-04-01
|
Series: | Scientific Reports |
Online Access: | https://doi.org/10.1038/s41598-021-87281-0 |
id |
doaj-315537c8b84243b4b178f8d17ea3d7d7 |
---|---|
record_format |
Article |
spelling |
doaj-315537c8b84243b4b178f8d17ea3d7d72021-04-18T11:33:47ZengNature Publishing GroupScientific Reports2045-23222021-04-011111910.1038/s41598-021-87281-0Machine learning, transcriptome, and genotyping chip analyses provide insights into SNP markers identifying flower color in Platycodon grandiflorusGo-Eun Yu0Younhee Shin1Sathiyamoorthy Subramaniyam2Sang-Ho Kang3Si-Myung Lee4Chuloh Cho5Seung-Sik Lee6Chang-Kug Kim7Genomics Division, National Institute of Agricultural SciencesResearch and Development Center, Insilicogen Inc.Research and Development Center, Insilicogen Inc.Genomics Division, National Institute of Agricultural SciencesGenomics Division, National Institute of Agricultural SciencesCrop Foundation Research Division, National Institute of Crop Science, RDAAdvanced Radiation Technology Institute, Korea Atomic Energy Research InstituteGenomics Division, National Institute of Agricultural SciencesAbstract Bellflower is an edible ornamental gardening plant in Asia. For predicting the flower color in bellflower plants, a transcriptome-wide approach based on machine learning, transcriptome, and genotyping chip analyses was used to identify SNP markers. Six machine learning methods were deployed to explore the classification potential of the selected SNPs as features in two datasets, namely training (60 RNA-Seq samples) and validation (480 Fluidigm chip samples). SNP selection was performed in sequential order. Firstly, 96 SNPs were selected from the transcriptome-wide SNPs using the principal compound analysis (PCA). Then, 9 among 96 SNPs were later identified using the Random forest based feature selection method from the Fluidigm chip dataset. Among six machines, the random forest (RF) model produced higher classification performance than the other models. The 9 SNP marker candidates selected for classifying the flower color classification were verified using the genomic DNA PCR with Sanger sequencing. Our results suggest that this methodology could be used for future selection of breeding traits even though the plant accessions are highly heterogeneous.https://doi.org/10.1038/s41598-021-87281-0 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Go-Eun Yu Younhee Shin Sathiyamoorthy Subramaniyam Sang-Ho Kang Si-Myung Lee Chuloh Cho Seung-Sik Lee Chang-Kug Kim |
spellingShingle |
Go-Eun Yu Younhee Shin Sathiyamoorthy Subramaniyam Sang-Ho Kang Si-Myung Lee Chuloh Cho Seung-Sik Lee Chang-Kug Kim Machine learning, transcriptome, and genotyping chip analyses provide insights into SNP markers identifying flower color in Platycodon grandiflorus Scientific Reports |
author_facet |
Go-Eun Yu Younhee Shin Sathiyamoorthy Subramaniyam Sang-Ho Kang Si-Myung Lee Chuloh Cho Seung-Sik Lee Chang-Kug Kim |
author_sort |
Go-Eun Yu |
title |
Machine learning, transcriptome, and genotyping chip analyses provide insights into SNP markers identifying flower color in Platycodon grandiflorus |
title_short |
Machine learning, transcriptome, and genotyping chip analyses provide insights into SNP markers identifying flower color in Platycodon grandiflorus |
title_full |
Machine learning, transcriptome, and genotyping chip analyses provide insights into SNP markers identifying flower color in Platycodon grandiflorus |
title_fullStr |
Machine learning, transcriptome, and genotyping chip analyses provide insights into SNP markers identifying flower color in Platycodon grandiflorus |
title_full_unstemmed |
Machine learning, transcriptome, and genotyping chip analyses provide insights into SNP markers identifying flower color in Platycodon grandiflorus |
title_sort |
machine learning, transcriptome, and genotyping chip analyses provide insights into snp markers identifying flower color in platycodon grandiflorus |
publisher |
Nature Publishing Group |
series |
Scientific Reports |
issn |
2045-2322 |
publishDate |
2021-04-01 |
description |
Abstract Bellflower is an edible ornamental gardening plant in Asia. For predicting the flower color in bellflower plants, a transcriptome-wide approach based on machine learning, transcriptome, and genotyping chip analyses was used to identify SNP markers. Six machine learning methods were deployed to explore the classification potential of the selected SNPs as features in two datasets, namely training (60 RNA-Seq samples) and validation (480 Fluidigm chip samples). SNP selection was performed in sequential order. Firstly, 96 SNPs were selected from the transcriptome-wide SNPs using the principal compound analysis (PCA). Then, 9 among 96 SNPs were later identified using the Random forest based feature selection method from the Fluidigm chip dataset. Among six machines, the random forest (RF) model produced higher classification performance than the other models. The 9 SNP marker candidates selected for classifying the flower color classification were verified using the genomic DNA PCR with Sanger sequencing. Our results suggest that this methodology could be used for future selection of breeding traits even though the plant accessions are highly heterogeneous. |
url |
https://doi.org/10.1038/s41598-021-87281-0 |
work_keys_str_mv |
AT goeunyu machinelearningtranscriptomeandgenotypingchipanalysesprovideinsightsintosnpmarkersidentifyingflowercolorinplatycodongrandiflorus AT younheeshin machinelearningtranscriptomeandgenotypingchipanalysesprovideinsightsintosnpmarkersidentifyingflowercolorinplatycodongrandiflorus AT sathiyamoorthysubramaniyam machinelearningtranscriptomeandgenotypingchipanalysesprovideinsightsintosnpmarkersidentifyingflowercolorinplatycodongrandiflorus AT sanghokang machinelearningtranscriptomeandgenotypingchipanalysesprovideinsightsintosnpmarkersidentifyingflowercolorinplatycodongrandiflorus AT simyunglee machinelearningtranscriptomeandgenotypingchipanalysesprovideinsightsintosnpmarkersidentifyingflowercolorinplatycodongrandiflorus AT chulohcho machinelearningtranscriptomeandgenotypingchipanalysesprovideinsightsintosnpmarkersidentifyingflowercolorinplatycodongrandiflorus AT seungsiklee machinelearningtranscriptomeandgenotypingchipanalysesprovideinsightsintosnpmarkersidentifyingflowercolorinplatycodongrandiflorus AT changkugkim machinelearningtranscriptomeandgenotypingchipanalysesprovideinsightsintosnpmarkersidentifyingflowercolorinplatycodongrandiflorus |
_version_ |
1721522140324298752 |