Accurate, Efficient and User-Friendly Mutation Calling and Sample Identification for TILLING Experiments

TILLING (Targeting Induced Local Lesions IN Genomes) is a powerful reverse genetics method in plant functional genomics and breeding to identify mutagenized individuals with improved behavior for a trait of interest. Pooled high throughput sequencing (HTS) of the targeted genes allows efficient iden...

Full description

Bibliographic Details
Main Authors: Juanita Gil, Juan Sebastian Andrade-Martínez, Jorge Duitama
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-02-01
Series:Frontiers in Genetics
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fgene.2021.624513/full
id doaj-3d44475e1e9e45d1accb35d680acea26
record_format Article
spelling doaj-3d44475e1e9e45d1accb35d680acea262021-02-03T04:50:56ZengFrontiers Media S.A.Frontiers in Genetics1664-80212021-02-011210.3389/fgene.2021.624513624513Accurate, Efficient and User-Friendly Mutation Calling and Sample Identification for TILLING ExperimentsJuanita Gil0Juan Sebastian Andrade-Martínez1Juan Sebastian Andrade-Martínez2Jorge Duitama3Systems and Computing Engineering Department, Universidad de Los Andes, Bogotá, ColombiaResearch Group on Computational Biology and Microbial Ecology, Department of Biological Sciences, Universidad de Los Andes, Bogotá, ColombiaMax Planck Tandem Group in Computational Biology, Universidad de Los Andes, Bogotá, ColombiaSystems and Computing Engineering Department, Universidad de Los Andes, Bogotá, ColombiaTILLING (Targeting Induced Local Lesions IN Genomes) is a powerful reverse genetics method in plant functional genomics and breeding to identify mutagenized individuals with improved behavior for a trait of interest. Pooled high throughput sequencing (HTS) of the targeted genes allows efficient identification and sample assignment of variants within genes of interest in hundreds of individuals. Although TILLING has been used successfully in different crops and even applied to natural populations, one of the main issues for a successful TILLING experiment is that most currently available bioinformatics tools for variant detection are not designed to identify mutations with low frequencies in pooled samples or to perform sample identification from variants identified in overlapping pools. Our research group maintains the Next Generation Sequencing Experience Platform (NGSEP), an open source solution for analysis of HTS data. In this manuscript, we present three novel components within NGSEP to facilitate the design and analysis of TILLING experiments: a pooled variants detector, a sample identifier from variants detected in overlapping pools and a simulator of TILLING experiments. A new implementation of the NGSEP calling model for variant detection allows accurate detection of low frequency mutations within pools. The samples identifier implements the process to triangulate the mutations called within overlapping pools in order to assign mutations to single individuals whenever possible. Finally, we developed a complete simulator of TILLING experiments to enable benchmarking of different tools and to facilitate the design of experimental alternatives varying the number of pools and individuals per pool. Simulation experiments based on genes from the common bean genome indicate that NGSEP provides similar accuracy and better efficiency than other tools to perform pooled variants detection. To the best of our knowledge, NGSEP is currently the only tool that generates individual assignments of the mutations discovered from the pooled data. We expect that this development will be of great use for different groups implementing TILLING as an alternative for plant breeding and even to research groups performing pooled sequencing for other applications.https://www.frontiersin.org/articles/10.3389/fgene.2021.624513/fullsoftwaremutagenesisfunctional genomicsTILLINGvariants detection
collection DOAJ
language English
format Article
sources DOAJ
author Juanita Gil
Juan Sebastian Andrade-Martínez
Juan Sebastian Andrade-Martínez
Jorge Duitama
spellingShingle Juanita Gil
Juan Sebastian Andrade-Martínez
Juan Sebastian Andrade-Martínez
Jorge Duitama
Accurate, Efficient and User-Friendly Mutation Calling and Sample Identification for TILLING Experiments
Frontiers in Genetics
software
mutagenesis
functional genomics
TILLING
variants detection
author_facet Juanita Gil
Juan Sebastian Andrade-Martínez
Juan Sebastian Andrade-Martínez
Jorge Duitama
author_sort Juanita Gil
title Accurate, Efficient and User-Friendly Mutation Calling and Sample Identification for TILLING Experiments
title_short Accurate, Efficient and User-Friendly Mutation Calling and Sample Identification for TILLING Experiments
title_full Accurate, Efficient and User-Friendly Mutation Calling and Sample Identification for TILLING Experiments
title_fullStr Accurate, Efficient and User-Friendly Mutation Calling and Sample Identification for TILLING Experiments
title_full_unstemmed Accurate, Efficient and User-Friendly Mutation Calling and Sample Identification for TILLING Experiments
title_sort accurate, efficient and user-friendly mutation calling and sample identification for tilling experiments
publisher Frontiers Media S.A.
series Frontiers in Genetics
issn 1664-8021
publishDate 2021-02-01
description TILLING (Targeting Induced Local Lesions IN Genomes) is a powerful reverse genetics method in plant functional genomics and breeding to identify mutagenized individuals with improved behavior for a trait of interest. Pooled high throughput sequencing (HTS) of the targeted genes allows efficient identification and sample assignment of variants within genes of interest in hundreds of individuals. Although TILLING has been used successfully in different crops and even applied to natural populations, one of the main issues for a successful TILLING experiment is that most currently available bioinformatics tools for variant detection are not designed to identify mutations with low frequencies in pooled samples or to perform sample identification from variants identified in overlapping pools. Our research group maintains the Next Generation Sequencing Experience Platform (NGSEP), an open source solution for analysis of HTS data. In this manuscript, we present three novel components within NGSEP to facilitate the design and analysis of TILLING experiments: a pooled variants detector, a sample identifier from variants detected in overlapping pools and a simulator of TILLING experiments. A new implementation of the NGSEP calling model for variant detection allows accurate detection of low frequency mutations within pools. The samples identifier implements the process to triangulate the mutations called within overlapping pools in order to assign mutations to single individuals whenever possible. Finally, we developed a complete simulator of TILLING experiments to enable benchmarking of different tools and to facilitate the design of experimental alternatives varying the number of pools and individuals per pool. Simulation experiments based on genes from the common bean genome indicate that NGSEP provides similar accuracy and better efficiency than other tools to perform pooled variants detection. To the best of our knowledge, NGSEP is currently the only tool that generates individual assignments of the mutations discovered from the pooled data. We expect that this development will be of great use for different groups implementing TILLING as an alternative for plant breeding and even to research groups performing pooled sequencing for other applications.
topic software
mutagenesis
functional genomics
TILLING
variants detection
url https://www.frontiersin.org/articles/10.3389/fgene.2021.624513/full
work_keys_str_mv AT juanitagil accurateefficientanduserfriendlymutationcallingandsampleidentificationfortillingexperiments
AT juansebastianandrademartinez accurateefficientanduserfriendlymutationcallingandsampleidentificationfortillingexperiments
AT juansebastianandrademartinez accurateefficientanduserfriendlymutationcallingandsampleidentificationfortillingexperiments
AT jorgeduitama accurateefficientanduserfriendlymutationcallingandsampleidentificationfortillingexperiments
_version_ 1724289368902860800