POPBAM: Tools for Evolutionary Analysis of Short Read Sequence Alignments

Background While many bioinformatics tools currently exist for assembling and discovering variants from next-generation sequence data, there are very few tools available for performing evolutionary analyses from these data. Evolutionary and population genomics studies hold great promise for providin...

Full description

Bibliographic Details
Main Author:	Daniel Garrigan
Format:	Article
Language:	English
Published:	SAGE Publishing 2013-01-01
Series:	Evolutionary Bioinformatics
Online Access:	https://doi.org/10.4137/EBO.S12751

id	doaj-effe942c0630463ebcde46665a94bd53
record_format	Article
spelling	doaj-effe942c0630463ebcde46665a94bd532020-11-25T01:27:14ZengSAGE PublishingEvolutionary Bioinformatics1176-93432013-01-01910.4137/EBO.S12751POPBAM: Tools for Evolutionary Analysis of Short Read Sequence AlignmentsDaniel Garrigan0Department of Biology, University of Rochester, Rochester, New York 14627 USA.Background While many bioinformatics tools currently exist for assembling and discovering variants from next-generation sequence data, there are very few tools available for performing evolutionary analyses from these data. Evolutionary and population genomics studies hold great promise for providing valuable insights into natural selection, the effect of mutations on phenotypes, and the origin of species. Thus, there is a need for an extensible and flexible computational tool that can function into a growing number of evolutionary bioinformatics pipelines. Results This paper describes the POPBAM software, which is a comprehensive set of computational tools for evolutionary analysis of whole-genome alignments consisting of multiple individuals, from multiple populations or species. POPBAM works directly from BAM-formatted assembly files, calls variant sites, and calculates a variety of commonly used evolutionary sequence statistics. POPBAM is designed primarily to perform analyses in sliding windows across chromosomes or scaffolds. POPBAM accurately measures nucleotide diversity, population divergence, linkage disequilibrium, and the frequency spectrum of mutations from two or more populations. POPBAM can also produce phylogenetic trees of all samples in a BAM file. Finally, I demonstrate that the implementation of POPBAM is both fast and memory-efficient, and also can feasibly scale to the analysis of large BAM files with many individuals and populations. Software The POPBAM program is written in C/C++ and is available from http://dgarriga.github.io/POPBAM . The program has few dependencies and can be built on a variety of Linux platforms. The program is open-source and users are encouraged to participate in the development of this resource.https://doi.org/10.4137/EBO.S12751
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Daniel Garrigan
spellingShingle	Daniel Garrigan POPBAM: Tools for Evolutionary Analysis of Short Read Sequence Alignments Evolutionary Bioinformatics
author_facet	Daniel Garrigan
author_sort	Daniel Garrigan
title	POPBAM: Tools for Evolutionary Analysis of Short Read Sequence Alignments
title_short	POPBAM: Tools for Evolutionary Analysis of Short Read Sequence Alignments
title_full	POPBAM: Tools for Evolutionary Analysis of Short Read Sequence Alignments
title_fullStr	POPBAM: Tools for Evolutionary Analysis of Short Read Sequence Alignments
title_full_unstemmed	POPBAM: Tools for Evolutionary Analysis of Short Read Sequence Alignments
title_sort	popbam: tools for evolutionary analysis of short read sequence alignments
publisher	SAGE Publishing
series	Evolutionary Bioinformatics
issn	1176-9343
publishDate	2013-01-01
description	Background While many bioinformatics tools currently exist for assembling and discovering variants from next-generation sequence data, there are very few tools available for performing evolutionary analyses from these data. Evolutionary and population genomics studies hold great promise for providing valuable insights into natural selection, the effect of mutations on phenotypes, and the origin of species. Thus, there is a need for an extensible and flexible computational tool that can function into a growing number of evolutionary bioinformatics pipelines. Results This paper describes the POPBAM software, which is a comprehensive set of computational tools for evolutionary analysis of whole-genome alignments consisting of multiple individuals, from multiple populations or species. POPBAM works directly from BAM-formatted assembly files, calls variant sites, and calculates a variety of commonly used evolutionary sequence statistics. POPBAM is designed primarily to perform analyses in sliding windows across chromosomes or scaffolds. POPBAM accurately measures nucleotide diversity, population divergence, linkage disequilibrium, and the frequency spectrum of mutations from two or more populations. POPBAM can also produce phylogenetic trees of all samples in a BAM file. Finally, I demonstrate that the implementation of POPBAM is both fast and memory-efficient, and also can feasibly scale to the analysis of large BAM files with many individuals and populations. Software The POPBAM program is written in C/C++ and is available from http://dgarriga.github.io/POPBAM . The program has few dependencies and can be built on a variety of Linux platforms. The program is open-source and users are encouraged to participate in the development of this resource.
url	https://doi.org/10.4137/EBO.S12751
work_keys_str_mv	AT danielgarrigan popbamtoolsforevolutionaryanalysisofshortreadsequencealignments
_version_	1725105915832565760

POPBAM: Tools for Evolutionary Analysis of Short Read Sequence Alignments

Similar Items