FORGe: prioritizing variants for graph genomes

Abstract There is growing interest in using genetic variants to augment the reference genome into a graph genome, with alternative sequences, to improve read alignment accuracy and reduce allelic bias. While adding a variant has the positive effect of removing an undesirable alignment score penalty,...

Full description

Bibliographic Details
Main Authors: Jacob Pritt, Nae-Chyun Chen, Ben Langmead
Format: Article
Language:English
Published: BMC 2018-12-01
Series:Genome Biology
Online Access:http://link.springer.com/article/10.1186/s13059-018-1595-x
Description
Summary:Abstract There is growing interest in using genetic variants to augment the reference genome into a graph genome, with alternative sequences, to improve read alignment accuracy and reduce allelic bias. While adding a variant has the positive effect of removing an undesirable alignment score penalty, it also increases both the ambiguity of the reference genome and the cost of storing and querying the genome index. We introduce methods and a software tool called FORGe for modeling these effects and prioritizing variants accordingly. We show that FORGe enables a range of advantageous and measurable trade-offs between accuracy and computational overhead.
ISSN:1474-760X