FORGe: prioritizing variants for graph genomes

Abstract There is growing interest in using genetic variants to augment the reference genome into a graph genome, with alternative sequences, to improve read alignment accuracy and reduce allelic bias. While adding a variant has the positive effect of removing an undesirable alignment score penalty,...

Full description

Bibliographic Details
Main Authors: Jacob Pritt, Nae-Chyun Chen, Ben Langmead
Format: Article
Language:English
Published: BMC 2018-12-01
Series:Genome Biology
Online Access:http://link.springer.com/article/10.1186/s13059-018-1595-x
id doaj-ba95cdc1b96245d4858f3ff997ddcdb4
record_format Article
spelling doaj-ba95cdc1b96245d4858f3ff997ddcdb42020-11-25T02:04:09ZengBMCGenome Biology1474-760X2018-12-0119111610.1186/s13059-018-1595-xFORGe: prioritizing variants for graph genomesJacob Pritt0Nae-Chyun Chen1Ben Langmead2Department of Computer Science, Johns Hopkins UniversityDepartment of Computer Science, Johns Hopkins UniversityDepartment of Computer Science, Johns Hopkins UniversityAbstract There is growing interest in using genetic variants to augment the reference genome into a graph genome, with alternative sequences, to improve read alignment accuracy and reduce allelic bias. While adding a variant has the positive effect of removing an undesirable alignment score penalty, it also increases both the ambiguity of the reference genome and the cost of storing and querying the genome index. We introduce methods and a software tool called FORGe for modeling these effects and prioritizing variants accordingly. We show that FORGe enables a range of advantageous and measurable trade-offs between accuracy and computational overhead.http://link.springer.com/article/10.1186/s13059-018-1595-x
collection DOAJ
language English
format Article
sources DOAJ
author Jacob Pritt
Nae-Chyun Chen
Ben Langmead
spellingShingle Jacob Pritt
Nae-Chyun Chen
Ben Langmead
FORGe: prioritizing variants for graph genomes
Genome Biology
author_facet Jacob Pritt
Nae-Chyun Chen
Ben Langmead
author_sort Jacob Pritt
title FORGe: prioritizing variants for graph genomes
title_short FORGe: prioritizing variants for graph genomes
title_full FORGe: prioritizing variants for graph genomes
title_fullStr FORGe: prioritizing variants for graph genomes
title_full_unstemmed FORGe: prioritizing variants for graph genomes
title_sort forge: prioritizing variants for graph genomes
publisher BMC
series Genome Biology
issn 1474-760X
publishDate 2018-12-01
description Abstract There is growing interest in using genetic variants to augment the reference genome into a graph genome, with alternative sequences, to improve read alignment accuracy and reduce allelic bias. While adding a variant has the positive effect of removing an undesirable alignment score penalty, it also increases both the ambiguity of the reference genome and the cost of storing and querying the genome index. We introduce methods and a software tool called FORGe for modeling these effects and prioritizing variants accordingly. We show that FORGe enables a range of advantageous and measurable trade-offs between accuracy and computational overhead.
url http://link.springer.com/article/10.1186/s13059-018-1595-x
work_keys_str_mv AT jacobpritt forgeprioritizingvariantsforgraphgenomes
AT naechyunchen forgeprioritizingvariantsforgraphgenomes
AT benlangmead forgeprioritizingvariantsforgraphgenomes
_version_ 1724944310966681600