Fast computation and applications of genome mappability.

We present a fast mapping-based algorithm to compute the mappability of each region of a reference genome up to a specified number of mismatches. Knowing the mappability of a genome is crucial for the interpretation of massively parallel sequencing experiments. We investigate the properties of the m...

Full description

Bibliographic Details
Main Authors: Thomas Derrien, Jordi Estellé, Santiago Marco Sola, David G Knowles, Emanuele Raineri, Roderic Guigó, Paolo Ribeca
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2012-01-01
Series:PLoS ONE
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/22276185/pdf/?tool=EBI
id doaj-5bfaabaf5be54ea0877d847ee3c5f7d5
record_format Article
spelling doaj-5bfaabaf5be54ea0877d847ee3c5f7d52021-03-04T01:08:49ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0171e3037710.1371/journal.pone.0030377Fast computation and applications of genome mappability.Thomas DerrienJordi EstelléSantiago Marco SolaDavid G KnowlesEmanuele RaineriRoderic GuigóPaolo RibecaWe present a fast mapping-based algorithm to compute the mappability of each region of a reference genome up to a specified number of mismatches. Knowing the mappability of a genome is crucial for the interpretation of massively parallel sequencing experiments. We investigate the properties of the mappability of eukaryotic DNA/RNA both as a whole and at the level of the gene family, providing for various organisms tracks which allow the mappability information to be visually explored. In addition, we show that mappability varies greatly between species and gene classes. Finally, we suggest several practical applications where mappability can be used to refine the analysis of high-throughput sequencing data (SNP calling, gene expression quantification and paired-end experiments). This work highlights mappability as an important concept which deserves to be taken into full account, in particular when massively parallel sequencing technologies are employed. The GEM mappability program belongs to the GEM (GEnome Multitool) suite of programs, which can be freely downloaded for any use from its website (http://gemlibrary.sourceforge.net).https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/22276185/pdf/?tool=EBI
collection DOAJ
language English
format Article
sources DOAJ
author Thomas Derrien
Jordi Estellé
Santiago Marco Sola
David G Knowles
Emanuele Raineri
Roderic Guigó
Paolo Ribeca
spellingShingle Thomas Derrien
Jordi Estellé
Santiago Marco Sola
David G Knowles
Emanuele Raineri
Roderic Guigó
Paolo Ribeca
Fast computation and applications of genome mappability.
PLoS ONE
author_facet Thomas Derrien
Jordi Estellé
Santiago Marco Sola
David G Knowles
Emanuele Raineri
Roderic Guigó
Paolo Ribeca
author_sort Thomas Derrien
title Fast computation and applications of genome mappability.
title_short Fast computation and applications of genome mappability.
title_full Fast computation and applications of genome mappability.
title_fullStr Fast computation and applications of genome mappability.
title_full_unstemmed Fast computation and applications of genome mappability.
title_sort fast computation and applications of genome mappability.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2012-01-01
description We present a fast mapping-based algorithm to compute the mappability of each region of a reference genome up to a specified number of mismatches. Knowing the mappability of a genome is crucial for the interpretation of massively parallel sequencing experiments. We investigate the properties of the mappability of eukaryotic DNA/RNA both as a whole and at the level of the gene family, providing for various organisms tracks which allow the mappability information to be visually explored. In addition, we show that mappability varies greatly between species and gene classes. Finally, we suggest several practical applications where mappability can be used to refine the analysis of high-throughput sequencing data (SNP calling, gene expression quantification and paired-end experiments). This work highlights mappability as an important concept which deserves to be taken into full account, in particular when massively parallel sequencing technologies are employed. The GEM mappability program belongs to the GEM (GEnome Multitool) suite of programs, which can be freely downloaded for any use from its website (http://gemlibrary.sourceforge.net).
url https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/22276185/pdf/?tool=EBI
work_keys_str_mv AT thomasderrien fastcomputationandapplicationsofgenomemappability
AT jordiestelle fastcomputationandapplicationsofgenomemappability
AT santiagomarcosola fastcomputationandapplicationsofgenomemappability
AT davidgknowles fastcomputationandapplicationsofgenomemappability
AT emanueleraineri fastcomputationandapplicationsofgenomemappability
AT rodericguigo fastcomputationandapplicationsofgenomemappability
AT paoloribeca fastcomputationandapplicationsofgenomemappability
_version_ 1714809713047109632