Fast computation and applications of genome mappability.
We present a fast mapping-based algorithm to compute the mappability of each region of a reference genome up to a specified number of mismatches. Knowing the mappability of a genome is crucial for the interpretation of massively parallel sequencing experiments. We investigate the properties of the m...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2012-01-01
|
Series: | PLoS ONE |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/22276185/pdf/?tool=EBI |
id |
doaj-5bfaabaf5be54ea0877d847ee3c5f7d5 |
---|---|
record_format |
Article |
spelling |
doaj-5bfaabaf5be54ea0877d847ee3c5f7d52021-03-04T01:08:49ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0171e3037710.1371/journal.pone.0030377Fast computation and applications of genome mappability.Thomas DerrienJordi EstelléSantiago Marco SolaDavid G KnowlesEmanuele RaineriRoderic GuigóPaolo RibecaWe present a fast mapping-based algorithm to compute the mappability of each region of a reference genome up to a specified number of mismatches. Knowing the mappability of a genome is crucial for the interpretation of massively parallel sequencing experiments. We investigate the properties of the mappability of eukaryotic DNA/RNA both as a whole and at the level of the gene family, providing for various organisms tracks which allow the mappability information to be visually explored. In addition, we show that mappability varies greatly between species and gene classes. Finally, we suggest several practical applications where mappability can be used to refine the analysis of high-throughput sequencing data (SNP calling, gene expression quantification and paired-end experiments). This work highlights mappability as an important concept which deserves to be taken into full account, in particular when massively parallel sequencing technologies are employed. The GEM mappability program belongs to the GEM (GEnome Multitool) suite of programs, which can be freely downloaded for any use from its website (http://gemlibrary.sourceforge.net).https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/22276185/pdf/?tool=EBI |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Thomas Derrien Jordi Estellé Santiago Marco Sola David G Knowles Emanuele Raineri Roderic Guigó Paolo Ribeca |
spellingShingle |
Thomas Derrien Jordi Estellé Santiago Marco Sola David G Knowles Emanuele Raineri Roderic Guigó Paolo Ribeca Fast computation and applications of genome mappability. PLoS ONE |
author_facet |
Thomas Derrien Jordi Estellé Santiago Marco Sola David G Knowles Emanuele Raineri Roderic Guigó Paolo Ribeca |
author_sort |
Thomas Derrien |
title |
Fast computation and applications of genome mappability. |
title_short |
Fast computation and applications of genome mappability. |
title_full |
Fast computation and applications of genome mappability. |
title_fullStr |
Fast computation and applications of genome mappability. |
title_full_unstemmed |
Fast computation and applications of genome mappability. |
title_sort |
fast computation and applications of genome mappability. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2012-01-01 |
description |
We present a fast mapping-based algorithm to compute the mappability of each region of a reference genome up to a specified number of mismatches. Knowing the mappability of a genome is crucial for the interpretation of massively parallel sequencing experiments. We investigate the properties of the mappability of eukaryotic DNA/RNA both as a whole and at the level of the gene family, providing for various organisms tracks which allow the mappability information to be visually explored. In addition, we show that mappability varies greatly between species and gene classes. Finally, we suggest several practical applications where mappability can be used to refine the analysis of high-throughput sequencing data (SNP calling, gene expression quantification and paired-end experiments). This work highlights mappability as an important concept which deserves to be taken into full account, in particular when massively parallel sequencing technologies are employed. The GEM mappability program belongs to the GEM (GEnome Multitool) suite of programs, which can be freely downloaded for any use from its website (http://gemlibrary.sourceforge.net). |
url |
https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/22276185/pdf/?tool=EBI |
work_keys_str_mv |
AT thomasderrien fastcomputationandapplicationsofgenomemappability AT jordiestelle fastcomputationandapplicationsofgenomemappability AT santiagomarcosola fastcomputationandapplicationsofgenomemappability AT davidgknowles fastcomputationandapplicationsofgenomemappability AT emanueleraineri fastcomputationandapplicationsofgenomemappability AT rodericguigo fastcomputationandapplicationsofgenomemappability AT paoloribeca fastcomputationandapplicationsofgenomemappability |
_version_ |
1714809713047109632 |