RNA-Seq mapping and detection of gene fusions with a suffix array algorithm.

High-throughput RNA sequencing enables quantification of transcripts (both known and novel), exon/exon junctions and fusions of exons from different genes. Discovery of gene fusions-particularly those expressed with low abundance- is a challenge with short- and medium-length sequencing reads. To add...

Full description

Bibliographic Details
Main Authors: Onur Sakarya, Heinz Breu, Milan Radovich, Yongzhi Chen, Yulei N Wang, Catalin Barbacioru, Sowmi Utiramerur, Penn P Whitley, Joel P Brockman, Paolo Vatta, Zheng Zhang, Liviu Popescu, Matthew W Muller, Vidya Kudlingar, Nriti Garg, Chieh-Yuan Li, Benjamin S Kong, John P Bodeau, Robert C Nutter, Jian Gu, Kelli S Bramlett, Jeffrey K Ichikawa, Fiona C Hyland, Asim S Siddiqui
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2012-01-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC3320572?pdf=render
id doaj-74a83f81239d4243ae7b1565ad835612
record_format Article
spelling doaj-74a83f81239d4243ae7b1565ad8356122020-11-24T21:51:14ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582012-01-0184e100246410.1371/journal.pcbi.1002464RNA-Seq mapping and detection of gene fusions with a suffix array algorithm.Onur SakaryaHeinz BreuMilan RadovichYongzhi ChenYulei N WangCatalin BarbacioruSowmi UtiramerurPenn P WhitleyJoel P BrockmanPaolo VattaZheng ZhangLiviu PopescuMatthew W MullerVidya KudlingarNriti GargChieh-Yuan LiBenjamin S KongJohn P BodeauRobert C NutterJian GuKelli S BramlettJeffrey K IchikawaFiona C HylandAsim S SiddiquiHigh-throughput RNA sequencing enables quantification of transcripts (both known and novel), exon/exon junctions and fusions of exons from different genes. Discovery of gene fusions-particularly those expressed with low abundance- is a challenge with short- and medium-length sequencing reads. To address this challenge, we implemented an RNA-Seq mapping pipeline within the LifeScope software. We introduced new features including filter and junction mapping, annotation-aided pairing rescue and accurate mapping quality values. We combined this pipeline with a Suffix Array Spliced Read (SASR) aligner to detect chimeric transcripts. Performing paired-end RNA-Seq of the breast cancer cell line MCF-7 using the SOLiD system, we called 40 gene fusions among over 120,000 splicing junctions. We validated 36 of these 40 fusions with TaqMan assays, of which 25 were expressed in MCF-7 but not the Human Brain Reference. An intra-chromosomal gene fusion involving the estrogen receptor alpha gene ESR1, and another involving the RPS6KB1 (Ribosomal protein S6 kinase beta-1) were recurrently expressed in a number of breast tumor cell lines and a clinical tumor sample.http://europepmc.org/articles/PMC3320572?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Onur Sakarya
Heinz Breu
Milan Radovich
Yongzhi Chen
Yulei N Wang
Catalin Barbacioru
Sowmi Utiramerur
Penn P Whitley
Joel P Brockman
Paolo Vatta
Zheng Zhang
Liviu Popescu
Matthew W Muller
Vidya Kudlingar
Nriti Garg
Chieh-Yuan Li
Benjamin S Kong
John P Bodeau
Robert C Nutter
Jian Gu
Kelli S Bramlett
Jeffrey K Ichikawa
Fiona C Hyland
Asim S Siddiqui
spellingShingle Onur Sakarya
Heinz Breu
Milan Radovich
Yongzhi Chen
Yulei N Wang
Catalin Barbacioru
Sowmi Utiramerur
Penn P Whitley
Joel P Brockman
Paolo Vatta
Zheng Zhang
Liviu Popescu
Matthew W Muller
Vidya Kudlingar
Nriti Garg
Chieh-Yuan Li
Benjamin S Kong
John P Bodeau
Robert C Nutter
Jian Gu
Kelli S Bramlett
Jeffrey K Ichikawa
Fiona C Hyland
Asim S Siddiqui
RNA-Seq mapping and detection of gene fusions with a suffix array algorithm.
PLoS Computational Biology
author_facet Onur Sakarya
Heinz Breu
Milan Radovich
Yongzhi Chen
Yulei N Wang
Catalin Barbacioru
Sowmi Utiramerur
Penn P Whitley
Joel P Brockman
Paolo Vatta
Zheng Zhang
Liviu Popescu
Matthew W Muller
Vidya Kudlingar
Nriti Garg
Chieh-Yuan Li
Benjamin S Kong
John P Bodeau
Robert C Nutter
Jian Gu
Kelli S Bramlett
Jeffrey K Ichikawa
Fiona C Hyland
Asim S Siddiqui
author_sort Onur Sakarya
title RNA-Seq mapping and detection of gene fusions with a suffix array algorithm.
title_short RNA-Seq mapping and detection of gene fusions with a suffix array algorithm.
title_full RNA-Seq mapping and detection of gene fusions with a suffix array algorithm.
title_fullStr RNA-Seq mapping and detection of gene fusions with a suffix array algorithm.
title_full_unstemmed RNA-Seq mapping and detection of gene fusions with a suffix array algorithm.
title_sort rna-seq mapping and detection of gene fusions with a suffix array algorithm.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2012-01-01
description High-throughput RNA sequencing enables quantification of transcripts (both known and novel), exon/exon junctions and fusions of exons from different genes. Discovery of gene fusions-particularly those expressed with low abundance- is a challenge with short- and medium-length sequencing reads. To address this challenge, we implemented an RNA-Seq mapping pipeline within the LifeScope software. We introduced new features including filter and junction mapping, annotation-aided pairing rescue and accurate mapping quality values. We combined this pipeline with a Suffix Array Spliced Read (SASR) aligner to detect chimeric transcripts. Performing paired-end RNA-Seq of the breast cancer cell line MCF-7 using the SOLiD system, we called 40 gene fusions among over 120,000 splicing junctions. We validated 36 of these 40 fusions with TaqMan assays, of which 25 were expressed in MCF-7 but not the Human Brain Reference. An intra-chromosomal gene fusion involving the estrogen receptor alpha gene ESR1, and another involving the RPS6KB1 (Ribosomal protein S6 kinase beta-1) were recurrently expressed in a number of breast tumor cell lines and a clinical tumor sample.
url http://europepmc.org/articles/PMC3320572?pdf=render
work_keys_str_mv AT onursakarya rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT heinzbreu rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT milanradovich rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT yongzhichen rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT yuleinwang rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT catalinbarbacioru rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT sowmiutiramerur rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT pennpwhitley rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT joelpbrockman rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT paolovatta rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT zhengzhang rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT liviupopescu rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT matthewwmuller rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT vidyakudlingar rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT nritigarg rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT chiehyuanli rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT benjaminskong rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT johnpbodeau rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT robertcnutter rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT jiangu rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT kellisbramlett rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT jeffreykichikawa rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT fionachyland rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
AT asimssiddiqui rnaseqmappinganddetectionofgenefusionswithasuffixarrayalgorithm
_version_ 1725879826721538048