RNA variant identification discrepancy among splice-aware alignment algorithms.

Next-generation sequencing (NGS) techniques have been generating various molecular maps, including transcriptomes via RNA-seq. Although the primary purpose of RNA-seq is to quantify the expression level of known genes, RNA variants are also identifiable. However, care must be taken to account for RN...

Full description

Bibliographic Details
Main Authors:	Ji Hyung Hong, Yoon Ho Ko, Keunsoo Kang
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2018-01-01
Series:	PLoS ONE
Online Access:	http://europepmc.org/articles/PMC6072070?pdf=render

id	doaj-c2190ba7c35649dbabdd646646df93cd
record_format	Article
spelling	doaj-c2190ba7c35649dbabdd646646df93cd2020-11-25T01:22:52ZengPublic Library of Science (PLoS)PLoS ONE1932-62032018-01-01138e020182210.1371/journal.pone.0201822RNA variant identification discrepancy among splice-aware alignment algorithms.Ji Hyung HongYoon Ho KoKeunsoo KangNext-generation sequencing (NGS) techniques have been generating various molecular maps, including transcriptomes via RNA-seq. Although the primary purpose of RNA-seq is to quantify the expression level of known genes, RNA variants are also identifiable. However, care must be taken to account for RNA's dynamic nature. In this study, we evaluated the following popular splice-aware alignment algorithms in the context of RNA variant-calling analysis: HISAT2, STAR, STAR (two-pass mode), Subread, and Subjunc. For this, we performed RNA-seq with ten pieces of invasive ductal carcinoma from breast tissue and three pieces of adjacent normal tissue from a single patient. These RNA-seq data were used to evaluate the performance of splice-aware aligners. Surprisingly, the number of common potential RNA editing sites (pRESs) identified by all alignment algorithms was less than 2% of the total. The main cause of this difference was the mapped reads on the splice junctions. In addition, the RNA quality significantly affected the outcome. Therefore, researchers must consider these experimental and bioinformatic features during RNA variant analysis. Further investigations of common pRESs discovered that BDH1, CCDC137, and TBC1D10A transcripts contained a single non-synonymous RNA variant that was unique to breast cancer tissue compared to adjacent normal tissue; thus, further clinical validation is required.http://europepmc.org/articles/PMC6072070?pdf=render
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Ji Hyung Hong Yoon Ho Ko Keunsoo Kang
spellingShingle	Ji Hyung Hong Yoon Ho Ko Keunsoo Kang RNA variant identification discrepancy among splice-aware alignment algorithms. PLoS ONE
author_facet	Ji Hyung Hong Yoon Ho Ko Keunsoo Kang
author_sort	Ji Hyung Hong
title	RNA variant identification discrepancy among splice-aware alignment algorithms.
title_short	RNA variant identification discrepancy among splice-aware alignment algorithms.
title_full	RNA variant identification discrepancy among splice-aware alignment algorithms.
title_fullStr	RNA variant identification discrepancy among splice-aware alignment algorithms.
title_full_unstemmed	RNA variant identification discrepancy among splice-aware alignment algorithms.
title_sort	rna variant identification discrepancy among splice-aware alignment algorithms.
publisher	Public Library of Science (PLoS)
series	PLoS ONE
issn	1932-6203
publishDate	2018-01-01
description	Next-generation sequencing (NGS) techniques have been generating various molecular maps, including transcriptomes via RNA-seq. Although the primary purpose of RNA-seq is to quantify the expression level of known genes, RNA variants are also identifiable. However, care must be taken to account for RNA's dynamic nature. In this study, we evaluated the following popular splice-aware alignment algorithms in the context of RNA variant-calling analysis: HISAT2, STAR, STAR (two-pass mode), Subread, and Subjunc. For this, we performed RNA-seq with ten pieces of invasive ductal carcinoma from breast tissue and three pieces of adjacent normal tissue from a single patient. These RNA-seq data were used to evaluate the performance of splice-aware aligners. Surprisingly, the number of common potential RNA editing sites (pRESs) identified by all alignment algorithms was less than 2% of the total. The main cause of this difference was the mapped reads on the splice junctions. In addition, the RNA quality significantly affected the outcome. Therefore, researchers must consider these experimental and bioinformatic features during RNA variant analysis. Further investigations of common pRESs discovered that BDH1, CCDC137, and TBC1D10A transcripts contained a single non-synonymous RNA variant that was unique to breast cancer tissue compared to adjacent normal tissue; thus, further clinical validation is required.
url	http://europepmc.org/articles/PMC6072070?pdf=render
work_keys_str_mv	AT jihyunghong rnavariantidentificationdiscrepancyamongspliceawarealignmentalgorithms AT yoonhoko rnavariantidentificationdiscrepancyamongspliceawarealignmentalgorithms AT keunsookang rnavariantidentificationdiscrepancyamongspliceawarealignmentalgorithms
_version_	1725125042526748672

RNA variant identification discrepancy among splice-aware alignment algorithms.

Similar Items