A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation Sequencing

With recent advances in next-generation sequencing (NGS) technology, large volumes of data have been produced in the form of short reads. Sequence assembly involves using initial short reads to produce progressively longer contigs, and then using scaffolds to produce the final sequence. These proces...

Full description

Bibliographic Details
Main Authors: Jeongkyu Kim, Mingeun Ji, Gangman Yi
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
NGS
Online Access:https://ieeexplore.ieee.org/document/9223667/
id doaj-f2f4d0573e164b0a809fea528fb2e23b
record_format Article
spelling doaj-f2f4d0573e164b0a809fea528fb2e23b2021-03-30T04:10:36ZengIEEEIEEE Access2169-35362020-01-01818981118982210.1109/ACCESS.2020.30311599223667A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation SequencingJeongkyu Kim0https://orcid.org/0000-0003-2191-3573Mingeun Ji1Gangman Yi2https://orcid.org/0000-0003-4664-5325Department of Multimedia Engineering, Dongguk University, Seoul, South KoreaDepartment of Multimedia Engineering, Dongguk University, Seoul, South KoreaDepartment of Multimedia Engineering, Dongguk University, Seoul, South KoreaWith recent advances in next-generation sequencing (NGS) technology, large volumes of data have been produced in the form of short reads. Sequence assembly involves using initial short reads to produce progressively longer contigs, and then using scaffolds to produce the final sequence. These processes each require evaluation of the extent of homology between different sequences. However, because the NGS platforms currently being developed are diverse, and the data being produced are of different sizes and read lengths, numerous algorithms are being developed with unique methodologies to process this complex data. It is difficult for biologists to manipulate the different features involved in these algorithms. Therefore, to reduce experimental trial-and-error, different strategies are required depending on the performance and purpose of the optimal algorithm, thereby facilitating understanding of algorithm methodologies and effective use of their various features. This study is a review of the different short read alignment algorithms and NGS platforms that have been developed to date, in order to aid efficient selection of algorithms for reference sequences and mapping of DNA data.https://ieeexplore.ieee.org/document/9223667/NGSsequence alignmentread alignmentFM-indexhashing
collection DOAJ
language English
format Article
sources DOAJ
author Jeongkyu Kim
Mingeun Ji
Gangman Yi
spellingShingle Jeongkyu Kim
Mingeun Ji
Gangman Yi
A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation Sequencing
IEEE Access
NGS
sequence alignment
read alignment
FM-index
hashing
author_facet Jeongkyu Kim
Mingeun Ji
Gangman Yi
author_sort Jeongkyu Kim
title A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation Sequencing
title_short A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation Sequencing
title_full A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation Sequencing
title_fullStr A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation Sequencing
title_full_unstemmed A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation Sequencing
title_sort review on sequence alignment algorithms for short reads based on next-generation sequencing
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description With recent advances in next-generation sequencing (NGS) technology, large volumes of data have been produced in the form of short reads. Sequence assembly involves using initial short reads to produce progressively longer contigs, and then using scaffolds to produce the final sequence. These processes each require evaluation of the extent of homology between different sequences. However, because the NGS platforms currently being developed are diverse, and the data being produced are of different sizes and read lengths, numerous algorithms are being developed with unique methodologies to process this complex data. It is difficult for biologists to manipulate the different features involved in these algorithms. Therefore, to reduce experimental trial-and-error, different strategies are required depending on the performance and purpose of the optimal algorithm, thereby facilitating understanding of algorithm methodologies and effective use of their various features. This study is a review of the different short read alignment algorithms and NGS platforms that have been developed to date, in order to aid efficient selection of algorithms for reference sequences and mapping of DNA data.
topic NGS
sequence alignment
read alignment
FM-index
hashing
url https://ieeexplore.ieee.org/document/9223667/
work_keys_str_mv AT jeongkyukim areviewonsequencealignmentalgorithmsforshortreadsbasedonnextgenerationsequencing
AT mingeunji areviewonsequencealignmentalgorithmsforshortreadsbasedonnextgenerationsequencing
AT gangmanyi areviewonsequencealignmentalgorithmsforshortreadsbasedonnextgenerationsequencing
AT jeongkyukim reviewonsequencealignmentalgorithmsforshortreadsbasedonnextgenerationsequencing
AT mingeunji reviewonsequencealignmentalgorithmsforshortreadsbasedonnextgenerationsequencing
AT gangmanyi reviewonsequencealignmentalgorithmsforshortreadsbasedonnextgenerationsequencing
_version_ 1724182257510383616