A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation Sequencing
With recent advances in next-generation sequencing (NGS) technology, large volumes of data have been produced in the form of short reads. Sequence assembly involves using initial short reads to produce progressively longer contigs, and then using scaffolds to produce the final sequence. These proces...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9223667/ |
id |
doaj-f2f4d0573e164b0a809fea528fb2e23b |
---|---|
record_format |
Article |
spelling |
doaj-f2f4d0573e164b0a809fea528fb2e23b2021-03-30T04:10:36ZengIEEEIEEE Access2169-35362020-01-01818981118982210.1109/ACCESS.2020.30311599223667A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation SequencingJeongkyu Kim0https://orcid.org/0000-0003-2191-3573Mingeun Ji1Gangman Yi2https://orcid.org/0000-0003-4664-5325Department of Multimedia Engineering, Dongguk University, Seoul, South KoreaDepartment of Multimedia Engineering, Dongguk University, Seoul, South KoreaDepartment of Multimedia Engineering, Dongguk University, Seoul, South KoreaWith recent advances in next-generation sequencing (NGS) technology, large volumes of data have been produced in the form of short reads. Sequence assembly involves using initial short reads to produce progressively longer contigs, and then using scaffolds to produce the final sequence. These processes each require evaluation of the extent of homology between different sequences. However, because the NGS platforms currently being developed are diverse, and the data being produced are of different sizes and read lengths, numerous algorithms are being developed with unique methodologies to process this complex data. It is difficult for biologists to manipulate the different features involved in these algorithms. Therefore, to reduce experimental trial-and-error, different strategies are required depending on the performance and purpose of the optimal algorithm, thereby facilitating understanding of algorithm methodologies and effective use of their various features. This study is a review of the different short read alignment algorithms and NGS platforms that have been developed to date, in order to aid efficient selection of algorithms for reference sequences and mapping of DNA data.https://ieeexplore.ieee.org/document/9223667/NGSsequence alignmentread alignmentFM-indexhashing |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jeongkyu Kim Mingeun Ji Gangman Yi |
spellingShingle |
Jeongkyu Kim Mingeun Ji Gangman Yi A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation Sequencing IEEE Access NGS sequence alignment read alignment FM-index hashing |
author_facet |
Jeongkyu Kim Mingeun Ji Gangman Yi |
author_sort |
Jeongkyu Kim |
title |
A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation Sequencing |
title_short |
A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation Sequencing |
title_full |
A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation Sequencing |
title_fullStr |
A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation Sequencing |
title_full_unstemmed |
A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation Sequencing |
title_sort |
review on sequence alignment algorithms for short reads based on next-generation sequencing |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
With recent advances in next-generation sequencing (NGS) technology, large volumes of data have been produced in the form of short reads. Sequence assembly involves using initial short reads to produce progressively longer contigs, and then using scaffolds to produce the final sequence. These processes each require evaluation of the extent of homology between different sequences. However, because the NGS platforms currently being developed are diverse, and the data being produced are of different sizes and read lengths, numerous algorithms are being developed with unique methodologies to process this complex data. It is difficult for biologists to manipulate the different features involved in these algorithms. Therefore, to reduce experimental trial-and-error, different strategies are required depending on the performance and purpose of the optimal algorithm, thereby facilitating understanding of algorithm methodologies and effective use of their various features. This study is a review of the different short read alignment algorithms and NGS platforms that have been developed to date, in order to aid efficient selection of algorithms for reference sequences and mapping of DNA data. |
topic |
NGS sequence alignment read alignment FM-index hashing |
url |
https://ieeexplore.ieee.org/document/9223667/ |
work_keys_str_mv |
AT jeongkyukim areviewonsequencealignmentalgorithmsforshortreadsbasedonnextgenerationsequencing AT mingeunji areviewonsequencealignmentalgorithmsforshortreadsbasedonnextgenerationsequencing AT gangmanyi areviewonsequencealignmentalgorithmsforshortreadsbasedonnextgenerationsequencing AT jeongkyukim reviewonsequencealignmentalgorithmsforshortreadsbasedonnextgenerationsequencing AT mingeunji reviewonsequencealignmentalgorithmsforshortreadsbasedonnextgenerationsequencing AT gangmanyi reviewonsequencealignmentalgorithmsforshortreadsbasedonnextgenerationsequencing |
_version_ |
1724182257510383616 |