Optimal compressed representation of high throughput sequence data via light assembly
Increase in high throughput sequencing (HTS) data warrants compression methods to facilitate better storage and communication. Here, Ginart et al. introduce Assembltrie, a reference-free compression tool which is guaranteed to achieve optimality for error-free reads.
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Publishing Group
2018-02-01
|
Series: | Nature Communications |
Online Access: | https://doi.org/10.1038/s41467-017-02480-6 |
id |
doaj-b186eeb7b1ee4de1983ef5679b01c1db |
---|---|
record_format |
Article |
spelling |
doaj-b186eeb7b1ee4de1983ef5679b01c1db2021-05-11T10:15:44ZengNature Publishing GroupNature Communications2041-17232018-02-01911910.1038/s41467-017-02480-6Optimal compressed representation of high throughput sequence data via light assemblyAntonio A. Ginart0Joseph Hui1Kaiyuan Zhu2Ibrahim Numanagić3Thomas A. Courtade4S. Cenk Sahinalp5David N. Tse6Department of Electrical Engineering, Stanford UniversityDepartment of Electrical Engineering & Computer Science, Massachusetts Institute of TechnologyDepartment of Computer Science, Indiana University BloomingtonComputer Science & Artificial Intelligence Laboratory, Massachusetts Institute of TechnologyDepartment of Electrical Engineering and Computer Sciences, University of CaliforniaDepartment of Computer Science, Indiana University BloomingtonDepartment of Electrical Engineering, Stanford UniversityIncrease in high throughput sequencing (HTS) data warrants compression methods to facilitate better storage and communication. Here, Ginart et al. introduce Assembltrie, a reference-free compression tool which is guaranteed to achieve optimality for error-free reads.https://doi.org/10.1038/s41467-017-02480-6 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Antonio A. Ginart Joseph Hui Kaiyuan Zhu Ibrahim Numanagić Thomas A. Courtade S. Cenk Sahinalp David N. Tse |
spellingShingle |
Antonio A. Ginart Joseph Hui Kaiyuan Zhu Ibrahim Numanagić Thomas A. Courtade S. Cenk Sahinalp David N. Tse Optimal compressed representation of high throughput sequence data via light assembly Nature Communications |
author_facet |
Antonio A. Ginart Joseph Hui Kaiyuan Zhu Ibrahim Numanagić Thomas A. Courtade S. Cenk Sahinalp David N. Tse |
author_sort |
Antonio A. Ginart |
title |
Optimal compressed representation of high throughput sequence data via light assembly |
title_short |
Optimal compressed representation of high throughput sequence data via light assembly |
title_full |
Optimal compressed representation of high throughput sequence data via light assembly |
title_fullStr |
Optimal compressed representation of high throughput sequence data via light assembly |
title_full_unstemmed |
Optimal compressed representation of high throughput sequence data via light assembly |
title_sort |
optimal compressed representation of high throughput sequence data via light assembly |
publisher |
Nature Publishing Group |
series |
Nature Communications |
issn |
2041-1723 |
publishDate |
2018-02-01 |
description |
Increase in high throughput sequencing (HTS) data warrants compression methods to facilitate better storage and communication. Here, Ginart et al. introduce Assembltrie, a reference-free compression tool which is guaranteed to achieve optimality for error-free reads. |
url |
https://doi.org/10.1038/s41467-017-02480-6 |
work_keys_str_mv |
AT antonioaginart optimalcompressedrepresentationofhighthroughputsequencedatavialightassembly AT josephhui optimalcompressedrepresentationofhighthroughputsequencedatavialightassembly AT kaiyuanzhu optimalcompressedrepresentationofhighthroughputsequencedatavialightassembly AT ibrahimnumanagic optimalcompressedrepresentationofhighthroughputsequencedatavialightassembly AT thomasacourtade optimalcompressedrepresentationofhighthroughputsequencedatavialightassembly AT scenksahinalp optimalcompressedrepresentationofhighthroughputsequencedatavialightassembly AT davidntse optimalcompressedrepresentationofhighthroughputsequencedatavialightassembly |
_version_ |
1721448540145713152 |