Optimal compressed representation of high throughput sequence data via light assembly

Increase in high throughput sequencing (HTS) data warrants compression methods to facilitate better storage and communication. Here, Ginart et al. introduce Assembltrie, a reference-free compression tool which is guaranteed to achieve optimality for error-free reads.

Bibliographic Details
Main Authors: Antonio A. Ginart, Joseph Hui, Kaiyuan Zhu, Ibrahim Numanagić, Thomas A. Courtade, S. Cenk Sahinalp, David N. Tse
Format: Article
Language:English
Published: Nature Publishing Group 2018-02-01
Series:Nature Communications
Online Access:https://doi.org/10.1038/s41467-017-02480-6
id doaj-b186eeb7b1ee4de1983ef5679b01c1db
record_format Article
spelling doaj-b186eeb7b1ee4de1983ef5679b01c1db2021-05-11T10:15:44ZengNature Publishing GroupNature Communications2041-17232018-02-01911910.1038/s41467-017-02480-6Optimal compressed representation of high throughput sequence data via light assemblyAntonio A. Ginart0Joseph Hui1Kaiyuan Zhu2Ibrahim Numanagić3Thomas A. Courtade4S. Cenk Sahinalp5David N. Tse6Department of Electrical Engineering, Stanford UniversityDepartment of Electrical Engineering & Computer Science, Massachusetts Institute of TechnologyDepartment of Computer Science, Indiana University BloomingtonComputer Science & Artificial Intelligence Laboratory, Massachusetts Institute of TechnologyDepartment of Electrical Engineering and Computer Sciences, University of CaliforniaDepartment of Computer Science, Indiana University BloomingtonDepartment of Electrical Engineering, Stanford UniversityIncrease in high throughput sequencing (HTS) data warrants compression methods to facilitate better storage and communication. Here, Ginart et al. introduce Assembltrie, a reference-free compression tool which is guaranteed to achieve optimality for error-free reads.https://doi.org/10.1038/s41467-017-02480-6
collection DOAJ
language English
format Article
sources DOAJ
author Antonio A. Ginart
Joseph Hui
Kaiyuan Zhu
Ibrahim Numanagić
Thomas A. Courtade
S. Cenk Sahinalp
David N. Tse
spellingShingle Antonio A. Ginart
Joseph Hui
Kaiyuan Zhu
Ibrahim Numanagić
Thomas A. Courtade
S. Cenk Sahinalp
David N. Tse
Optimal compressed representation of high throughput sequence data via light assembly
Nature Communications
author_facet Antonio A. Ginart
Joseph Hui
Kaiyuan Zhu
Ibrahim Numanagić
Thomas A. Courtade
S. Cenk Sahinalp
David N. Tse
author_sort Antonio A. Ginart
title Optimal compressed representation of high throughput sequence data via light assembly
title_short Optimal compressed representation of high throughput sequence data via light assembly
title_full Optimal compressed representation of high throughput sequence data via light assembly
title_fullStr Optimal compressed representation of high throughput sequence data via light assembly
title_full_unstemmed Optimal compressed representation of high throughput sequence data via light assembly
title_sort optimal compressed representation of high throughput sequence data via light assembly
publisher Nature Publishing Group
series Nature Communications
issn 2041-1723
publishDate 2018-02-01
description Increase in high throughput sequencing (HTS) data warrants compression methods to facilitate better storage and communication. Here, Ginart et al. introduce Assembltrie, a reference-free compression tool which is guaranteed to achieve optimality for error-free reads.
url https://doi.org/10.1038/s41467-017-02480-6
work_keys_str_mv AT antonioaginart optimalcompressedrepresentationofhighthroughputsequencedatavialightassembly
AT josephhui optimalcompressedrepresentationofhighthroughputsequencedatavialightassembly
AT kaiyuanzhu optimalcompressedrepresentationofhighthroughputsequencedatavialightassembly
AT ibrahimnumanagic optimalcompressedrepresentationofhighthroughputsequencedatavialightassembly
AT thomasacourtade optimalcompressedrepresentationofhighthroughputsequencedatavialightassembly
AT scenksahinalp optimalcompressedrepresentationofhighthroughputsequencedatavialightassembly
AT davidntse optimalcompressedrepresentationofhighthroughputsequencedatavialightassembly
_version_ 1721448540145713152