SparkGA2: Production-quality memory-efficient Apache Spark based genome analysis framework.

Due to the rapid decrease in the cost of NGS (Next Generation Sequencing), interest has increased in using data generated from NGS to diagnose genetic diseases. However, the data generated by NGS technology is usually in the order of hundreds of gigabytes per experiment, thus requiring efficient and...

Bibliographic Details
Main Authors: Hamid Mushtaq, Nauman Ahmed, Zaid Al-Ars
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2019-01-01
Series: PLoS ONE
Online Access: https://doi.org/10.1371/journal.pone.0224784