id ndltd-OhioLink-oai-etd.ohiolink.edu-osu1436355132
record_format oai_dc
spelling ndltd-OhioLink-oai-etd.ohiolink.edu-osu1436355132 2021-08-03T06:31:53Z Parallel Processing of Large Scale Genomic Data Kutlu, Mucahid Computer Science Parallel Computation Genomic Applications Middleware Systems Task Scheduling SNP Calling Sequence Quantification An increasing amount of genomic data is becoming available to researchers with the development of high-throughput, low-cost sequencing technologies. Analysis of such data has significant potential for new scientific and medical advances. However, as the amount of available data increases, parallelism and effective utilization of computing resources become even more critical. Thus, novel parallelization approaches and frameworks that help researchers develop parallel applications without dealing with the low-level details of parallel coding are urgently needed for new advances in genomic research. In this dissertation, we introduce parallel genomic data analysis tools and middleware systems for easily developing efficient parallel genomic applications. With the proposed frameworks and parallel algorithms, we address the following challenges: (1) How can we partition genomic data for parallel SNP calling and sequence quantification tools? (2) Is it possible to utilize existing genomic applications in parallel executions? (3) How can we take advantage of domain-specific knowledge to increase the performance of the applications? (4) How can we schedule data-intensive tasks? (5) How can we implement efficient parallel genomic applications for memory-constrained many-core architectures such as Intel Xeon Phi? 2015-10-09 English text The Ohio State University / OhioLINK http://rave.ohiolink.edu/etdc/view?acc_num=osu1436355132 unrestricted This thesis or dissertation is protected by copyright: all rights reserved. It may not be copied or redistributed beyond the terms of applicable copyright laws.
collection NDLTD
language English
sources NDLTD
topic Computer Science
Parallel Computation
Genomic Applications
Middleware Systems
Task Scheduling
SNP Calling
Sequence Quantification
spellingShingle Computer Science
Parallel Computation
Genomic Applications
Middleware Systems
Task Scheduling
SNP Calling
Sequence Quantification
Kutlu, Mucahid
Parallel Processing of Large Scale Genomic Data
author Kutlu, Mucahid
author_facet Kutlu, Mucahid
author_sort Kutlu, Mucahid
title Parallel Processing of Large Scale Genomic Data
title_short Parallel Processing of Large Scale Genomic Data
title_full Parallel Processing of Large Scale Genomic Data
title_fullStr Parallel Processing of Large Scale Genomic Data
title_full_unstemmed Parallel Processing of Large Scale Genomic Data
title_sort parallel processing of large scale genomic data
publisher The Ohio State University / OhioLINK
publishDate 2015
url http://rave.ohiolink.edu/etdc/view?acc_num=osu1436355132
work_keys_str_mv AT kutlumucahid parallelprocessingoflargescalegenomicdata
_version_ 1719438516523368448