Computations and Algorithms in Physical and Biological Problems

This dissertation presents the applications of state-of-the-art computation techniques and data analysis algorithms in three physical and biological problems: assembling DNA pieces, optimizing self-assembly yield, and identifying correlations from large multivariate datasets. In the first topic, in-...

Full description

Bibliographic Details
Main Author: Qin, Yu
Other Authors: Brenner, Michael P.
Language:en_US
Published: Harvard University 2014
Subjects:
Online Access:http://dissertations.umi.com/gsas.harvard:11478
http://nrs.harvard.edu/urn-3:HUL.InstRepos:12274619
id ndltd-harvard.edu-oai-dash.harvard.edu-1-12274619
record_format oai_dc
spelling ndltd-harvard.edu-oai-dash.harvard.edu-1-122746192015-08-14T15:42:58ZComputations and Algorithms in Physical and Biological ProblemsQin, YuApplied mathematicsComputer sciencePhysicsComputationInformation miningRandom matrix theorySelf assemblySequencing by hybridizationThis dissertation presents the applications of state-of-the-art computation techniques and data analysis algorithms in three physical and biological problems: assembling DNA pieces, optimizing self-assembly yield, and identifying correlations from large multivariate datasets. In the first topic, in-depth analysis of using Sequencing by Hybridization (SBH) to reconstruct target DNA sequences shows that a modified reconstruction algorithm can overcome the theoretical boundary without the need for different types of biochemical assays and is robust to error. In the second topic, consistent with theoretical predictions, simulations using Graphics Processing Unit (GPU) demonstrate how controlling the short-ranged interactions between particles and controlling the concentrations optimize the self-assembly yield of a desired structure, and nonequilibrium behavior when optimizing concentrations is also unveiled by leveraging the computation capacity of GPUs. In the last topic, a methodology to incorporate existing categorization information into the search process to efficiently reconstruct the optimal true correlation matrix for multivariate datasets is introduced. Simulations on both synthetic and real financial datasets show that the algorithm is able to detect signals below the Random Matrix Theory (RMT) threshold. These three problems are representatives of using massive computation techniques and data analysis algorithms to tackle optimization problems, and outperform theoretical boundary when incorporating prior information into the computation.Engineering and Applied SciencesBrenner, Michael P.2014-06-07T01:12:39Z2014-06-0620142014-06-07T01:12:39ZThesis or DissertationQin, Yu. 2014. Computations and Algorithms in Physical and Biological Problems. Doctoral dissertation, Harvard University.http://dissertations.umi.com/gsas.harvard:11478http://nrs.harvard.edu/urn-3:HUL.InstRepos:12274619en_USopenhttp://nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of-use#LAAHarvard University
collection NDLTD
language en_US
sources NDLTD
topic Applied mathematics
Computer science
Physics
Computation
Information mining
Random matrix theory
Self assembly
Sequencing by hybridization
spellingShingle Applied mathematics
Computer science
Physics
Computation
Information mining
Random matrix theory
Self assembly
Sequencing by hybridization
Qin, Yu
Computations and Algorithms in Physical and Biological Problems
description This dissertation presents the applications of state-of-the-art computation techniques and data analysis algorithms in three physical and biological problems: assembling DNA pieces, optimizing self-assembly yield, and identifying correlations from large multivariate datasets. In the first topic, in-depth analysis of using Sequencing by Hybridization (SBH) to reconstruct target DNA sequences shows that a modified reconstruction algorithm can overcome the theoretical boundary without the need for different types of biochemical assays and is robust to error. In the second topic, consistent with theoretical predictions, simulations using Graphics Processing Unit (GPU) demonstrate how controlling the short-ranged interactions between particles and controlling the concentrations optimize the self-assembly yield of a desired structure, and nonequilibrium behavior when optimizing concentrations is also unveiled by leveraging the computation capacity of GPUs. In the last topic, a methodology to incorporate existing categorization information into the search process to efficiently reconstruct the optimal true correlation matrix for multivariate datasets is introduced. Simulations on both synthetic and real financial datasets show that the algorithm is able to detect signals below the Random Matrix Theory (RMT) threshold. These three problems are representatives of using massive computation techniques and data analysis algorithms to tackle optimization problems, and outperform theoretical boundary when incorporating prior information into the computation. === Engineering and Applied Sciences
author2 Brenner, Michael P.
author_facet Brenner, Michael P.
Qin, Yu
author Qin, Yu
author_sort Qin, Yu
title Computations and Algorithms in Physical and Biological Problems
title_short Computations and Algorithms in Physical and Biological Problems
title_full Computations and Algorithms in Physical and Biological Problems
title_fullStr Computations and Algorithms in Physical and Biological Problems
title_full_unstemmed Computations and Algorithms in Physical and Biological Problems
title_sort computations and algorithms in physical and biological problems
publisher Harvard University
publishDate 2014
url http://dissertations.umi.com/gsas.harvard:11478
http://nrs.harvard.edu/urn-3:HUL.InstRepos:12274619
work_keys_str_mv AT qinyu computationsandalgorithmsinphysicalandbiologicalproblems
_version_ 1716816923776253952