Computations and Algorithms in Physical and Biological Problems
This dissertation presents the applications of state-of-the-art computation techniques and data analysis algorithms in three physical and biological problems: assembling DNA pieces, optimizing self-assembly yield, and identifying correlations from large multivariate datasets. In the first topic, in-...
Main Author: | |
---|---|
Other Authors: | |
Language: | en_US |
Published: |
Harvard University
2014
|
Subjects: | |
Online Access: | http://dissertations.umi.com/gsas.harvard:11478 http://nrs.harvard.edu/urn-3:HUL.InstRepos:12274619 |
id |
ndltd-harvard.edu-oai-dash.harvard.edu-1-12274619 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-harvard.edu-oai-dash.harvard.edu-1-122746192015-08-14T15:42:58ZComputations and Algorithms in Physical and Biological ProblemsQin, YuApplied mathematicsComputer sciencePhysicsComputationInformation miningRandom matrix theorySelf assemblySequencing by hybridizationThis dissertation presents the applications of state-of-the-art computation techniques and data analysis algorithms in three physical and biological problems: assembling DNA pieces, optimizing self-assembly yield, and identifying correlations from large multivariate datasets. In the first topic, in-depth analysis of using Sequencing by Hybridization (SBH) to reconstruct target DNA sequences shows that a modified reconstruction algorithm can overcome the theoretical boundary without the need for different types of biochemical assays and is robust to error. In the second topic, consistent with theoretical predictions, simulations using Graphics Processing Unit (GPU) demonstrate how controlling the short-ranged interactions between particles and controlling the concentrations optimize the self-assembly yield of a desired structure, and nonequilibrium behavior when optimizing concentrations is also unveiled by leveraging the computation capacity of GPUs. In the last topic, a methodology to incorporate existing categorization information into the search process to efficiently reconstruct the optimal true correlation matrix for multivariate datasets is introduced. Simulations on both synthetic and real financial datasets show that the algorithm is able to detect signals below the Random Matrix Theory (RMT) threshold. These three problems are representatives of using massive computation techniques and data analysis algorithms to tackle optimization problems, and outperform theoretical boundary when incorporating prior information into the computation.Engineering and Applied SciencesBrenner, Michael P.2014-06-07T01:12:39Z2014-06-0620142014-06-07T01:12:39ZThesis or DissertationQin, Yu. 2014. Computations and Algorithms in Physical and Biological Problems. Doctoral dissertation, Harvard University.http://dissertations.umi.com/gsas.harvard:11478http://nrs.harvard.edu/urn-3:HUL.InstRepos:12274619en_USopenhttp://nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of-use#LAAHarvard University |
collection |
NDLTD |
language |
en_US |
sources |
NDLTD |
topic |
Applied mathematics Computer science Physics Computation Information mining Random matrix theory Self assembly Sequencing by hybridization |
spellingShingle |
Applied mathematics Computer science Physics Computation Information mining Random matrix theory Self assembly Sequencing by hybridization Qin, Yu Computations and Algorithms in Physical and Biological Problems |
description |
This dissertation presents the applications of state-of-the-art computation techniques and data analysis algorithms in three physical and biological problems: assembling DNA pieces, optimizing self-assembly yield, and identifying correlations from large multivariate datasets. In the first topic, in-depth analysis of using Sequencing by Hybridization (SBH) to reconstruct target DNA sequences shows that a modified reconstruction algorithm can overcome the theoretical boundary without the need for different types of biochemical assays and is robust to error. In the second topic, consistent with theoretical predictions, simulations using Graphics Processing Unit (GPU) demonstrate how controlling the short-ranged interactions between particles and controlling the concentrations optimize the self-assembly yield of a desired structure, and nonequilibrium behavior when optimizing concentrations is also unveiled by leveraging the computation capacity of GPUs. In the last topic, a methodology to incorporate existing categorization information into the search process to efficiently reconstruct the optimal true correlation matrix for multivariate datasets is introduced. Simulations on both synthetic and real financial datasets show that the algorithm is able to detect signals below the Random Matrix Theory (RMT) threshold. These three problems are representatives of using massive computation techniques and data analysis algorithms to tackle optimization problems, and outperform theoretical boundary when incorporating prior information into the computation. === Engineering and Applied Sciences |
author2 |
Brenner, Michael P. |
author_facet |
Brenner, Michael P. Qin, Yu |
author |
Qin, Yu |
author_sort |
Qin, Yu |
title |
Computations and Algorithms in Physical and Biological Problems |
title_short |
Computations and Algorithms in Physical and Biological Problems |
title_full |
Computations and Algorithms in Physical and Biological Problems |
title_fullStr |
Computations and Algorithms in Physical and Biological Problems |
title_full_unstemmed |
Computations and Algorithms in Physical and Biological Problems |
title_sort |
computations and algorithms in physical and biological problems |
publisher |
Harvard University |
publishDate |
2014 |
url |
http://dissertations.umi.com/gsas.harvard:11478 http://nrs.harvard.edu/urn-3:HUL.InstRepos:12274619 |
work_keys_str_mv |
AT qinyu computationsandalgorithmsinphysicalandbiologicalproblems |
_version_ |
1716816923776253952 |