New Statistical Methods and Computational Tools for Mining Big Data, with Applications in Plant Sciences

The purpose of this dissertation is to develop new statistical tools for mining big data in plant sciences. In particular, the dissertation consists of four inter-related projects to address various methodological and computational challenges in phylogenetic methods. Project 1 aims to systematically...

Full description

Bibliographic Details
Main Author: Michels, Kurt Andrew
Other Authors: Zhang, Hao Helen
Language:en_US
Published: The University of Arizona. 2016
Subjects:
R
Online Access:http://hdl.handle.net/10150/613247
http://arizona.openrepository.com/arizona/handle/10150/613247
id ndltd-arizona.edu-oai-arizona.openrepository.com-10150-613247
record_format oai_dc
spelling ndltd-arizona.edu-oai-arizona.openrepository.com-10150-6132472016-06-17T03:00:56Z New Statistical Methods and Computational Tools for Mining Big Data, with Applications in Plant Sciences Michels, Kurt Andrew Zhang, Hao Helen Billheimer, David D. Kececioglu, John D. Merchant, Nirav C. Matasci, Naim Zhang, Hao Helen Forward Regression Genome Wide Association Study Group Data Interactions R Statistics Big Data The purpose of this dissertation is to develop new statistical tools for mining big data in plant sciences. In particular, the dissertation consists of four inter-related projects to address various methodological and computational challenges in phylogenetic methods. Project 1 aims to systematically test different optimization tools and provide useful strategies to improve optimization in practice. Project 2 develops a new R package rPlant, which provides a friendly and convenient toolbox for users of iPlant. Project 3 presents a fast and effective group-screening method to identify important genetic factors in GWAS, with theoretical justifications and nice asymptotic properties. Project 4 develops a new statistical tool to identify gene-gene interactions, with the ability of handling the interactions between groups of covariates. 2016 text Electronic Dissertation http://hdl.handle.net/10150/613247 http://arizona.openrepository.com/arizona/handle/10150/613247 en_US Copyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction or presentation (such as public display or performance) of protected items is prohibited except with permission of the author. The University of Arizona.
collection NDLTD
language en_US
sources NDLTD
topic Forward Regression
Genome Wide Association Study
Group Data
Interactions
R
Statistics
Big Data
spellingShingle Forward Regression
Genome Wide Association Study
Group Data
Interactions
R
Statistics
Big Data
Michels, Kurt Andrew
New Statistical Methods and Computational Tools for Mining Big Data, with Applications in Plant Sciences
description The purpose of this dissertation is to develop new statistical tools for mining big data in plant sciences. In particular, the dissertation consists of four inter-related projects to address various methodological and computational challenges in phylogenetic methods. Project 1 aims to systematically test different optimization tools and provide useful strategies to improve optimization in practice. Project 2 develops a new R package rPlant, which provides a friendly and convenient toolbox for users of iPlant. Project 3 presents a fast and effective group-screening method to identify important genetic factors in GWAS, with theoretical justifications and nice asymptotic properties. Project 4 develops a new statistical tool to identify gene-gene interactions, with the ability of handling the interactions between groups of covariates.
author2 Zhang, Hao Helen
author_facet Zhang, Hao Helen
Michels, Kurt Andrew
author Michels, Kurt Andrew
author_sort Michels, Kurt Andrew
title New Statistical Methods and Computational Tools for Mining Big Data, with Applications in Plant Sciences
title_short New Statistical Methods and Computational Tools for Mining Big Data, with Applications in Plant Sciences
title_full New Statistical Methods and Computational Tools for Mining Big Data, with Applications in Plant Sciences
title_fullStr New Statistical Methods and Computational Tools for Mining Big Data, with Applications in Plant Sciences
title_full_unstemmed New Statistical Methods and Computational Tools for Mining Big Data, with Applications in Plant Sciences
title_sort new statistical methods and computational tools for mining big data, with applications in plant sciences
publisher The University of Arizona.
publishDate 2016
url http://hdl.handle.net/10150/613247
http://arizona.openrepository.com/arizona/handle/10150/613247
work_keys_str_mv AT michelskurtandrew newstatisticalmethodsandcomputationaltoolsforminingbigdatawithapplicationsinplantsciences
_version_ 1718306897543561216