Widescale analysis of transcriptomics data using cloud computing methods

This study explores the handling and analyzing of big data in the field of bioinformatics. The focus has been on improving the analysis of public domain data for Affymetrix GeneChips which are a widely used technology for measuring gene expression. Methods to determine the bias in gene expression du...

Full description

Bibliographic Details
Main Author: Owen, Anne M.
Published: University of Essex 2016
Subjects:
510
Online Access:http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.681817
id ndltd-bl.uk-oai-ethos.bl.uk-681817
record_format oai_dc
spelling ndltd-bl.uk-oai-ethos.bl.uk-6818172016-06-21T03:23:46ZWidescale analysis of transcriptomics data using cloud computing methodsOwen, Anne M.2016This study explores the handling and analyzing of big data in the field of bioinformatics. The focus has been on improving the analysis of public domain data for Affymetrix GeneChips which are a widely used technology for measuring gene expression. Methods to determine the bias in gene expression due to G-stacks associated with runs of guanine in probes have been explored via the use of a grid and various types of cloud computing. An attempt has been made to find the best way of storing and analyzing big data used in bioinformatics. A grid and various types of cloud computing have been employed. The experience gained in using a grid and different clouds has been reported. In the case of Windows Azure, a public cloud has been employed in a new way to demonstrate the use of the R statistical language for research in bioinformatics. This work has studied the G-stack bias in a broad range of GeneChip data from public repositories. A wide scale survey has been carried out to determine the extent of the Gstack bias in four different chips across three different species. The study commenced with the human GeneChip HG U133A. A second human GeneChip HG U133 Plus2 was then examined, followed by a plant chip, Arabidopsis thaliana, and then a bacterium chip, Pseudomonas aeruginosa. Comparisons have also been made between the use of widely recognised algorithms RMA and PLIER for the normalization stage of extracting gene expression from GeneChip data.510QR MicrobiologyUniversity of Essexhttp://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.681817http://repository.essex.ac.uk/16125/Electronic Thesis or Dissertation
collection NDLTD
sources NDLTD
topic 510
QR Microbiology
spellingShingle 510
QR Microbiology
Owen, Anne M.
Widescale analysis of transcriptomics data using cloud computing methods
description This study explores the handling and analyzing of big data in the field of bioinformatics. The focus has been on improving the analysis of public domain data for Affymetrix GeneChips which are a widely used technology for measuring gene expression. Methods to determine the bias in gene expression due to G-stacks associated with runs of guanine in probes have been explored via the use of a grid and various types of cloud computing. An attempt has been made to find the best way of storing and analyzing big data used in bioinformatics. A grid and various types of cloud computing have been employed. The experience gained in using a grid and different clouds has been reported. In the case of Windows Azure, a public cloud has been employed in a new way to demonstrate the use of the R statistical language for research in bioinformatics. This work has studied the G-stack bias in a broad range of GeneChip data from public repositories. A wide scale survey has been carried out to determine the extent of the Gstack bias in four different chips across three different species. The study commenced with the human GeneChip HG U133A. A second human GeneChip HG U133 Plus2 was then examined, followed by a plant chip, Arabidopsis thaliana, and then a bacterium chip, Pseudomonas aeruginosa. Comparisons have also been made between the use of widely recognised algorithms RMA and PLIER for the normalization stage of extracting gene expression from GeneChip data.
author Owen, Anne M.
author_facet Owen, Anne M.
author_sort Owen, Anne M.
title Widescale analysis of transcriptomics data using cloud computing methods
title_short Widescale analysis of transcriptomics data using cloud computing methods
title_full Widescale analysis of transcriptomics data using cloud computing methods
title_fullStr Widescale analysis of transcriptomics data using cloud computing methods
title_full_unstemmed Widescale analysis of transcriptomics data using cloud computing methods
title_sort widescale analysis of transcriptomics data using cloud computing methods
publisher University of Essex
publishDate 2016
url http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.681817
work_keys_str_mv AT owenannem widescaleanalysisoftranscriptomicsdatausingcloudcomputingmethods
_version_ 1718313135773843456