Widescale analysis of transcriptomics data using cloud computing methods
This study explores the handling and analyzing of big data in the field of bioinformatics. The focus has been on improving the analysis of public domain data for Affymetrix GeneChips which are a widely used technology for measuring gene expression. Methods to determine the bias in gene expression du...
Main Author: | |
---|---|
Published: |
University of Essex
2016
|
Subjects: | |
Online Access: | http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.681817 |
id |
ndltd-bl.uk-oai-ethos.bl.uk-681817 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-bl.uk-oai-ethos.bl.uk-6818172016-06-21T03:23:46ZWidescale analysis of transcriptomics data using cloud computing methodsOwen, Anne M.2016This study explores the handling and analyzing of big data in the field of bioinformatics. The focus has been on improving the analysis of public domain data for Affymetrix GeneChips which are a widely used technology for measuring gene expression. Methods to determine the bias in gene expression due to G-stacks associated with runs of guanine in probes have been explored via the use of a grid and various types of cloud computing. An attempt has been made to find the best way of storing and analyzing big data used in bioinformatics. A grid and various types of cloud computing have been employed. The experience gained in using a grid and different clouds has been reported. In the case of Windows Azure, a public cloud has been employed in a new way to demonstrate the use of the R statistical language for research in bioinformatics. This work has studied the G-stack bias in a broad range of GeneChip data from public repositories. A wide scale survey has been carried out to determine the extent of the Gstack bias in four different chips across three different species. The study commenced with the human GeneChip HG U133A. A second human GeneChip HG U133 Plus2 was then examined, followed by a plant chip, Arabidopsis thaliana, and then a bacterium chip, Pseudomonas aeruginosa. Comparisons have also been made between the use of widely recognised algorithms RMA and PLIER for the normalization stage of extracting gene expression from GeneChip data.510QR MicrobiologyUniversity of Essexhttp://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.681817http://repository.essex.ac.uk/16125/Electronic Thesis or Dissertation |
collection |
NDLTD |
sources |
NDLTD |
topic |
510 QR Microbiology |
spellingShingle |
510 QR Microbiology Owen, Anne M. Widescale analysis of transcriptomics data using cloud computing methods |
description |
This study explores the handling and analyzing of big data in the field of bioinformatics. The focus has been on improving the analysis of public domain data for Affymetrix GeneChips which are a widely used technology for measuring gene expression. Methods to determine the bias in gene expression due to G-stacks associated with runs of guanine in probes have been explored via the use of a grid and various types of cloud computing. An attempt has been made to find the best way of storing and analyzing big data used in bioinformatics. A grid and various types of cloud computing have been employed. The experience gained in using a grid and different clouds has been reported. In the case of Windows Azure, a public cloud has been employed in a new way to demonstrate the use of the R statistical language for research in bioinformatics. This work has studied the G-stack bias in a broad range of GeneChip data from public repositories. A wide scale survey has been carried out to determine the extent of the Gstack bias in four different chips across three different species. The study commenced with the human GeneChip HG U133A. A second human GeneChip HG U133 Plus2 was then examined, followed by a plant chip, Arabidopsis thaliana, and then a bacterium chip, Pseudomonas aeruginosa. Comparisons have also been made between the use of widely recognised algorithms RMA and PLIER for the normalization stage of extracting gene expression from GeneChip data. |
author |
Owen, Anne M. |
author_facet |
Owen, Anne M. |
author_sort |
Owen, Anne M. |
title |
Widescale analysis of transcriptomics data using cloud computing methods |
title_short |
Widescale analysis of transcriptomics data using cloud computing methods |
title_full |
Widescale analysis of transcriptomics data using cloud computing methods |
title_fullStr |
Widescale analysis of transcriptomics data using cloud computing methods |
title_full_unstemmed |
Widescale analysis of transcriptomics data using cloud computing methods |
title_sort |
widescale analysis of transcriptomics data using cloud computing methods |
publisher |
University of Essex |
publishDate |
2016 |
url |
http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.681817 |
work_keys_str_mv |
AT owenannem widescaleanalysisoftranscriptomicsdatausingcloudcomputingmethods |
_version_ |
1718313135773843456 |