Identification of MicroRNA biomarkers for cancer by combining multiple feature selection techniques

MicroRNAs (miRNAs) may serve as diagnostic and predictive biomarkers for cancer. The aim of this study was to identify novel cancer biomarkers from miRNA datasets, in addition to those already known. Three published miRNA cancer datasets (liver, breast, and brain) were evaluated, and the performance...

Full description

Bibliographic Details
Other Authors: Kotlarchyk, Alex J.
Format: Others
Language:English
Published: Florida Atlantic University
Subjects:
Online Access:http://purl.flvc.org/FAU/3332260
id ndltd-fau.edu-oai-fau.digital.flvc.org-fau_3783
record_format oai_dc
spelling ndltd-fau.edu-oai-fau.digital.flvc.org-fau_37832019-07-04T03:51:00Z Identification of MicroRNA biomarkers for cancer by combining multiple feature selection techniques Kotlarchyk, Alex J. Text Electronic Thesis or Dissertation Florida Atlantic University English ix, 112 p. : ill. electronic MicroRNAs (miRNAs) may serve as diagnostic and predictive biomarkers for cancer. The aim of this study was to identify novel cancer biomarkers from miRNA datasets, in addition to those already known. Three published miRNA cancer datasets (liver, breast, and brain) were evaluated, and the performance of the entire feature set was compared to the performance of individual feature filters, an ensemble of those filters, and a support vector machine (SVM) wrapper. In addition to confirming many known biomarkers, the main contribution of this study is that seven miRNAs have been newly identified by our ensemble methodology as possible important biomarkers for hepatocellular carcinoma or breast cancer, pending wet lab confirmation. These biomarkers were identified from miRNA expression datasets by combining multiple feature selection techniques (i.e., creating an ensemble) or by the SVM-wrapper, and then classified by different learners. Generally speaking, creating a subset of features by selecting only the highest ranking features (miRNAs) improved upon results generated when using all the miRNAs, and the ensemble and SVM-wrapper approaches outperformed individual feature selection methods. Finally, an algorithm to determine the number of top-ranked features to include in the creation of feature subsets was developed. This algorithm takes into account the performance improvement gained by adding additional features compared to the cost of adding those features. by Alex Kotlarchyk. Thesis (Ph.D.)--Florida Atlantic University, 2011. Includes bibliography. Electronic reproduction. Boca Raton, Fla., 2011. Mode of access: World Wide Web. Gene silencing Biochemical markers Cancer--Diagnosis--Data processing Cancer--Diagnosis--Research Gene expression Tumor markers--Diagnostic use http://purl.flvc.org/FAU/3332260 773920415 3332260 FADT3332260 fau:3783 College of Engineering and Computer Science Department of Computer and Electrical Engineering and Computer Science http://rightsstatements.org/vocab/InC/1.0/ https://fau.digital.flvc.org/islandora/object/fau%3A3783/datastream/TN/view/Identification%20of%20MicroRNA%20biomarkers%20for%20cancer%20by%20combining%20multiple%20feature%20selection%20techniques.jpg
collection NDLTD
language English
format Others
sources NDLTD
topic Gene silencing
Biochemical markers
Cancer--Diagnosis--Data processing
Cancer--Diagnosis--Research
Gene expression
Tumor markers--Diagnostic use
spellingShingle Gene silencing
Biochemical markers
Cancer--Diagnosis--Data processing
Cancer--Diagnosis--Research
Gene expression
Tumor markers--Diagnostic use
Identification of MicroRNA biomarkers for cancer by combining multiple feature selection techniques
description MicroRNAs (miRNAs) may serve as diagnostic and predictive biomarkers for cancer. The aim of this study was to identify novel cancer biomarkers from miRNA datasets, in addition to those already known. Three published miRNA cancer datasets (liver, breast, and brain) were evaluated, and the performance of the entire feature set was compared to the performance of individual feature filters, an ensemble of those filters, and a support vector machine (SVM) wrapper. In addition to confirming many known biomarkers, the main contribution of this study is that seven miRNAs have been newly identified by our ensemble methodology as possible important biomarkers for hepatocellular carcinoma or breast cancer, pending wet lab confirmation. These biomarkers were identified from miRNA expression datasets by combining multiple feature selection techniques (i.e., creating an ensemble) or by the SVM-wrapper, and then classified by different learners. Generally speaking, creating a subset of features by selecting only the highest ranking features (miRNAs) improved upon results generated when using all the miRNAs, and the ensemble and SVM-wrapper approaches outperformed individual feature selection methods. Finally, an algorithm to determine the number of top-ranked features to include in the creation of feature subsets was developed. This algorithm takes into account the performance improvement gained by adding additional features compared to the cost of adding those features. === by Alex Kotlarchyk. === Thesis (Ph.D.)--Florida Atlantic University, 2011. === Includes bibliography. === Electronic reproduction. Boca Raton, Fla., 2011. Mode of access: World Wide Web.
author2 Kotlarchyk, Alex J.
author_facet Kotlarchyk, Alex J.
title Identification of MicroRNA biomarkers for cancer by combining multiple feature selection techniques
title_short Identification of MicroRNA biomarkers for cancer by combining multiple feature selection techniques
title_full Identification of MicroRNA biomarkers for cancer by combining multiple feature selection techniques
title_fullStr Identification of MicroRNA biomarkers for cancer by combining multiple feature selection techniques
title_full_unstemmed Identification of MicroRNA biomarkers for cancer by combining multiple feature selection techniques
title_sort identification of microrna biomarkers for cancer by combining multiple feature selection techniques
publisher Florida Atlantic University
url http://purl.flvc.org/FAU/3332260
_version_ 1719218916069212160