Bioinformatics analyses of alternative splicing, est-based and machine learning-based prediction

Master of Science === Department of Computing and Information Sciences === William H. Hsu === Alternative splicing is a mechanism for generating different gene transcripts (called iso- forms) from the same genomic sequence. Finding alternative splicing events experimentally is both expensive and...

Full description

Bibliographic Details
Main Author: Xia, Jing
Language:en_US
Published: Kansas State University 2008
Subjects:
Online Access:http://hdl.handle.net/2097/1113
id ndltd-KSU-oai-krex.k-state.edu-2097-1113
record_format oai_dc
spelling ndltd-KSU-oai-krex.k-state.edu-2097-11132016-03-01T03:50:00Z Bioinformatics analyses of alternative splicing, est-based and machine learning-based prediction Xia, Jing support vector machine alternative splicing Computer Science (0984) Master of Science Department of Computing and Information Sciences William H. Hsu Alternative splicing is a mechanism for generating different gene transcripts (called iso- forms) from the same genomic sequence. Finding alternative splicing events experimentally is both expensive and time consuming. Computational methods in general, and EST analy- sis and machine learning algorithms in particular, can be used to complement experimental methods in the process of identifying alternative splicing events. In this thesis, I first iden- tify alternative splicing exons by analyzing EST-genome alignment. Next, I explore the predictive power of a rich set of features that have been experimentally shown to affect al- ternative splicing. I use these features to build support vector machine (SVM) classifiers for distinguishing between alternatively spliced exons and constitutive exons. My results show that simple, linear SVM classifiers built from a rich set of features give results comparable to those of more sophisticated SVM classifiers that use more basic sequence features. Finally, I use feature selection methods to identify computationally the most informative features for the prediction problem considered. 2008-12-22T14:36:12Z 2008-12-22T14:36:12Z 2008-12-22T14:36:12Z 2008 December Thesis http://hdl.handle.net/2097/1113 en_US Kansas State University
collection NDLTD
language en_US
sources NDLTD
topic support vector machine
alternative splicing
Computer Science (0984)
spellingShingle support vector machine
alternative splicing
Computer Science (0984)
Xia, Jing
Bioinformatics analyses of alternative splicing, est-based and machine learning-based prediction
description Master of Science === Department of Computing and Information Sciences === William H. Hsu === Alternative splicing is a mechanism for generating different gene transcripts (called iso- forms) from the same genomic sequence. Finding alternative splicing events experimentally is both expensive and time consuming. Computational methods in general, and EST analy- sis and machine learning algorithms in particular, can be used to complement experimental methods in the process of identifying alternative splicing events. In this thesis, I first iden- tify alternative splicing exons by analyzing EST-genome alignment. Next, I explore the predictive power of a rich set of features that have been experimentally shown to affect al- ternative splicing. I use these features to build support vector machine (SVM) classifiers for distinguishing between alternatively spliced exons and constitutive exons. My results show that simple, linear SVM classifiers built from a rich set of features give results comparable to those of more sophisticated SVM classifiers that use more basic sequence features. Finally, I use feature selection methods to identify computationally the most informative features for the prediction problem considered.
author Xia, Jing
author_facet Xia, Jing
author_sort Xia, Jing
title Bioinformatics analyses of alternative splicing, est-based and machine learning-based prediction
title_short Bioinformatics analyses of alternative splicing, est-based and machine learning-based prediction
title_full Bioinformatics analyses of alternative splicing, est-based and machine learning-based prediction
title_fullStr Bioinformatics analyses of alternative splicing, est-based and machine learning-based prediction
title_full_unstemmed Bioinformatics analyses of alternative splicing, est-based and machine learning-based prediction
title_sort bioinformatics analyses of alternative splicing, est-based and machine learning-based prediction
publisher Kansas State University
publishDate 2008
url http://hdl.handle.net/2097/1113
work_keys_str_mv AT xiajing bioinformaticsanalysesofalternativesplicingestbasedandmachinelearningbasedprediction
_version_ 1718196307934314496