Bioinformatics analyses of alternative splicing, est-based and machine learning-based prediction
Master of Science === Department of Computing and Information Sciences === William H. Hsu === Alternative splicing is a mechanism for generating different gene transcripts (called iso- forms) from the same genomic sequence. Finding alternative splicing events experimentally is both expensive and...
Main Author: | |
---|---|
Language: | en_US |
Published: |
Kansas State University
2008
|
Subjects: | |
Online Access: | http://hdl.handle.net/2097/1113 |
id |
ndltd-KSU-oai-krex.k-state.edu-2097-1113 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-KSU-oai-krex.k-state.edu-2097-11132016-03-01T03:50:00Z Bioinformatics analyses of alternative splicing, est-based and machine learning-based prediction Xia, Jing support vector machine alternative splicing Computer Science (0984) Master of Science Department of Computing and Information Sciences William H. Hsu Alternative splicing is a mechanism for generating different gene transcripts (called iso- forms) from the same genomic sequence. Finding alternative splicing events experimentally is both expensive and time consuming. Computational methods in general, and EST analy- sis and machine learning algorithms in particular, can be used to complement experimental methods in the process of identifying alternative splicing events. In this thesis, I first iden- tify alternative splicing exons by analyzing EST-genome alignment. Next, I explore the predictive power of a rich set of features that have been experimentally shown to affect al- ternative splicing. I use these features to build support vector machine (SVM) classifiers for distinguishing between alternatively spliced exons and constitutive exons. My results show that simple, linear SVM classifiers built from a rich set of features give results comparable to those of more sophisticated SVM classifiers that use more basic sequence features. Finally, I use feature selection methods to identify computationally the most informative features for the prediction problem considered. 2008-12-22T14:36:12Z 2008-12-22T14:36:12Z 2008-12-22T14:36:12Z 2008 December Thesis http://hdl.handle.net/2097/1113 en_US Kansas State University |
collection |
NDLTD |
language |
en_US |
sources |
NDLTD |
topic |
support vector machine alternative splicing Computer Science (0984) |
spellingShingle |
support vector machine alternative splicing Computer Science (0984) Xia, Jing Bioinformatics analyses of alternative splicing, est-based and machine learning-based prediction |
description |
Master of Science === Department of Computing and Information Sciences === William H. Hsu === Alternative splicing is a mechanism for generating different gene transcripts (called iso-
forms) from the same genomic sequence. Finding alternative splicing events experimentally
is both expensive and time consuming. Computational methods in general, and EST analy-
sis and machine learning algorithms in particular, can be used to complement experimental
methods in the process of identifying alternative splicing events. In this thesis, I first iden-
tify alternative splicing exons by analyzing EST-genome alignment. Next, I explore the
predictive power of a rich set of features that have been experimentally shown to affect al-
ternative splicing. I use these features to build support vector machine (SVM) classifiers for
distinguishing between alternatively spliced exons and constitutive exons. My results show
that simple, linear SVM classifiers built from a rich set of features give results comparable to
those of more sophisticated SVM classifiers that use more basic sequence features. Finally,
I use feature selection methods to identify computationally the most informative features
for the prediction problem considered. |
author |
Xia, Jing |
author_facet |
Xia, Jing |
author_sort |
Xia, Jing |
title |
Bioinformatics analyses of alternative splicing, est-based and machine learning-based
prediction |
title_short |
Bioinformatics analyses of alternative splicing, est-based and machine learning-based
prediction |
title_full |
Bioinformatics analyses of alternative splicing, est-based and machine learning-based
prediction |
title_fullStr |
Bioinformatics analyses of alternative splicing, est-based and machine learning-based
prediction |
title_full_unstemmed |
Bioinformatics analyses of alternative splicing, est-based and machine learning-based
prediction |
title_sort |
bioinformatics analyses of alternative splicing, est-based and machine learning-based
prediction |
publisher |
Kansas State University |
publishDate |
2008 |
url |
http://hdl.handle.net/2097/1113 |
work_keys_str_mv |
AT xiajing bioinformaticsanalysesofalternativesplicingestbasedandmachinelearningbasedprediction |
_version_ |
1718196307934314496 |