Conditional Random Field with Lasso and its Application to the Classification of Barley Genes Based on Expression Level Affected by Fungal Infection

The classification problem of gene expression level, more specifically, gene expression analysis, is a major research area in statistics. There are several classical methods to solve the classification problem. To apply Logistic Regression Model (LRM) and other classical methods, the observations in...

Full description

Bibliographic Details
Main Author: Liu, Xiyuan
Format: Others
Published: North Dakota State University 2019
Online Access:https://hdl.handle.net/10365/29518
id ndltd-ndsu.edu-oai-library.ndsu.edu-10365-29518
record_format oai_dc
spelling ndltd-ndsu.edu-oai-library.ndsu.edu-10365-295182021-10-01T17:09:58Z Conditional Random Field with Lasso and its Application to the Classification of Barley Genes Based on Expression Level Affected by Fungal Infection Conditional Random Fields with Lasso and its Application to the Classification of Relationships between Plant Genes Expression Level and Fungus Genes Liu, Xiyuan The classification problem of gene expression level, more specifically, gene expression analysis, is a major research area in statistics. There are several classical methods to solve the classification problem. To apply Logistic Regression Model (LRM) and other classical methods, the observations in the dataset should fit the assumption of independence. That is, the observations in the dataset are independent to each other, and the predictor (independent variable) should be independent. These assumptions are usually violated in gene expression analysis. Although the Classical Hidden Markov Chain Model (HMM) can solve the independence of observation problem, the classical HMM requires the independent variables in the dataset are discrete and independent. Unfortunately, the gene expression level is a continuous variable. To solve the classification problem of Gene Expression Level data, the Conditional Random Field(CRF) is introduce. Finally, the Least Absolute Selection and Shrinkage Operator (LASSO) penalty, a dimensional reduction method, is introduced to improve the CRF model. 2019-04-05T19:23:02Z 2019-04-05T19:23:02Z 2019 text/dissertation movingimage/video https://hdl.handle.net/10365/29518 application/pdf video/mp4 North Dakota State University
collection NDLTD
format Others
sources NDLTD
description The classification problem of gene expression level, more specifically, gene expression analysis, is a major research area in statistics. There are several classical methods to solve the classification problem. To apply Logistic Regression Model (LRM) and other classical methods, the observations in the dataset should fit the assumption of independence. That is, the observations in the dataset are independent to each other, and the predictor (independent variable) should be independent. These assumptions are usually violated in gene expression analysis. Although the Classical Hidden Markov Chain Model (HMM) can solve the independence of observation problem, the classical HMM requires the independent variables in the dataset are discrete and independent. Unfortunately, the gene expression level is a continuous variable. To solve the classification problem of Gene Expression Level data, the Conditional Random Field(CRF) is introduce. Finally, the Least Absolute Selection and Shrinkage Operator (LASSO) penalty, a dimensional reduction method, is introduced to improve the CRF model.
author Liu, Xiyuan
spellingShingle Liu, Xiyuan
Conditional Random Field with Lasso and its Application to the Classification of Barley Genes Based on Expression Level Affected by Fungal Infection
author_facet Liu, Xiyuan
author_sort Liu, Xiyuan
title Conditional Random Field with Lasso and its Application to the Classification of Barley Genes Based on Expression Level Affected by Fungal Infection
title_short Conditional Random Field with Lasso and its Application to the Classification of Barley Genes Based on Expression Level Affected by Fungal Infection
title_full Conditional Random Field with Lasso and its Application to the Classification of Barley Genes Based on Expression Level Affected by Fungal Infection
title_fullStr Conditional Random Field with Lasso and its Application to the Classification of Barley Genes Based on Expression Level Affected by Fungal Infection
title_full_unstemmed Conditional Random Field with Lasso and its Application to the Classification of Barley Genes Based on Expression Level Affected by Fungal Infection
title_sort conditional random field with lasso and its application to the classification of barley genes based on expression level affected by fungal infection
publisher North Dakota State University
publishDate 2019
url https://hdl.handle.net/10365/29518
work_keys_str_mv AT liuxiyuan conditionalrandomfieldwithlassoanditsapplicationtotheclassificationofbarleygenesbasedonexpressionlevelaffectedbyfungalinfection
AT liuxiyuan conditionalrandomfieldswithlassoanditsapplicationtotheclassificationofrelationshipsbetweenplantgenesexpressionlevelandfungusgenes
_version_ 1719486674594955264