Generalized linear model for mapping discrete trait loci implemented with LASSO algorithm.
Generalized estimating equation (GEE) algorithm under a heterogeneous residual variance model is an extension of the iteratively reweighted least squares (IRLS) method for continuous traits to discrete traits. In contrast to mixture model-based expectation-maximization (EM) algorithm, the GEE algori...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2014-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC4161361?pdf=render |
id |
doaj-5ada01360eca41b0a7e765bc2a2cda89 |
---|---|
record_format |
Article |
spelling |
doaj-5ada01360eca41b0a7e765bc2a2cda892020-11-25T02:50:23ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0199e10698510.1371/journal.pone.0106985Generalized linear model for mapping discrete trait loci implemented with LASSO algorithm.Jun XingHuijiang GaoYang WuYani WuHongwang LiRunqing YangGeneralized estimating equation (GEE) algorithm under a heterogeneous residual variance model is an extension of the iteratively reweighted least squares (IRLS) method for continuous traits to discrete traits. In contrast to mixture model-based expectation-maximization (EM) algorithm, the GEE algorithm can well detect quantitative trait locus (QTL), especially large effect QTLs located in large marker intervals in the manner of high computing speed. Based on a single QTL model, however, the GEE algorithm has very limited statistical power to detect multiple QTLs because of ignoring other linked QTLs. In this study, the fast least absolute shrinkage and selection operator (LASSO) is derived for generalized linear model (GLM) with all possible link functions. Under a heterogeneous residual variance model, the LASSO for GLM is used to iteratively estimate the non-zero genetic effects of those loci over entire genome. The iteratively reweighted LASSO is therefore extended to mapping QTL for discrete traits, such as ordinal, binary, and Poisson traits. The simulated and real data analyses are conducted to demonstrate the efficiency of the proposed method to simultaneously identify multiple QTLs for binary and Poisson traits as examples.http://europepmc.org/articles/PMC4161361?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jun Xing Huijiang Gao Yang Wu Yani Wu Hongwang Li Runqing Yang |
spellingShingle |
Jun Xing Huijiang Gao Yang Wu Yani Wu Hongwang Li Runqing Yang Generalized linear model for mapping discrete trait loci implemented with LASSO algorithm. PLoS ONE |
author_facet |
Jun Xing Huijiang Gao Yang Wu Yani Wu Hongwang Li Runqing Yang |
author_sort |
Jun Xing |
title |
Generalized linear model for mapping discrete trait loci implemented with LASSO algorithm. |
title_short |
Generalized linear model for mapping discrete trait loci implemented with LASSO algorithm. |
title_full |
Generalized linear model for mapping discrete trait loci implemented with LASSO algorithm. |
title_fullStr |
Generalized linear model for mapping discrete trait loci implemented with LASSO algorithm. |
title_full_unstemmed |
Generalized linear model for mapping discrete trait loci implemented with LASSO algorithm. |
title_sort |
generalized linear model for mapping discrete trait loci implemented with lasso algorithm. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2014-01-01 |
description |
Generalized estimating equation (GEE) algorithm under a heterogeneous residual variance model is an extension of the iteratively reweighted least squares (IRLS) method for continuous traits to discrete traits. In contrast to mixture model-based expectation-maximization (EM) algorithm, the GEE algorithm can well detect quantitative trait locus (QTL), especially large effect QTLs located in large marker intervals in the manner of high computing speed. Based on a single QTL model, however, the GEE algorithm has very limited statistical power to detect multiple QTLs because of ignoring other linked QTLs. In this study, the fast least absolute shrinkage and selection operator (LASSO) is derived for generalized linear model (GLM) with all possible link functions. Under a heterogeneous residual variance model, the LASSO for GLM is used to iteratively estimate the non-zero genetic effects of those loci over entire genome. The iteratively reweighted LASSO is therefore extended to mapping QTL for discrete traits, such as ordinal, binary, and Poisson traits. The simulated and real data analyses are conducted to demonstrate the efficiency of the proposed method to simultaneously identify multiple QTLs for binary and Poisson traits as examples. |
url |
http://europepmc.org/articles/PMC4161361?pdf=render |
work_keys_str_mv |
AT junxing generalizedlinearmodelformappingdiscretetraitlociimplementedwithlassoalgorithm AT huijianggao generalizedlinearmodelformappingdiscretetraitlociimplementedwithlassoalgorithm AT yangwu generalizedlinearmodelformappingdiscretetraitlociimplementedwithlassoalgorithm AT yaniwu generalizedlinearmodelformappingdiscretetraitlociimplementedwithlassoalgorithm AT hongwangli generalizedlinearmodelformappingdiscretetraitlociimplementedwithlassoalgorithm AT runqingyang generalizedlinearmodelformappingdiscretetraitlociimplementedwithlassoalgorithm |
_version_ |
1724738900934524928 |