Generalized linear model for mapping discrete trait loci implemented with LASSO algorithm.

Generalized estimating equation (GEE) algorithm under a heterogeneous residual variance model is an extension of the iteratively reweighted least squares (IRLS) method for continuous traits to discrete traits. In contrast to mixture model-based expectation-maximization (EM) algorithm, the GEE algori...

Full description

Bibliographic Details
Main Authors: Jun Xing, Huijiang Gao, Yang Wu, Yani Wu, Hongwang Li, Runqing Yang
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4161361?pdf=render
id doaj-5ada01360eca41b0a7e765bc2a2cda89
record_format Article
spelling doaj-5ada01360eca41b0a7e765bc2a2cda892020-11-25T02:50:23ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0199e10698510.1371/journal.pone.0106985Generalized linear model for mapping discrete trait loci implemented with LASSO algorithm.Jun XingHuijiang GaoYang WuYani WuHongwang LiRunqing YangGeneralized estimating equation (GEE) algorithm under a heterogeneous residual variance model is an extension of the iteratively reweighted least squares (IRLS) method for continuous traits to discrete traits. In contrast to mixture model-based expectation-maximization (EM) algorithm, the GEE algorithm can well detect quantitative trait locus (QTL), especially large effect QTLs located in large marker intervals in the manner of high computing speed. Based on a single QTL model, however, the GEE algorithm has very limited statistical power to detect multiple QTLs because of ignoring other linked QTLs. In this study, the fast least absolute shrinkage and selection operator (LASSO) is derived for generalized linear model (GLM) with all possible link functions. Under a heterogeneous residual variance model, the LASSO for GLM is used to iteratively estimate the non-zero genetic effects of those loci over entire genome. The iteratively reweighted LASSO is therefore extended to mapping QTL for discrete traits, such as ordinal, binary, and Poisson traits. The simulated and real data analyses are conducted to demonstrate the efficiency of the proposed method to simultaneously identify multiple QTLs for binary and Poisson traits as examples.http://europepmc.org/articles/PMC4161361?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Jun Xing
Huijiang Gao
Yang Wu
Yani Wu
Hongwang Li
Runqing Yang
spellingShingle Jun Xing
Huijiang Gao
Yang Wu
Yani Wu
Hongwang Li
Runqing Yang
Generalized linear model for mapping discrete trait loci implemented with LASSO algorithm.
PLoS ONE
author_facet Jun Xing
Huijiang Gao
Yang Wu
Yani Wu
Hongwang Li
Runqing Yang
author_sort Jun Xing
title Generalized linear model for mapping discrete trait loci implemented with LASSO algorithm.
title_short Generalized linear model for mapping discrete trait loci implemented with LASSO algorithm.
title_full Generalized linear model for mapping discrete trait loci implemented with LASSO algorithm.
title_fullStr Generalized linear model for mapping discrete trait loci implemented with LASSO algorithm.
title_full_unstemmed Generalized linear model for mapping discrete trait loci implemented with LASSO algorithm.
title_sort generalized linear model for mapping discrete trait loci implemented with lasso algorithm.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2014-01-01
description Generalized estimating equation (GEE) algorithm under a heterogeneous residual variance model is an extension of the iteratively reweighted least squares (IRLS) method for continuous traits to discrete traits. In contrast to mixture model-based expectation-maximization (EM) algorithm, the GEE algorithm can well detect quantitative trait locus (QTL), especially large effect QTLs located in large marker intervals in the manner of high computing speed. Based on a single QTL model, however, the GEE algorithm has very limited statistical power to detect multiple QTLs because of ignoring other linked QTLs. In this study, the fast least absolute shrinkage and selection operator (LASSO) is derived for generalized linear model (GLM) with all possible link functions. Under a heterogeneous residual variance model, the LASSO for GLM is used to iteratively estimate the non-zero genetic effects of those loci over entire genome. The iteratively reweighted LASSO is therefore extended to mapping QTL for discrete traits, such as ordinal, binary, and Poisson traits. The simulated and real data analyses are conducted to demonstrate the efficiency of the proposed method to simultaneously identify multiple QTLs for binary and Poisson traits as examples.
url http://europepmc.org/articles/PMC4161361?pdf=render
work_keys_str_mv AT junxing generalizedlinearmodelformappingdiscretetraitlociimplementedwithlassoalgorithm
AT huijianggao generalizedlinearmodelformappingdiscretetraitlociimplementedwithlassoalgorithm
AT yangwu generalizedlinearmodelformappingdiscretetraitlociimplementedwithlassoalgorithm
AT yaniwu generalizedlinearmodelformappingdiscretetraitlociimplementedwithlassoalgorithm
AT hongwangli generalizedlinearmodelformappingdiscretetraitlociimplementedwithlassoalgorithm
AT runqingyang generalizedlinearmodelformappingdiscretetraitlociimplementedwithlassoalgorithm
_version_ 1724738900934524928