A comparative analysis of family-based and population-based association tests using whole genome sequence data

The revolution in next-generation sequencing has made obtaining both common and rare high-quality sequence variants across the entire genome feasible. Because researchers are now faced with the analytical challenges of handling a massive amount of genetic variant information from sequencing studies,...

Full description

Bibliographic Details
Main Authors: Zhou, Jin, Yip, Wai-Ki, Cho, Michael, Qiao, Dandi, McDonald, Merry-Lynn, Laird, Nan
Other Authors: Biostatistics Department, Harvard School of Public Health, Boston, MA 02115 USA
Language:en
Published: BioMed Central 2014
Online Access:Zhou et al. BMC Proceedings 2014, 8(Suppl 1):S33 http://www.biomedcentral.com/1753-6561/8/S1/S33
http://hdl.handle.net/10150/610090
http://arizona.openrepository.com/arizona/handle/10150/610090
id ndltd-arizona.edu-oai-arizona.openrepository.com-10150-610090
record_format oai_dc
spelling ndltd-arizona.edu-oai-arizona.openrepository.com-10150-6100902016-05-22T03:01:31Z A comparative analysis of family-based and population-based association tests using whole genome sequence data Zhou, Jin Yip, Wai-Ki Cho, Michael Qiao, Dandi McDonald, Merry-Lynn Laird, Nan Biostatistics Department, Harvard School of Public Health, Boston, MA 02115 USA Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA Division of Epidemiology and Biostatistics, College of Public Health, University of Arizona, Tucson, AZ 85724, USA The revolution in next-generation sequencing has made obtaining both common and rare high-quality sequence variants across the entire genome feasible. Because researchers are now faced with the analytical challenges of handling a massive amount of genetic variant information from sequencing studies, numerous methods have been developed to assess the impact of both common and rare variants on disease traits. In this report, whole genome sequencing data from Genetic Analysis Workshop 18 was used to compare the power of several methods, considering both family-based and population-based designs, to detect association with variants in the MAP4 gene region and on chromosome 3 with blood pressure. To prioritize variants across the genome for testing, variants were first functionally assessed using prediction algorithms and expression quantitative trait loci (eQTLs) data. Four set-based tests in the family-based association tests (FBAT) framework--FBAT-v, FBAT-lmm, FBAT-m, and FBAT-l--were used to analyze 20 pedigrees, and 2 variance component tests, sequence kernel association test (SKAT) and genome-wide complex trait analysis (GCTA), were used with 142 unrelated individuals in the sample. Both set-based and variance-component-based tests had high power and an adequate type I error rate. Of the various FBATs, FBAT-l demonstrated superior performance, indicating the potential for it to be used in rare-variant analysis. The updated FBAT package is available at: http://www.hsph.harvard.edu/fbat/ webcite. 2014 Article Zhou et al. BMC Proceedings 2014, 8(Suppl 1):S33 http://www.biomedcentral.com/1753-6561/8/S1/S33 10.1186/1753-6561-8-S1-S33 http://hdl.handle.net/10150/610090 http://arizona.openrepository.com/arizona/handle/10150/610090 1753-6561 BMC Proceedings en http://www.biomedcentral.com/1753-6561/8/S1/S33 © 2014 Zhou et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0) BioMed Central
collection NDLTD
language en
sources NDLTD
description The revolution in next-generation sequencing has made obtaining both common and rare high-quality sequence variants across the entire genome feasible. Because researchers are now faced with the analytical challenges of handling a massive amount of genetic variant information from sequencing studies, numerous methods have been developed to assess the impact of both common and rare variants on disease traits. In this report, whole genome sequencing data from Genetic Analysis Workshop 18 was used to compare the power of several methods, considering both family-based and population-based designs, to detect association with variants in the MAP4 gene region and on chromosome 3 with blood pressure. To prioritize variants across the genome for testing, variants were first functionally assessed using prediction algorithms and expression quantitative trait loci (eQTLs) data. Four set-based tests in the family-based association tests (FBAT) framework--FBAT-v, FBAT-lmm, FBAT-m, and FBAT-l--were used to analyze 20 pedigrees, and 2 variance component tests, sequence kernel association test (SKAT) and genome-wide complex trait analysis (GCTA), were used with 142 unrelated individuals in the sample. Both set-based and variance-component-based tests had high power and an adequate type I error rate. Of the various FBATs, FBAT-l demonstrated superior performance, indicating the potential for it to be used in rare-variant analysis. The updated FBAT package is available at: http://www.hsph.harvard.edu/fbat/ webcite.
author2 Biostatistics Department, Harvard School of Public Health, Boston, MA 02115 USA
author_facet Biostatistics Department, Harvard School of Public Health, Boston, MA 02115 USA
Zhou, Jin
Yip, Wai-Ki
Cho, Michael
Qiao, Dandi
McDonald, Merry-Lynn
Laird, Nan
author Zhou, Jin
Yip, Wai-Ki
Cho, Michael
Qiao, Dandi
McDonald, Merry-Lynn
Laird, Nan
spellingShingle Zhou, Jin
Yip, Wai-Ki
Cho, Michael
Qiao, Dandi
McDonald, Merry-Lynn
Laird, Nan
A comparative analysis of family-based and population-based association tests using whole genome sequence data
author_sort Zhou, Jin
title A comparative analysis of family-based and population-based association tests using whole genome sequence data
title_short A comparative analysis of family-based and population-based association tests using whole genome sequence data
title_full A comparative analysis of family-based and population-based association tests using whole genome sequence data
title_fullStr A comparative analysis of family-based and population-based association tests using whole genome sequence data
title_full_unstemmed A comparative analysis of family-based and population-based association tests using whole genome sequence data
title_sort comparative analysis of family-based and population-based association tests using whole genome sequence data
publisher BioMed Central
publishDate 2014
url Zhou et al. BMC Proceedings 2014, 8(Suppl 1):S33 http://www.biomedcentral.com/1753-6561/8/S1/S33
http://hdl.handle.net/10150/610090
http://arizona.openrepository.com/arizona/handle/10150/610090
work_keys_str_mv AT zhoujin acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT yipwaiki acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT chomichael acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT qiaodandi acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT mcdonaldmerrylynn acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT lairdnan acomparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT zhoujin comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT yipwaiki comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT chomichael comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT qiaodandi comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT mcdonaldmerrylynn comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
AT lairdnan comparativeanalysisoffamilybasedandpopulationbasedassociationtestsusingwholegenomesequencedata
_version_ 1718273988926373888