Analysis of binary responses with outcome-specific misclassification probability in genome-wide association studies

Romdhane Rekaya,1–3 Shannon Smith,4 El Hamidi Hay,5 Nourhene Farhat,6 Samuel E Aggrey3,7 1Department of Animal and Dairy Science, College of Agricultural and Environmental Sciences, 2Department of Statistics, Franklin College of Arts and Sciences, 3Institute of Bioinformatics, The Universi...

Full description

Bibliographic Details
Main Authors:	Rekaya R, Smith S, Hay EH, Farhat N, Aggrey SE
Format:	Article
Language:	English
Published:	Dove Medical Press 2016-11-01
Series:	The Application of Clinical Genetics
Subjects:	Binary responses Misclassification Specificity Sensitivity
Online Access:	https://www.dovepress.com/analysis-of-binary-responses-with-outcome-specific-misclassification-p-peer-reviewed-article-TACG

id	doaj-062d0b0c1d644795b3e0531be43ce53d
record_format	Article
spelling	doaj-062d0b0c1d644795b3e0531be43ce53d2020-11-24T21:04:48ZengDove Medical PressThe Application of Clinical Genetics1178-704X2016-11-01Volume 916917730283Analysis of binary responses with outcome-specific misclassification probability in genome-wide association studiesRekaya RSmith SHay EHFarhat NAggrey SERomdhane Rekaya,1–3 Shannon Smith,4 El Hamidi Hay,5 Nourhene Farhat,6 Samuel E Aggrey3,7 1Department of Animal and Dairy Science, College of Agricultural and Environmental Sciences, 2Department of Statistics, Franklin College of Arts and Sciences, 3Institute of Bioinformatics, The University of Georgia, Athens, GA, 4Zoetis, Kalamazoo, MI, 5United States Department of Agriculture, Agricultural Research Service, Beltsville, MD, 6Carolinas HealthCare System Blue Ridge, Morganton, NC, 7Department of Poultry Science, College of Agricultural and Environmental Sciences, University of Georgia, Athens, GA, USA Abstract: Errors in the binary status of some response traits are frequent in human, animal, and plant applications. These error rates tend to differ between cases and controls because diagnostic and screening tests have different sensitivity and specificity. This increases the inaccuracies of classifying individuals into correct groups, giving rise to both false-positive and false-negative cases. The analysis of these noisy binary responses due to misclassification will undoubtedly reduce the statistical power of genome-wide association studies (GWAS). A threshold model that accommodates varying diagnostic errors between cases and controls was investigated. A simulation study was carried out where several binary data sets (case–control) were generated with varying effects for the most influential single nucleotide polymorphisms (SNPs) and different diagnostic error rate for cases and controls. Each simulated data set consisted of 2000 individuals. Ignoring misclassification resulted in biased estimates of true influential SNP effects and inflated estimates for true noninfluential markers. A substantial reduction in bias and increase in accuracy ranging from 12% to 32% was observed when the misclassification procedure was invoked. In fact, the majority of influential SNPs that were not identified using the noisy data were captured using the proposed method. Additionally, truly misclassified binary records were identified with high probability using the proposed method. The superiority of the proposed method was maintained across different simulation parameters (misclassification rates and odds ratios) attesting to its robustness. Keywords: binary responses, misclassification, specificity, sensitivityhttps://www.dovepress.com/analysis-of-binary-responses-with-outcome-specific-misclassification-p-peer-reviewed-article-TACGBinary responsesMisclassificationSpecificitySensitivity
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Rekaya R Smith S Hay EH Farhat N Aggrey SE
spellingShingle	Rekaya R Smith S Hay EH Farhat N Aggrey SE Analysis of binary responses with outcome-specific misclassification probability in genome-wide association studies The Application of Clinical Genetics Binary responses Misclassification Specificity Sensitivity
author_facet	Rekaya R Smith S Hay EH Farhat N Aggrey SE
author_sort	Rekaya R
title	Analysis of binary responses with outcome-specific misclassification probability in genome-wide association studies
title_short	Analysis of binary responses with outcome-specific misclassification probability in genome-wide association studies
title_full	Analysis of binary responses with outcome-specific misclassification probability in genome-wide association studies
title_fullStr	Analysis of binary responses with outcome-specific misclassification probability in genome-wide association studies
title_full_unstemmed	Analysis of binary responses with outcome-specific misclassification probability in genome-wide association studies
title_sort	analysis of binary responses with outcome-specific misclassification probability in genome-wide association studies
publisher	Dove Medical Press
series	The Application of Clinical Genetics
issn	1178-704X
publishDate	2016-11-01
description	Romdhane Rekaya,1–3 Shannon Smith,4 El Hamidi Hay,5 Nourhene Farhat,6 Samuel E Aggrey3,7 1Department of Animal and Dairy Science, College of Agricultural and Environmental Sciences, 2Department of Statistics, Franklin College of Arts and Sciences, 3Institute of Bioinformatics, The University of Georgia, Athens, GA, 4Zoetis, Kalamazoo, MI, 5United States Department of Agriculture, Agricultural Research Service, Beltsville, MD, 6Carolinas HealthCare System Blue Ridge, Morganton, NC, 7Department of Poultry Science, College of Agricultural and Environmental Sciences, University of Georgia, Athens, GA, USA Abstract: Errors in the binary status of some response traits are frequent in human, animal, and plant applications. These error rates tend to differ between cases and controls because diagnostic and screening tests have different sensitivity and specificity. This increases the inaccuracies of classifying individuals into correct groups, giving rise to both false-positive and false-negative cases. The analysis of these noisy binary responses due to misclassification will undoubtedly reduce the statistical power of genome-wide association studies (GWAS). A threshold model that accommodates varying diagnostic errors between cases and controls was investigated. A simulation study was carried out where several binary data sets (case–control) were generated with varying effects for the most influential single nucleotide polymorphisms (SNPs) and different diagnostic error rate for cases and controls. Each simulated data set consisted of 2000 individuals. Ignoring misclassification resulted in biased estimates of true influential SNP effects and inflated estimates for true noninfluential markers. A substantial reduction in bias and increase in accuracy ranging from 12% to 32% was observed when the misclassification procedure was invoked. In fact, the majority of influential SNPs that were not identified using the noisy data were captured using the proposed method. Additionally, truly misclassified binary records were identified with high probability using the proposed method. The superiority of the proposed method was maintained across different simulation parameters (misclassification rates and odds ratios) attesting to its robustness. Keywords: binary responses, misclassification, specificity, sensitivity
topic	Binary responses Misclassification Specificity Sensitivity
url	https://www.dovepress.com/analysis-of-binary-responses-with-outcome-specific-misclassification-p-peer-reviewed-article-TACG
work_keys_str_mv	AT rekayar analysisofbinaryresponseswithoutcomespecificmisclassificationprobabilityingenomewideassociationstudies AT smiths analysisofbinaryresponseswithoutcomespecificmisclassificationprobabilityingenomewideassociationstudies AT hayeh analysisofbinaryresponseswithoutcomespecificmisclassificationprobabilityingenomewideassociationstudies AT farhatn analysisofbinaryresponseswithoutcomespecificmisclassificationprobabilityingenomewideassociationstudies AT aggreyse analysisofbinaryresponseswithoutcomespecificmisclassificationprobabilityingenomewideassociationstudies
_version_	1716769679040577536

Analysis of binary responses with outcome-specific misclassification probability in genome-wide association studies

Similar Items