A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer

Cumulative evidence has shown that structural variations, due to insertions, deletions, and inversions of DNA, may contribute considerably to the development of complex human diseases, such as breast cancer. High-throughput genotyping technologies, such as Affymetrix high density single-nucleotide p...

Full description

Bibliographic Details
Main Authors: Ming Li, Yalu Wen, Wenjiang Fu
Format: Article
Language:English
Published: SAGE Publishing 2014-01-01
Series:Cancer Informatics
Online Access:https://doi.org/10.4137/CIN.S15203
id doaj-c5c1fbff0fbc42299b3723d0aa174770
record_format Article
spelling doaj-c5c1fbff0fbc42299b3723d0aa1747702020-11-25T03:41:16ZengSAGE PublishingCancer Informatics1176-93512014-01-0113s410.4137/CIN.S15203A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast CancerMing Li0Yalu Wen1Wenjiang Fu2Division of Biostatistics, Department of Pediatrics, University of Arkansas for Medical Sciences, Little Rock, AR, USA.Department of Epidemiology and Biostatistics, Michigan State University, East Lansing Ml, USA.Department of Mathematics, University of Houston, Houston, TX, USA.Cumulative evidence has shown that structural variations, due to insertions, deletions, and inversions of DNA, may contribute considerably to the development of complex human diseases, such as breast cancer. High-throughput genotyping technologies, such as Affymetrix high density single-nucleotide polymorphism (SNP) arrays, have produced large amounts of genetic data for genome-wide SNP genotype calling and copy number estimation. Meanwhile, there is a great need for accurate and efficient statistical methods to detect copy number variants. In this article, we introduce a hidden-Markov-model (HMM)-based method, referred to as the PICR-CNV, for copy number inference. The proposed method first estimates copy number abundance for each single SNP on a single array based on the raw fluorescence values, and then standardizes the estimated copy number abundance to achieve equal footing among multiple arrays. This method requires no between-array normalization, and thus, maintains data integrity and independence of samples among individual subjects. In addition to our efforts to apply new statistical technology to raw fluorescence values, the HMM has been applied to the standardized copy number abundance in order to reduce experimental noise. Through simulations, we show our refined method is able to infer copy number variants accurately. Application of the proposed method to a breast cancer dataset helps to identify genomic regions significantly associated with the disease.https://doi.org/10.4137/CIN.S15203
collection DOAJ
language English
format Article
sources DOAJ
author Ming Li
Yalu Wen
Wenjiang Fu
spellingShingle Ming Li
Yalu Wen
Wenjiang Fu
A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer
Cancer Informatics
author_facet Ming Li
Yalu Wen
Wenjiang Fu
author_sort Ming Li
title A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer
title_short A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer
title_full A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer
title_fullStr A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer
title_full_unstemmed A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer
title_sort single-array-based method for detecting copy number variants using affymetrix high density snp arrays and its application to breast cancer
publisher SAGE Publishing
series Cancer Informatics
issn 1176-9351
publishDate 2014-01-01
description Cumulative evidence has shown that structural variations, due to insertions, deletions, and inversions of DNA, may contribute considerably to the development of complex human diseases, such as breast cancer. High-throughput genotyping technologies, such as Affymetrix high density single-nucleotide polymorphism (SNP) arrays, have produced large amounts of genetic data for genome-wide SNP genotype calling and copy number estimation. Meanwhile, there is a great need for accurate and efficient statistical methods to detect copy number variants. In this article, we introduce a hidden-Markov-model (HMM)-based method, referred to as the PICR-CNV, for copy number inference. The proposed method first estimates copy number abundance for each single SNP on a single array based on the raw fluorescence values, and then standardizes the estimated copy number abundance to achieve equal footing among multiple arrays. This method requires no between-array normalization, and thus, maintains data integrity and independence of samples among individual subjects. In addition to our efforts to apply new statistical technology to raw fluorescence values, the HMM has been applied to the standardized copy number abundance in order to reduce experimental noise. Through simulations, we show our refined method is able to infer copy number variants accurately. Application of the proposed method to a breast cancer dataset helps to identify genomic regions significantly associated with the disease.
url https://doi.org/10.4137/CIN.S15203
work_keys_str_mv AT mingli asinglearraybasedmethodfordetectingcopynumbervariantsusingaffymetrixhighdensitysnparraysanditsapplicationtobreastcancer
AT yaluwen asinglearraybasedmethodfordetectingcopynumbervariantsusingaffymetrixhighdensitysnparraysanditsapplicationtobreastcancer
AT wenjiangfu asinglearraybasedmethodfordetectingcopynumbervariantsusingaffymetrixhighdensitysnparraysanditsapplicationtobreastcancer
AT mingli singlearraybasedmethodfordetectingcopynumbervariantsusingaffymetrixhighdensitysnparraysanditsapplicationtobreastcancer
AT yaluwen singlearraybasedmethodfordetectingcopynumbervariantsusingaffymetrixhighdensitysnparraysanditsapplicationtobreastcancer
AT wenjiangfu singlearraybasedmethodfordetectingcopynumbervariantsusingaffymetrixhighdensitysnparraysanditsapplicationtobreastcancer
_version_ 1724530757701992448