A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer
Cumulative evidence has shown that structural variations, due to insertions, deletions, and inversions of DNA, may contribute considerably to the development of complex human diseases, such as breast cancer. High-throughput genotyping technologies, such as Affymetrix high density single-nucleotide p...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SAGE Publishing
2014-01-01
|
Series: | Cancer Informatics |
Online Access: | https://doi.org/10.4137/CIN.S15203 |
id |
doaj-c5c1fbff0fbc42299b3723d0aa174770 |
---|---|
record_format |
Article |
spelling |
doaj-c5c1fbff0fbc42299b3723d0aa1747702020-11-25T03:41:16ZengSAGE PublishingCancer Informatics1176-93512014-01-0113s410.4137/CIN.S15203A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast CancerMing Li0Yalu Wen1Wenjiang Fu2Division of Biostatistics, Department of Pediatrics, University of Arkansas for Medical Sciences, Little Rock, AR, USA.Department of Epidemiology and Biostatistics, Michigan State University, East Lansing Ml, USA.Department of Mathematics, University of Houston, Houston, TX, USA.Cumulative evidence has shown that structural variations, due to insertions, deletions, and inversions of DNA, may contribute considerably to the development of complex human diseases, such as breast cancer. High-throughput genotyping technologies, such as Affymetrix high density single-nucleotide polymorphism (SNP) arrays, have produced large amounts of genetic data for genome-wide SNP genotype calling and copy number estimation. Meanwhile, there is a great need for accurate and efficient statistical methods to detect copy number variants. In this article, we introduce a hidden-Markov-model (HMM)-based method, referred to as the PICR-CNV, for copy number inference. The proposed method first estimates copy number abundance for each single SNP on a single array based on the raw fluorescence values, and then standardizes the estimated copy number abundance to achieve equal footing among multiple arrays. This method requires no between-array normalization, and thus, maintains data integrity and independence of samples among individual subjects. In addition to our efforts to apply new statistical technology to raw fluorescence values, the HMM has been applied to the standardized copy number abundance in order to reduce experimental noise. Through simulations, we show our refined method is able to infer copy number variants accurately. Application of the proposed method to a breast cancer dataset helps to identify genomic regions significantly associated with the disease.https://doi.org/10.4137/CIN.S15203 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ming Li Yalu Wen Wenjiang Fu |
spellingShingle |
Ming Li Yalu Wen Wenjiang Fu A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer Cancer Informatics |
author_facet |
Ming Li Yalu Wen Wenjiang Fu |
author_sort |
Ming Li |
title |
A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer |
title_short |
A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer |
title_full |
A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer |
title_fullStr |
A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer |
title_full_unstemmed |
A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer |
title_sort |
single-array-based method for detecting copy number variants using affymetrix high density snp arrays and its application to breast cancer |
publisher |
SAGE Publishing |
series |
Cancer Informatics |
issn |
1176-9351 |
publishDate |
2014-01-01 |
description |
Cumulative evidence has shown that structural variations, due to insertions, deletions, and inversions of DNA, may contribute considerably to the development of complex human diseases, such as breast cancer. High-throughput genotyping technologies, such as Affymetrix high density single-nucleotide polymorphism (SNP) arrays, have produced large amounts of genetic data for genome-wide SNP genotype calling and copy number estimation. Meanwhile, there is a great need for accurate and efficient statistical methods to detect copy number variants. In this article, we introduce a hidden-Markov-model (HMM)-based method, referred to as the PICR-CNV, for copy number inference. The proposed method first estimates copy number abundance for each single SNP on a single array based on the raw fluorescence values, and then standardizes the estimated copy number abundance to achieve equal footing among multiple arrays. This method requires no between-array normalization, and thus, maintains data integrity and independence of samples among individual subjects. In addition to our efforts to apply new statistical technology to raw fluorescence values, the HMM has been applied to the standardized copy number abundance in order to reduce experimental noise. Through simulations, we show our refined method is able to infer copy number variants accurately. Application of the proposed method to a breast cancer dataset helps to identify genomic regions significantly associated with the disease. |
url |
https://doi.org/10.4137/CIN.S15203 |
work_keys_str_mv |
AT mingli asinglearraybasedmethodfordetectingcopynumbervariantsusingaffymetrixhighdensitysnparraysanditsapplicationtobreastcancer AT yaluwen asinglearraybasedmethodfordetectingcopynumbervariantsusingaffymetrixhighdensitysnparraysanditsapplicationtobreastcancer AT wenjiangfu asinglearraybasedmethodfordetectingcopynumbervariantsusingaffymetrixhighdensitysnparraysanditsapplicationtobreastcancer AT mingli singlearraybasedmethodfordetectingcopynumbervariantsusingaffymetrixhighdensitysnparraysanditsapplicationtobreastcancer AT yaluwen singlearraybasedmethodfordetectingcopynumbervariantsusingaffymetrixhighdensitysnparraysanditsapplicationtobreastcancer AT wenjiangfu singlearraybasedmethodfordetectingcopynumbervariantsusingaffymetrixhighdensitysnparraysanditsapplicationtobreastcancer |
_version_ |
1724530757701992448 |