Simple binary segmentation frameworks for identifying variation in DNA copy number

<p>Abstract</p> <p>Background</p> <p>Variation in DNA copy number, due to gains and losses of chromosome segments, is common. A first step for analyzing DNA copy number data is to identify amplified or deleted regions in individuals. To locate such regions, we propose a...

Full description

Bibliographic Details
Main Author: Yang Tae Young
Format: Article
Language:English
Published: BMC 2012-10-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://www.biomedcentral.com/1471-2105/13/277
id doaj-35228ec175634010b3eac87bfe307a86
record_format Article
spelling doaj-35228ec175634010b3eac87bfe307a862020-11-24T23:21:22ZengBMCBMC Bioinformatics1471-21052012-10-0113127710.1186/1471-2105-13-277Simple binary segmentation frameworks for identifying variation in DNA copy numberYang Tae Young<p>Abstract</p> <p>Background</p> <p>Variation in DNA copy number, due to gains and losses of chromosome segments, is common. A first step for analyzing DNA copy number data is to identify amplified or deleted regions in individuals. To locate such regions, we propose a circular binary segmentation procedure, which is based on a sequence of nested hypothesis tests, each using the Bayesian information criterion.</p> <p>Results</p> <p>Our procedure is convenient for analyzing DNA copy number in two general situations: (1) when using data from multiple sources and (2) when using cohort analysis of multiple patients suffering from the same type of cancer. In the first case, data from multiple sources such as different platforms, labs, or preprocessing methods are used to study variation in copy number in the same individual. Combining these sources provides a higher resolution, which leads to a more detailed genome-wide survey of the individual. In this case, we provide a simple statistical framework to derive a consensus molecular signature. In the framework, the multiple sequences from various sources are integrated into a single sequence, and then the proposed segmentation procedure is applied to this sequence to detect aberrant regions. In the second case, cohort analysis of multiple patients is carried out to derive overall molecular signatures for the cohort. For this case, we provide another simple statistical framework in which data across multiple profiles is standardized before segmentation. The proposed segmentation procedure is then applied to the standardized profiles one at a time to detect aberrant regions. Any such regions that are common across two or more profiles are probably real and may play important roles in the cancer pathogenesis process.</p> <p>Conclusions</p> <p>The main advantages of the proposed procedure are flexibility and simplicity.</p> http://www.biomedcentral.com/1471-2105/13/277Bayesian information criterionCircular binary segmentationConsensus molecular signatureOverall molecular signatureVariation in DNA copy number
collection DOAJ
language English
format Article
sources DOAJ
author Yang Tae Young
spellingShingle Yang Tae Young
Simple binary segmentation frameworks for identifying variation in DNA copy number
BMC Bioinformatics
Bayesian information criterion
Circular binary segmentation
Consensus molecular signature
Overall molecular signature
Variation in DNA copy number
author_facet Yang Tae Young
author_sort Yang Tae Young
title Simple binary segmentation frameworks for identifying variation in DNA copy number
title_short Simple binary segmentation frameworks for identifying variation in DNA copy number
title_full Simple binary segmentation frameworks for identifying variation in DNA copy number
title_fullStr Simple binary segmentation frameworks for identifying variation in DNA copy number
title_full_unstemmed Simple binary segmentation frameworks for identifying variation in DNA copy number
title_sort simple binary segmentation frameworks for identifying variation in dna copy number
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2012-10-01
description <p>Abstract</p> <p>Background</p> <p>Variation in DNA copy number, due to gains and losses of chromosome segments, is common. A first step for analyzing DNA copy number data is to identify amplified or deleted regions in individuals. To locate such regions, we propose a circular binary segmentation procedure, which is based on a sequence of nested hypothesis tests, each using the Bayesian information criterion.</p> <p>Results</p> <p>Our procedure is convenient for analyzing DNA copy number in two general situations: (1) when using data from multiple sources and (2) when using cohort analysis of multiple patients suffering from the same type of cancer. In the first case, data from multiple sources such as different platforms, labs, or preprocessing methods are used to study variation in copy number in the same individual. Combining these sources provides a higher resolution, which leads to a more detailed genome-wide survey of the individual. In this case, we provide a simple statistical framework to derive a consensus molecular signature. In the framework, the multiple sequences from various sources are integrated into a single sequence, and then the proposed segmentation procedure is applied to this sequence to detect aberrant regions. In the second case, cohort analysis of multiple patients is carried out to derive overall molecular signatures for the cohort. For this case, we provide another simple statistical framework in which data across multiple profiles is standardized before segmentation. The proposed segmentation procedure is then applied to the standardized profiles one at a time to detect aberrant regions. Any such regions that are common across two or more profiles are probably real and may play important roles in the cancer pathogenesis process.</p> <p>Conclusions</p> <p>The main advantages of the proposed procedure are flexibility and simplicity.</p>
topic Bayesian information criterion
Circular binary segmentation
Consensus molecular signature
Overall molecular signature
Variation in DNA copy number
url http://www.biomedcentral.com/1471-2105/13/277
work_keys_str_mv AT yangtaeyoung simplebinarysegmentationframeworksforidentifyingvariationindnacopynumber
_version_ 1725571697636016128