Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumors
Abstract Background Cancer arises from an evolutionary process where somatic mutations give rise to clonal expansions. Reconstructing this evolutionary process is useful for treatment decision-making as well as understanding evolutionary patterns across patients and cancer types. In particular, clas...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2021-07-01
|
Series: | Algorithms for Molecular Biology |
Subjects: | |
Online Access: | https://doi.org/10.1186/s13015-021-00194-5 |
id |
doaj-f19439f0def244f3b8ac725cf95b16c7 |
---|---|
record_format |
Article |
spelling |
doaj-f19439f0def244f3b8ac725cf95b16c72021-07-11T11:03:17ZengBMCAlgorithms for Molecular Biology1748-71882021-07-0116111210.1186/s13015-021-00194-5Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumorsLeah L. Weber0Mohammed El-Kebir1Department of Computer Science, University of Illinois at Urbana-ChampaignDepartment of Computer Science, University of Illinois at Urbana-ChampaignAbstract Background Cancer arises from an evolutionary process where somatic mutations give rise to clonal expansions. Reconstructing this evolutionary process is useful for treatment decision-making as well as understanding evolutionary patterns across patients and cancer types. In particular, classifying a tumor’s evolutionary process as either linear or branched and understanding what cancer types and which patients have each of these trajectories could provide useful insights for both clinicians and researchers. While comprehensive cancer phylogeny inference from single-cell DNA sequencing data is challenging due to limitations with current sequencing technology and the complexity of the resulting problem, current data might provide sufficient signal to accurately classify a tumor’s evolutionary history as either linear or branched. Results We introduce the Linear Perfect Phylogeny Flipping (LPPF) problem as a means of testing two alternative hypotheses for the pattern of evolution, which we prove to be NP-hard. We develop Phyolin, which uses constraint programming to solve the LPPF problem. Through both in silico experiments and real data application, we demonstrate the performance of our method, outperforming a competing machine learning approach. Conclusion Phyolin is an accurate, easy to use and fast method for classifying an evolutionary trajectory as linear or branched given a tumor’s single-cell DNA sequencing data.https://doi.org/10.1186/s13015-021-00194-5Intra-tumor heterogeneityPerfect phylogenyConstraint programmingSingle-cell DNA sequencingPerfect phylogeny |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Leah L. Weber Mohammed El-Kebir |
spellingShingle |
Leah L. Weber Mohammed El-Kebir Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumors Algorithms for Molecular Biology Intra-tumor heterogeneity Perfect phylogeny Constraint programming Single-cell DNA sequencing Perfect phylogeny |
author_facet |
Leah L. Weber Mohammed El-Kebir |
author_sort |
Leah L. Weber |
title |
Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumors |
title_short |
Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumors |
title_full |
Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumors |
title_fullStr |
Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumors |
title_full_unstemmed |
Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumors |
title_sort |
distinguishing linear and branched evolution given single-cell dna sequencing data of tumors |
publisher |
BMC |
series |
Algorithms for Molecular Biology |
issn |
1748-7188 |
publishDate |
2021-07-01 |
description |
Abstract Background Cancer arises from an evolutionary process where somatic mutations give rise to clonal expansions. Reconstructing this evolutionary process is useful for treatment decision-making as well as understanding evolutionary patterns across patients and cancer types. In particular, classifying a tumor’s evolutionary process as either linear or branched and understanding what cancer types and which patients have each of these trajectories could provide useful insights for both clinicians and researchers. While comprehensive cancer phylogeny inference from single-cell DNA sequencing data is challenging due to limitations with current sequencing technology and the complexity of the resulting problem, current data might provide sufficient signal to accurately classify a tumor’s evolutionary history as either linear or branched. Results We introduce the Linear Perfect Phylogeny Flipping (LPPF) problem as a means of testing two alternative hypotheses for the pattern of evolution, which we prove to be NP-hard. We develop Phyolin, which uses constraint programming to solve the LPPF problem. Through both in silico experiments and real data application, we demonstrate the performance of our method, outperforming a competing machine learning approach. Conclusion Phyolin is an accurate, easy to use and fast method for classifying an evolutionary trajectory as linear or branched given a tumor’s single-cell DNA sequencing data. |
topic |
Intra-tumor heterogeneity Perfect phylogeny Constraint programming Single-cell DNA sequencing Perfect phylogeny |
url |
https://doi.org/10.1186/s13015-021-00194-5 |
work_keys_str_mv |
AT leahlweber distinguishinglinearandbranchedevolutiongivensinglecelldnasequencingdataoftumors AT mohammedelkebir distinguishinglinearandbranchedevolutiongivensinglecelldnasequencingdataoftumors |
_version_ |
1721309542847873024 |