Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumors

Abstract Background Cancer arises from an evolutionary process where somatic mutations give rise to clonal expansions. Reconstructing this evolutionary process is useful for treatment decision-making as well as understanding evolutionary patterns across patients and cancer types. In particular, clas...

Full description

Bibliographic Details
Main Authors: Leah L. Weber, Mohammed El-Kebir
Format: Article
Language:English
Published: BMC 2021-07-01
Series:Algorithms for Molecular Biology
Subjects:
Online Access:https://doi.org/10.1186/s13015-021-00194-5
id doaj-f19439f0def244f3b8ac725cf95b16c7
record_format Article
spelling doaj-f19439f0def244f3b8ac725cf95b16c72021-07-11T11:03:17ZengBMCAlgorithms for Molecular Biology1748-71882021-07-0116111210.1186/s13015-021-00194-5Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumorsLeah L. Weber0Mohammed El-Kebir1Department of Computer Science, University of Illinois at Urbana-ChampaignDepartment of Computer Science, University of Illinois at Urbana-ChampaignAbstract Background Cancer arises from an evolutionary process where somatic mutations give rise to clonal expansions. Reconstructing this evolutionary process is useful for treatment decision-making as well as understanding evolutionary patterns across patients and cancer types. In particular, classifying a tumor’s evolutionary process as either linear or branched and understanding what cancer types and which patients have each of these trajectories could provide useful insights for both clinicians and researchers. While comprehensive cancer phylogeny inference from single-cell DNA sequencing data is challenging due to limitations with current sequencing technology and the complexity of the resulting problem, current data might provide sufficient signal to accurately classify a tumor’s evolutionary history as either linear or branched. Results We introduce the Linear Perfect Phylogeny Flipping (LPPF) problem as a means of testing two alternative hypotheses for the pattern of evolution, which we prove to be NP-hard. We develop Phyolin, which uses constraint programming to solve the LPPF problem. Through both in silico experiments and real data application, we demonstrate the performance of our method, outperforming a competing machine learning approach. Conclusion Phyolin is an accurate, easy to use and fast method for classifying an evolutionary trajectory as linear or branched given a tumor’s single-cell DNA sequencing data.https://doi.org/10.1186/s13015-021-00194-5Intra-tumor heterogeneityPerfect phylogenyConstraint programmingSingle-cell DNA sequencingPerfect phylogeny
collection DOAJ
language English
format Article
sources DOAJ
author Leah L. Weber
Mohammed El-Kebir
spellingShingle Leah L. Weber
Mohammed El-Kebir
Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumors
Algorithms for Molecular Biology
Intra-tumor heterogeneity
Perfect phylogeny
Constraint programming
Single-cell DNA sequencing
Perfect phylogeny
author_facet Leah L. Weber
Mohammed El-Kebir
author_sort Leah L. Weber
title Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumors
title_short Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumors
title_full Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumors
title_fullStr Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumors
title_full_unstemmed Distinguishing linear and branched evolution given single-cell DNA sequencing data of tumors
title_sort distinguishing linear and branched evolution given single-cell dna sequencing data of tumors
publisher BMC
series Algorithms for Molecular Biology
issn 1748-7188
publishDate 2021-07-01
description Abstract Background Cancer arises from an evolutionary process where somatic mutations give rise to clonal expansions. Reconstructing this evolutionary process is useful for treatment decision-making as well as understanding evolutionary patterns across patients and cancer types. In particular, classifying a tumor’s evolutionary process as either linear or branched and understanding what cancer types and which patients have each of these trajectories could provide useful insights for both clinicians and researchers. While comprehensive cancer phylogeny inference from single-cell DNA sequencing data is challenging due to limitations with current sequencing technology and the complexity of the resulting problem, current data might provide sufficient signal to accurately classify a tumor’s evolutionary history as either linear or branched. Results We introduce the Linear Perfect Phylogeny Flipping (LPPF) problem as a means of testing two alternative hypotheses for the pattern of evolution, which we prove to be NP-hard. We develop Phyolin, which uses constraint programming to solve the LPPF problem. Through both in silico experiments and real data application, we demonstrate the performance of our method, outperforming a competing machine learning approach. Conclusion Phyolin is an accurate, easy to use and fast method for classifying an evolutionary trajectory as linear or branched given a tumor’s single-cell DNA sequencing data.
topic Intra-tumor heterogeneity
Perfect phylogeny
Constraint programming
Single-cell DNA sequencing
Perfect phylogeny
url https://doi.org/10.1186/s13015-021-00194-5
work_keys_str_mv AT leahlweber distinguishinglinearandbranchedevolutiongivensinglecelldnasequencingdataoftumors
AT mohammedelkebir distinguishinglinearandbranchedevolutiongivensinglecelldnasequencingdataoftumors
_version_ 1721309542847873024