Learning Signaling Network Structures with Sparsely Distributed Data

Flow cytometric measurement of signaling protein abundances has proved particularly useful for elucidation of signaling pathway structure. The single cell nature of the data ensures a very large dataset size, providing a statistically robust dataset for structure learning. Moreover, the approach is...

Full description

Bibliographic Details
Main Authors: Sachs, Karen (Author), Itani, Solomon (Contributor), Carlisle, Jennifer (Contributor), Nolan, Garry P. (Author), Pe'er, Dana (Author), Lauffenburger, Douglas A. (Contributor)
Other Authors: Massachusetts Institute of Technology. Department of Biological Engineering (Contributor), Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor)
Format: Article
Language:English
Published: Mary Ann Liebert, Inc., 2011-04-19T14:53:34Z.
Subjects:
Online Access:Get fulltext
Description
Summary:Flow cytometric measurement of signaling protein abundances has proved particularly useful for elucidation of signaling pathway structure. The single cell nature of the data ensures a very large dataset size, providing a statistically robust dataset for structure learning. Moreover, the approach is easily scaled to many conditions in high throughput. However, the technology suffers from a dimensionality constraint: at the cutting edge, only about 12 protein species can be measured per cell, far from sufficient for most signaling pathways. Because the structure learning algorithm (in practice) requires that all variables be measured together simultaneously, this restricts structure learning to the number of variables that constitute the flow cytometer's upper dimensionality limit. To address this problem, we present here an algorithm that enables structure learning for sparsely distributed data, allowing structure learning beyond the measurement technology's upper dimensionality limit for simultaneously measurable variables. The algorithm assesses pairwise (or n-wise) dependencies, constructs "Markov neighborhoods" for each variable based on these dependencies, measures each variable in the context of its neighborhood, and performs structure learning using a constrained search.
Leukemia & Lymphoma Society of America
National Institutes of Health (U.S.) (grant AI06584)
National Institutes of Health (U.S.) (grant GM68762)
Burroughs Wellcome Fund
National Institutes of Health (U.S.) (grant N01-HV-28183)
National Institutes of Health (U.S.) (U19 AI057229)
National Institutes of Health (U.S.) (2P01 AI36535)
National Institutes of Health (U.S.) (U19 AI062623)
National Institutes of Health (U.S.) (R01-AI065824)
National Institutes of Health (U.S.) (2P01 CA034233-22A1)
National Institutes of Health (U.S.) (HHSN272200700038C)
National Institutes of Health (U.S.) (NCI grant U54 RFA-CA-05-024)
National Institutes of Health (U.S.) (LLS grant 7017-6)