Summary: | An embryo develops from a single-celled zygote, which produces a multi-cellular organism by mitosis. Due to the complication of processes and mechanisms, research on embryo cell clusters in different early embryo developmental stages with significant phenotypic differences is still lacking. In this work, we identified some gene characters and expression rules to classify these individual cells using several advanced computational methods. The single cell expression profiles of embryo cells were analyzed by the Monte Carlo feature selection (MCFS) method, resulting in a feature list. Then, the incremental feature selection (IFS) method, incorporating support vector machine (SVM), applied on such list to extract key gene characters. These gene characters include KHDC1, HMGN1, DCP, GDF9, RNF11, DNMT3L, and CDX1. Furthermore, a rule learning algorithm, Repeated Incremental Pruning to Produce Error Reduction (RIPPER), was applied to the informative features yielded by MCFS method, producing a group of classification rules. These rules can clearly uncover different expression patterns on cells in different stages. This study provided a group of effective gene signatures and rules for embryo cell subtyping and presented an applicable computational tool to further dig into the regulatory mechanisms of embryo development.
|