Applying Multivariate Adaptive Splines to Identify Genes With Expressions Varying After Diagnosis in Microarray Experiments

Purpose: To analyze a microarray experiment to identify the genes with expressions varying after the diagnosis of breast cancer. Methods: A total of 44 928 probe sets in an Affymetrix microarray data publicly available on Gene Expression Omnibus from 249 patients with breast cancer were analyzed by...

Full description

Bibliographic Details
Main Authors: Fenghai Duan, Ye Xu
Format: Article
Language:English
Published: SAGE Publishing 2017-05-01
Series:Cancer Informatics
Online Access:https://doi.org/10.1177/1176935117705381
id doaj-0fde6076cdc34c79b074c4be7a94e13f
record_format Article
spelling doaj-0fde6076cdc34c79b074c4be7a94e13f2020-11-25T02:55:16ZengSAGE PublishingCancer Informatics1176-93512017-05-011610.1177/117693511770538110.1177_1176935117705381Applying Multivariate Adaptive Splines to Identify Genes With Expressions Varying After Diagnosis in Microarray ExperimentsFenghai Duan0Ye Xu1Department of Biostatistics and Center for Statistical Sciences, School of Public Health, Brown University, Providence, RI, USAStubHub, San Francisco, CA, USAPurpose: To analyze a microarray experiment to identify the genes with expressions varying after the diagnosis of breast cancer. Methods: A total of 44 928 probe sets in an Affymetrix microarray data publicly available on Gene Expression Omnibus from 249 patients with breast cancer were analyzed by the nonparametric multivariate adaptive splines. Then, the identified genes with turning points were grouped by K-means clustering, and their network relationship was subsequently analyzed by the Ingenuity Pathway Analysis. Results: In total, 1640 probe sets (genes) were reliably identified to have turning points along with the age at diagnosis in their expression profiling, of which 927 expressed lower after turning points and 713 expressed higher after the turning points. K-means clustered them into 3 groups with turning points centering at 54, 62.5, and 72, respectively. The pathway analysis showed that the identified genes were actively involved in various cancer-related functions or networks. Conclusions: In this article, we applied the nonparametric multivariate adaptive splines method to a publicly available gene expression data and successfully identified genes with expressions varying before and after breast cancer diagnosis.https://doi.org/10.1177/1176935117705381
collection DOAJ
language English
format Article
sources DOAJ
author Fenghai Duan
Ye Xu
spellingShingle Fenghai Duan
Ye Xu
Applying Multivariate Adaptive Splines to Identify Genes With Expressions Varying After Diagnosis in Microarray Experiments
Cancer Informatics
author_facet Fenghai Duan
Ye Xu
author_sort Fenghai Duan
title Applying Multivariate Adaptive Splines to Identify Genes With Expressions Varying After Diagnosis in Microarray Experiments
title_short Applying Multivariate Adaptive Splines to Identify Genes With Expressions Varying After Diagnosis in Microarray Experiments
title_full Applying Multivariate Adaptive Splines to Identify Genes With Expressions Varying After Diagnosis in Microarray Experiments
title_fullStr Applying Multivariate Adaptive Splines to Identify Genes With Expressions Varying After Diagnosis in Microarray Experiments
title_full_unstemmed Applying Multivariate Adaptive Splines to Identify Genes With Expressions Varying After Diagnosis in Microarray Experiments
title_sort applying multivariate adaptive splines to identify genes with expressions varying after diagnosis in microarray experiments
publisher SAGE Publishing
series Cancer Informatics
issn 1176-9351
publishDate 2017-05-01
description Purpose: To analyze a microarray experiment to identify the genes with expressions varying after the diagnosis of breast cancer. Methods: A total of 44 928 probe sets in an Affymetrix microarray data publicly available on Gene Expression Omnibus from 249 patients with breast cancer were analyzed by the nonparametric multivariate adaptive splines. Then, the identified genes with turning points were grouped by K-means clustering, and their network relationship was subsequently analyzed by the Ingenuity Pathway Analysis. Results: In total, 1640 probe sets (genes) were reliably identified to have turning points along with the age at diagnosis in their expression profiling, of which 927 expressed lower after turning points and 713 expressed higher after the turning points. K-means clustered them into 3 groups with turning points centering at 54, 62.5, and 72, respectively. The pathway analysis showed that the identified genes were actively involved in various cancer-related functions or networks. Conclusions: In this article, we applied the nonparametric multivariate adaptive splines method to a publicly available gene expression data and successfully identified genes with expressions varying before and after breast cancer diagnosis.
url https://doi.org/10.1177/1176935117705381
work_keys_str_mv AT fenghaiduan applyingmultivariateadaptivesplinestoidentifygeneswithexpressionsvaryingafterdiagnosisinmicroarrayexperiments
AT yexu applyingmultivariateadaptivesplinestoidentifygeneswithexpressionsvaryingafterdiagnosisinmicroarrayexperiments
_version_ 1724716978268012544