An Improved Stacked Autoencoder for Metabolomic Data Classification

Naru3 (NR) is a traditional Mongolian medicine with high clinical efficacy and low incidence of side effects. Metabolomics is an approach that can facilitate the development of traditional drugs. However, metabolomic data have a high throughput, sparse, high-dimensional, and small sample nature, and...

Full description

Bibliographic Details
Main Authors: Xiaojing Fan, Xiye Wang, Mingyang Jiang, Zhili Pei, Shicheng Qiao
Format: Article
Language:English
Published: Hindawi Limited 2021-01-01
Series:Computational Intelligence and Neuroscience
Online Access:http://dx.doi.org/10.1155/2021/1051172
id doaj-23d2d77d71a14b759295fad539c83004
record_format Article
spelling doaj-23d2d77d71a14b759295fad539c830042021-08-30T00:01:25ZengHindawi LimitedComputational Intelligence and Neuroscience1687-52732021-01-01202110.1155/2021/1051172An Improved Stacked Autoencoder for Metabolomic Data ClassificationXiaojing Fan0Xiye Wang1Mingyang Jiang2Zhili Pei3Shicheng Qiao4College of EngineeringCollege of Chemistry and Chemical EngineeringCollege of Computer Science and TechnologyCollege of Computer Science and TechnologyCollege of Computer Science and TechnologyNaru3 (NR) is a traditional Mongolian medicine with high clinical efficacy and low incidence of side effects. Metabolomics is an approach that can facilitate the development of traditional drugs. However, metabolomic data have a high throughput, sparse, high-dimensional, and small sample nature, and their classification is challenging. Although deep learning methods have a wide range of applications, deep learning-based metabolomic studies have not been widely performed. We aimed to develop an improved stacked autoencoder (SAE) for metabolomic data classification. We established an NR-treated rheumatoid arthritis (RA) mouse model and classified the obtained metabolomic data using the Hessian-free SAE (HF-SAE) algorithm. During training, the unlabeled data were used for pretraining, and the labeled data were used for fine-tuning based on the HF algorithm for gradient descent optimization. The hybrid algorithm successfully classified the data. The results were compared with those of the support vector machine (SVM), k-nearest neighbor (KNN), and gradient descent SAE (GD-SAE) algorithms. A five-fold cross-validation was used to complete the classification experiment. In each fine-tuning process, the mean square error (MSE) and misclassification rates of the training and test data were recorded. We successfully established an NR animal model and an improved SAE for metabolomic data classification.http://dx.doi.org/10.1155/2021/1051172
collection DOAJ
language English
format Article
sources DOAJ
author Xiaojing Fan
Xiye Wang
Mingyang Jiang
Zhili Pei
Shicheng Qiao
spellingShingle Xiaojing Fan
Xiye Wang
Mingyang Jiang
Zhili Pei
Shicheng Qiao
An Improved Stacked Autoencoder for Metabolomic Data Classification
Computational Intelligence and Neuroscience
author_facet Xiaojing Fan
Xiye Wang
Mingyang Jiang
Zhili Pei
Shicheng Qiao
author_sort Xiaojing Fan
title An Improved Stacked Autoencoder for Metabolomic Data Classification
title_short An Improved Stacked Autoencoder for Metabolomic Data Classification
title_full An Improved Stacked Autoencoder for Metabolomic Data Classification
title_fullStr An Improved Stacked Autoencoder for Metabolomic Data Classification
title_full_unstemmed An Improved Stacked Autoencoder for Metabolomic Data Classification
title_sort improved stacked autoencoder for metabolomic data classification
publisher Hindawi Limited
series Computational Intelligence and Neuroscience
issn 1687-5273
publishDate 2021-01-01
description Naru3 (NR) is a traditional Mongolian medicine with high clinical efficacy and low incidence of side effects. Metabolomics is an approach that can facilitate the development of traditional drugs. However, metabolomic data have a high throughput, sparse, high-dimensional, and small sample nature, and their classification is challenging. Although deep learning methods have a wide range of applications, deep learning-based metabolomic studies have not been widely performed. We aimed to develop an improved stacked autoencoder (SAE) for metabolomic data classification. We established an NR-treated rheumatoid arthritis (RA) mouse model and classified the obtained metabolomic data using the Hessian-free SAE (HF-SAE) algorithm. During training, the unlabeled data were used for pretraining, and the labeled data were used for fine-tuning based on the HF algorithm for gradient descent optimization. The hybrid algorithm successfully classified the data. The results were compared with those of the support vector machine (SVM), k-nearest neighbor (KNN), and gradient descent SAE (GD-SAE) algorithms. A five-fold cross-validation was used to complete the classification experiment. In each fine-tuning process, the mean square error (MSE) and misclassification rates of the training and test data were recorded. We successfully established an NR animal model and an improved SAE for metabolomic data classification.
url http://dx.doi.org/10.1155/2021/1051172
work_keys_str_mv AT xiaojingfan animprovedstackedautoencoderformetabolomicdataclassification
AT xiyewang animprovedstackedautoencoderformetabolomicdataclassification
AT mingyangjiang animprovedstackedautoencoderformetabolomicdataclassification
AT zhilipei animprovedstackedautoencoderformetabolomicdataclassification
AT shichengqiao animprovedstackedautoencoderformetabolomicdataclassification
AT xiaojingfan improvedstackedautoencoderformetabolomicdataclassification
AT xiyewang improvedstackedautoencoderformetabolomicdataclassification
AT mingyangjiang improvedstackedautoencoderformetabolomicdataclassification
AT zhilipei improvedstackedautoencoderformetabolomicdataclassification
AT shichengqiao improvedstackedautoencoderformetabolomicdataclassification
_version_ 1721185948855697408