An Improved Stacked Autoencoder for Metabolomic Data Classification
Naru3 (NR) is a traditional Mongolian medicine with high clinical efficacy and low incidence of side effects. Metabolomics is an approach that can facilitate the development of traditional drugs. However, metabolomic data have a high throughput, sparse, high-dimensional, and small sample nature, and...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2021-01-01
|
Series: | Computational Intelligence and Neuroscience |
Online Access: | http://dx.doi.org/10.1155/2021/1051172 |
id |
doaj-23d2d77d71a14b759295fad539c83004 |
---|---|
record_format |
Article |
spelling |
doaj-23d2d77d71a14b759295fad539c830042021-08-30T00:01:25ZengHindawi LimitedComputational Intelligence and Neuroscience1687-52732021-01-01202110.1155/2021/1051172An Improved Stacked Autoencoder for Metabolomic Data ClassificationXiaojing Fan0Xiye Wang1Mingyang Jiang2Zhili Pei3Shicheng Qiao4College of EngineeringCollege of Chemistry and Chemical EngineeringCollege of Computer Science and TechnologyCollege of Computer Science and TechnologyCollege of Computer Science and TechnologyNaru3 (NR) is a traditional Mongolian medicine with high clinical efficacy and low incidence of side effects. Metabolomics is an approach that can facilitate the development of traditional drugs. However, metabolomic data have a high throughput, sparse, high-dimensional, and small sample nature, and their classification is challenging. Although deep learning methods have a wide range of applications, deep learning-based metabolomic studies have not been widely performed. We aimed to develop an improved stacked autoencoder (SAE) for metabolomic data classification. We established an NR-treated rheumatoid arthritis (RA) mouse model and classified the obtained metabolomic data using the Hessian-free SAE (HF-SAE) algorithm. During training, the unlabeled data were used for pretraining, and the labeled data were used for fine-tuning based on the HF algorithm for gradient descent optimization. The hybrid algorithm successfully classified the data. The results were compared with those of the support vector machine (SVM), k-nearest neighbor (KNN), and gradient descent SAE (GD-SAE) algorithms. A five-fold cross-validation was used to complete the classification experiment. In each fine-tuning process, the mean square error (MSE) and misclassification rates of the training and test data were recorded. We successfully established an NR animal model and an improved SAE for metabolomic data classification.http://dx.doi.org/10.1155/2021/1051172 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xiaojing Fan Xiye Wang Mingyang Jiang Zhili Pei Shicheng Qiao |
spellingShingle |
Xiaojing Fan Xiye Wang Mingyang Jiang Zhili Pei Shicheng Qiao An Improved Stacked Autoencoder for Metabolomic Data Classification Computational Intelligence and Neuroscience |
author_facet |
Xiaojing Fan Xiye Wang Mingyang Jiang Zhili Pei Shicheng Qiao |
author_sort |
Xiaojing Fan |
title |
An Improved Stacked Autoencoder for Metabolomic Data Classification |
title_short |
An Improved Stacked Autoencoder for Metabolomic Data Classification |
title_full |
An Improved Stacked Autoencoder for Metabolomic Data Classification |
title_fullStr |
An Improved Stacked Autoencoder for Metabolomic Data Classification |
title_full_unstemmed |
An Improved Stacked Autoencoder for Metabolomic Data Classification |
title_sort |
improved stacked autoencoder for metabolomic data classification |
publisher |
Hindawi Limited |
series |
Computational Intelligence and Neuroscience |
issn |
1687-5273 |
publishDate |
2021-01-01 |
description |
Naru3 (NR) is a traditional Mongolian medicine with high clinical efficacy and low incidence of side effects. Metabolomics is an approach that can facilitate the development of traditional drugs. However, metabolomic data have a high throughput, sparse, high-dimensional, and small sample nature, and their classification is challenging. Although deep learning methods have a wide range of applications, deep learning-based metabolomic studies have not been widely performed. We aimed to develop an improved stacked autoencoder (SAE) for metabolomic data classification. We established an NR-treated rheumatoid arthritis (RA) mouse model and classified the obtained metabolomic data using the Hessian-free SAE (HF-SAE) algorithm. During training, the unlabeled data were used for pretraining, and the labeled data were used for fine-tuning based on the HF algorithm for gradient descent optimization. The hybrid algorithm successfully classified the data. The results were compared with those of the support vector machine (SVM), k-nearest neighbor (KNN), and gradient descent SAE (GD-SAE) algorithms. A five-fold cross-validation was used to complete the classification experiment. In each fine-tuning process, the mean square error (MSE) and misclassification rates of the training and test data were recorded. We successfully established an NR animal model and an improved SAE for metabolomic data classification. |
url |
http://dx.doi.org/10.1155/2021/1051172 |
work_keys_str_mv |
AT xiaojingfan animprovedstackedautoencoderformetabolomicdataclassification AT xiyewang animprovedstackedautoencoderformetabolomicdataclassification AT mingyangjiang animprovedstackedautoencoderformetabolomicdataclassification AT zhilipei animprovedstackedautoencoderformetabolomicdataclassification AT shichengqiao animprovedstackedautoencoderformetabolomicdataclassification AT xiaojingfan improvedstackedautoencoderformetabolomicdataclassification AT xiyewang improvedstackedautoencoderformetabolomicdataclassification AT mingyangjiang improvedstackedautoencoderformetabolomicdataclassification AT zhilipei improvedstackedautoencoderformetabolomicdataclassification AT shichengqiao improvedstackedautoencoderformetabolomicdataclassification |
_version_ |
1721185948855697408 |