Wasserstein GAN-Based Small-Sample Augmentation for New-Generation Artificial Intelligence: A Case Study of Cancer-Staging Data in Biology

It is essential to utilize deep-learning algorithms based on big data for the implementation of the new generation of artificial intelligence. Effective utilization of deep learning relies considerably on the number of labeled samples, which restricts the application of deep learning in an environme...

Full description

Bibliographic Details
Main Authors: Yufei Liu, Yuan Zhou, Xin Liu, Fang Dong, Chang Wang, Zihong Wang
Format: Article
Language:English
Published: Elsevier 2019-02-01
Series:Engineering
Online Access:http://www.sciencedirect.com/science/article/pii/S2095809918301127
id doaj-c72c1b297b1447b2a0fa83ceca1ff915
record_format Article
spelling doaj-c72c1b297b1447b2a0fa83ceca1ff9152020-11-24T21:52:05ZengElsevierEngineering2095-80992019-02-0151156163Wasserstein GAN-Based Small-Sample Augmentation for New-Generation Artificial Intelligence: A Case Study of Cancer-Staging Data in BiologyYufei Liu0Yuan Zhou1Xin Liu2Fang Dong3Chang Wang4Zihong Wang5College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China; School of Public Policy and Management, Tsinghua University, Beijing 100084, China; Center for Strategic Studies, Chinese Academy of Engineering, Beijing 100088, ChinaSchool of Public Policy and Management, Tsinghua University, Beijing 100084, China; Corresponding author.College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, ChinaSchool of Public Policy and Management, Tsinghua University, Beijing 100084, ChinaCollege of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, ChinaSchool of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, ChinaIt is essential to utilize deep-learning algorithms based on big data for the implementation of the new generation of artificial intelligence. Effective utilization of deep learning relies considerably on the number of labeled samples, which restricts the application of deep learning in an environment with a small sample size. In this paper, we propose an approach based on a generative adversarial network (GAN) combined with a deep neural network (DNN). First, the original samples were divided into a training set and a test set. The GAN was trained with the training set to generate synthetic sample data, which enlarged the training set. Next, the DNN classifier was trained with the synthetic samples. Finally, the classifier was tested with the test set, and the effectiveness of the approach for multi-classification with a small sample size was validated by the indicators. As an empirical case, the approach was then applied to identify the stages of cancers with a small labeled sample size. The experimental results verified that the proposed approach achieved a greater accuracy than traditional methods. This research was an attempt to transform the classical statistical machine-learning classification method based on original samples into a deep-learning classification method based on data augmentation. The use of this approach will contribute to an expansion of application scenarios for the new generation of artificial intelligence based on deep learning, and to an increase in application effectiveness. This research is also expected to contribute to the comprehensive promotion of new-generation artificial intelligence. Keywords: Artificial intelligence, Generative adversarial network, Deep neural network, Small sample size, Cancerhttp://www.sciencedirect.com/science/article/pii/S2095809918301127
collection DOAJ
language English
format Article
sources DOAJ
author Yufei Liu
Yuan Zhou
Xin Liu
Fang Dong
Chang Wang
Zihong Wang
spellingShingle Yufei Liu
Yuan Zhou
Xin Liu
Fang Dong
Chang Wang
Zihong Wang
Wasserstein GAN-Based Small-Sample Augmentation for New-Generation Artificial Intelligence: A Case Study of Cancer-Staging Data in Biology
Engineering
author_facet Yufei Liu
Yuan Zhou
Xin Liu
Fang Dong
Chang Wang
Zihong Wang
author_sort Yufei Liu
title Wasserstein GAN-Based Small-Sample Augmentation for New-Generation Artificial Intelligence: A Case Study of Cancer-Staging Data in Biology
title_short Wasserstein GAN-Based Small-Sample Augmentation for New-Generation Artificial Intelligence: A Case Study of Cancer-Staging Data in Biology
title_full Wasserstein GAN-Based Small-Sample Augmentation for New-Generation Artificial Intelligence: A Case Study of Cancer-Staging Data in Biology
title_fullStr Wasserstein GAN-Based Small-Sample Augmentation for New-Generation Artificial Intelligence: A Case Study of Cancer-Staging Data in Biology
title_full_unstemmed Wasserstein GAN-Based Small-Sample Augmentation for New-Generation Artificial Intelligence: A Case Study of Cancer-Staging Data in Biology
title_sort wasserstein gan-based small-sample augmentation for new-generation artificial intelligence: a case study of cancer-staging data in biology
publisher Elsevier
series Engineering
issn 2095-8099
publishDate 2019-02-01
description It is essential to utilize deep-learning algorithms based on big data for the implementation of the new generation of artificial intelligence. Effective utilization of deep learning relies considerably on the number of labeled samples, which restricts the application of deep learning in an environment with a small sample size. In this paper, we propose an approach based on a generative adversarial network (GAN) combined with a deep neural network (DNN). First, the original samples were divided into a training set and a test set. The GAN was trained with the training set to generate synthetic sample data, which enlarged the training set. Next, the DNN classifier was trained with the synthetic samples. Finally, the classifier was tested with the test set, and the effectiveness of the approach for multi-classification with a small sample size was validated by the indicators. As an empirical case, the approach was then applied to identify the stages of cancers with a small labeled sample size. The experimental results verified that the proposed approach achieved a greater accuracy than traditional methods. This research was an attempt to transform the classical statistical machine-learning classification method based on original samples into a deep-learning classification method based on data augmentation. The use of this approach will contribute to an expansion of application scenarios for the new generation of artificial intelligence based on deep learning, and to an increase in application effectiveness. This research is also expected to contribute to the comprehensive promotion of new-generation artificial intelligence. Keywords: Artificial intelligence, Generative adversarial network, Deep neural network, Small sample size, Cancer
url http://www.sciencedirect.com/science/article/pii/S2095809918301127
work_keys_str_mv AT yufeiliu wassersteinganbasedsmallsampleaugmentationfornewgenerationartificialintelligenceacasestudyofcancerstagingdatainbiology
AT yuanzhou wassersteinganbasedsmallsampleaugmentationfornewgenerationartificialintelligenceacasestudyofcancerstagingdatainbiology
AT xinliu wassersteinganbasedsmallsampleaugmentationfornewgenerationartificialintelligenceacasestudyofcancerstagingdatainbiology
AT fangdong wassersteinganbasedsmallsampleaugmentationfornewgenerationartificialintelligenceacasestudyofcancerstagingdatainbiology
AT changwang wassersteinganbasedsmallsampleaugmentationfornewgenerationartificialintelligenceacasestudyofcancerstagingdatainbiology
AT zihongwang wassersteinganbasedsmallsampleaugmentationfornewgenerationartificialintelligenceacasestudyofcancerstagingdatainbiology
_version_ 1725876912267460608