Automated Classification of Massive Spectra Based on Enhanced Multi-Scale Coded Convolutional Neural Network

The Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) has produced massive medium-resolution spectra. Data mining for special and rare stars in massive LAMOST spectra is of great significance. Feature extraction plays an important role in the process of automatic spectra classificat...

Full description

Bibliographic Details
Main Authors: Bin Jiang, Donglai Wei, Jiazhen Liu, Shuting Wang, Liyun Cheng, Zihao Wang, Meixia Qu
Format: Article
Language:English
Published: MDPI AG 2020-04-01
Series:Universe
Subjects:
Online Access:https://www.mdpi.com/2218-1997/6/4/60
id doaj-80cf83eac38340cb9a5d77a93112881f
record_format Article
spelling doaj-80cf83eac38340cb9a5d77a93112881f2020-11-25T02:24:42ZengMDPI AGUniverse2218-19972020-04-016606010.3390/universe6040060Automated Classification of Massive Spectra Based on Enhanced Multi-Scale Coded Convolutional Neural NetworkBin Jiang0Donglai Wei1Jiazhen Liu2Shuting Wang3Liyun Cheng4Zihao Wang5Meixia Qu6School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, ChinaSchool of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, ChinaSchool of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, ChinaSchool of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, ChinaSchool of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, ChinaSchool of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, ChinaSchool of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, ChinaThe Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) has produced massive medium-resolution spectra. Data mining for special and rare stars in massive LAMOST spectra is of great significance. Feature extraction plays an important role in the process of automatic spectra classification. The proper classification network can extract most of the common spectral features with minimum noise and individual features. Such a network has better generalization capabilities and can extract sufficient features for classification. A variety of classification networks of one dimension and two dimensions are both designed and implemented systematically in this paper to verify whether spectra is easier to deal with in a 2D situation. The experimental results show that the fully connected neural network cannot extract enough features. Although convolutional neural network (CNN) with a strong feature extraction capability can quickly achieve satisfactory results on the training set, there is a tendency for overfitting. Signal-to-noise ratios also have effects on the network. To investigate the problems above, various techniques are tested and the enhanced multi-scale coded convolutional neural network (EMCCNN) is proposed and implemented, which can perform spectral denoising and feature extraction at different scales in a more efficient manner. In a specified search, eight known and one possible cataclysmic variables (CVs) in LAMOST MRS are identified by EMCCNN including four CVs, one dwarf nova and three novae. The result supplements the spectra of CVs. Furthermore, these spectra are the first medium-resolution spectra of CVs. The EMCCNN model can be easily extended to search for other rare stellar spectra.https://www.mdpi.com/2218-1997/6/4/60data miningspectral classificationconvolutional neural networkcataclysmic variables
collection DOAJ
language English
format Article
sources DOAJ
author Bin Jiang
Donglai Wei
Jiazhen Liu
Shuting Wang
Liyun Cheng
Zihao Wang
Meixia Qu
spellingShingle Bin Jiang
Donglai Wei
Jiazhen Liu
Shuting Wang
Liyun Cheng
Zihao Wang
Meixia Qu
Automated Classification of Massive Spectra Based on Enhanced Multi-Scale Coded Convolutional Neural Network
Universe
data mining
spectral classification
convolutional neural network
cataclysmic variables
author_facet Bin Jiang
Donglai Wei
Jiazhen Liu
Shuting Wang
Liyun Cheng
Zihao Wang
Meixia Qu
author_sort Bin Jiang
title Automated Classification of Massive Spectra Based on Enhanced Multi-Scale Coded Convolutional Neural Network
title_short Automated Classification of Massive Spectra Based on Enhanced Multi-Scale Coded Convolutional Neural Network
title_full Automated Classification of Massive Spectra Based on Enhanced Multi-Scale Coded Convolutional Neural Network
title_fullStr Automated Classification of Massive Spectra Based on Enhanced Multi-Scale Coded Convolutional Neural Network
title_full_unstemmed Automated Classification of Massive Spectra Based on Enhanced Multi-Scale Coded Convolutional Neural Network
title_sort automated classification of massive spectra based on enhanced multi-scale coded convolutional neural network
publisher MDPI AG
series Universe
issn 2218-1997
publishDate 2020-04-01
description The Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) has produced massive medium-resolution spectra. Data mining for special and rare stars in massive LAMOST spectra is of great significance. Feature extraction plays an important role in the process of automatic spectra classification. The proper classification network can extract most of the common spectral features with minimum noise and individual features. Such a network has better generalization capabilities and can extract sufficient features for classification. A variety of classification networks of one dimension and two dimensions are both designed and implemented systematically in this paper to verify whether spectra is easier to deal with in a 2D situation. The experimental results show that the fully connected neural network cannot extract enough features. Although convolutional neural network (CNN) with a strong feature extraction capability can quickly achieve satisfactory results on the training set, there is a tendency for overfitting. Signal-to-noise ratios also have effects on the network. To investigate the problems above, various techniques are tested and the enhanced multi-scale coded convolutional neural network (EMCCNN) is proposed and implemented, which can perform spectral denoising and feature extraction at different scales in a more efficient manner. In a specified search, eight known and one possible cataclysmic variables (CVs) in LAMOST MRS are identified by EMCCNN including four CVs, one dwarf nova and three novae. The result supplements the spectra of CVs. Furthermore, these spectra are the first medium-resolution spectra of CVs. The EMCCNN model can be easily extended to search for other rare stellar spectra.
topic data mining
spectral classification
convolutional neural network
cataclysmic variables
url https://www.mdpi.com/2218-1997/6/4/60
work_keys_str_mv AT binjiang automatedclassificationofmassivespectrabasedonenhancedmultiscalecodedconvolutionalneuralnetwork
AT donglaiwei automatedclassificationofmassivespectrabasedonenhancedmultiscalecodedconvolutionalneuralnetwork
AT jiazhenliu automatedclassificationofmassivespectrabasedonenhancedmultiscalecodedconvolutionalneuralnetwork
AT shutingwang automatedclassificationofmassivespectrabasedonenhancedmultiscalecodedconvolutionalneuralnetwork
AT liyuncheng automatedclassificationofmassivespectrabasedonenhancedmultiscalecodedconvolutionalneuralnetwork
AT zihaowang automatedclassificationofmassivespectrabasedonenhancedmultiscalecodedconvolutionalneuralnetwork
AT meixiaqu automatedclassificationofmassivespectrabasedonenhancedmultiscalecodedconvolutionalneuralnetwork
_version_ 1724853950920785920