A Novel Similarity Measurement and Clustering Framework for Time Series Based on Convolution Neural Networks

In recent years, with the development of machine learning, especially after the rise of deep learning, time series clustering has been proven to effectively provide useful information in cloud computing and big data. However, many modern clustering algorithms are difficult to mine the complex featur...

Full description

Bibliographic Details
Main Authors: Xin Ding, Kuangrong Hao, Xin Cai, Xue-Song Tang, Lei Chen, Haichao Zhang
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9205232/
id doaj-7913d6d4b0f94a3ab435de9949a9f92b
record_format Article
spelling doaj-7913d6d4b0f94a3ab435de9949a9f92b2021-03-30T03:57:21ZengIEEEIEEE Access2169-35362020-01-01817315817316810.1109/ACCESS.2020.30250489205232A Novel Similarity Measurement and Clustering Framework for Time Series Based on Convolution Neural NetworksXin Ding0Kuangrong Hao1https://orcid.org/0000-0001-9672-6161Xin Cai2https://orcid.org/0000-0002-5300-9733Xue-Song Tang3https://orcid.org/0000-0002-7594-2241Lei Chen4Haichao Zhang5https://orcid.org/0000-0002-4168-3640Engineering Research Center of Digitized Textile and Apparel Technology, Ministry of Education, College of Information Sciences and Technology, Donghua University, Shanghai, ChinaEngineering Research Center of Digitized Textile and Apparel Technology, Ministry of Education, College of Information Sciences and Technology, Donghua University, Shanghai, ChinaEngineering Research Center of Digitized Textile and Apparel Technology, Ministry of Education, College of Information Sciences and Technology, Donghua University, Shanghai, ChinaEngineering Research Center of Digitized Textile and Apparel Technology, Ministry of Education, College of Information Sciences and Technology, Donghua University, Shanghai, ChinaEngineering Research Center of Digitized Textile and Apparel Technology, Ministry of Education, College of Information Sciences and Technology, Donghua University, Shanghai, ChinaEngineering Research Center of Digitized Textile and Apparel Technology, Ministry of Education, College of Information Sciences and Technology, Donghua University, Shanghai, ChinaIn recent years, with the development of machine learning, especially after the rise of deep learning, time series clustering has been proven to effectively provide useful information in cloud computing and big data. However, many modern clustering algorithms are difficult to mine the complex features of time series, which is important for further analysis. Convolutional neural network provides powerful feature extraction capabilities and has excellent performance in classification tasks, but it is hard to be applied to clustering. Therefore, a similarity measurement method based on convolutional neural networks is proposed. This algorithm converts the number of output changes of the convolutional neural network in the same direction into the similarity of time series, so that the convolutional neural network can mine unlabeled data features in the clustering process. Especially by preferentially collecting a small amount of high similarity data to create labels, a classification algorithm based on the convolutional neural network can be used to assist clustering. The effectiveness of the proposed algorithm is proved by extensive experiments on the UCR time series datasets, and the experimental results show that its superior performance than other leading methods. Compared with other clustering algorithms based on deep networks, the proposed algorithm can output intermediate variables, and visually explain the principle of the algorithm. The application of financial stock linkage analysis provides an auxiliary mechanism for investment decision-making.https://ieeexplore.ieee.org/document/9205232/Time seriesclusteringconvolutional neural networkssimilarity measurement
collection DOAJ
language English
format Article
sources DOAJ
author Xin Ding
Kuangrong Hao
Xin Cai
Xue-Song Tang
Lei Chen
Haichao Zhang
spellingShingle Xin Ding
Kuangrong Hao
Xin Cai
Xue-Song Tang
Lei Chen
Haichao Zhang
A Novel Similarity Measurement and Clustering Framework for Time Series Based on Convolution Neural Networks
IEEE Access
Time series
clustering
convolutional neural networks
similarity measurement
author_facet Xin Ding
Kuangrong Hao
Xin Cai
Xue-Song Tang
Lei Chen
Haichao Zhang
author_sort Xin Ding
title A Novel Similarity Measurement and Clustering Framework for Time Series Based on Convolution Neural Networks
title_short A Novel Similarity Measurement and Clustering Framework for Time Series Based on Convolution Neural Networks
title_full A Novel Similarity Measurement and Clustering Framework for Time Series Based on Convolution Neural Networks
title_fullStr A Novel Similarity Measurement and Clustering Framework for Time Series Based on Convolution Neural Networks
title_full_unstemmed A Novel Similarity Measurement and Clustering Framework for Time Series Based on Convolution Neural Networks
title_sort novel similarity measurement and clustering framework for time series based on convolution neural networks
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description In recent years, with the development of machine learning, especially after the rise of deep learning, time series clustering has been proven to effectively provide useful information in cloud computing and big data. However, many modern clustering algorithms are difficult to mine the complex features of time series, which is important for further analysis. Convolutional neural network provides powerful feature extraction capabilities and has excellent performance in classification tasks, but it is hard to be applied to clustering. Therefore, a similarity measurement method based on convolutional neural networks is proposed. This algorithm converts the number of output changes of the convolutional neural network in the same direction into the similarity of time series, so that the convolutional neural network can mine unlabeled data features in the clustering process. Especially by preferentially collecting a small amount of high similarity data to create labels, a classification algorithm based on the convolutional neural network can be used to assist clustering. The effectiveness of the proposed algorithm is proved by extensive experiments on the UCR time series datasets, and the experimental results show that its superior performance than other leading methods. Compared with other clustering algorithms based on deep networks, the proposed algorithm can output intermediate variables, and visually explain the principle of the algorithm. The application of financial stock linkage analysis provides an auxiliary mechanism for investment decision-making.
topic Time series
clustering
convolutional neural networks
similarity measurement
url https://ieeexplore.ieee.org/document/9205232/
work_keys_str_mv AT xinding anovelsimilaritymeasurementandclusteringframeworkfortimeseriesbasedonconvolutionneuralnetworks
AT kuangronghao anovelsimilaritymeasurementandclusteringframeworkfortimeseriesbasedonconvolutionneuralnetworks
AT xincai anovelsimilaritymeasurementandclusteringframeworkfortimeseriesbasedonconvolutionneuralnetworks
AT xuesongtang anovelsimilaritymeasurementandclusteringframeworkfortimeseriesbasedonconvolutionneuralnetworks
AT leichen anovelsimilaritymeasurementandclusteringframeworkfortimeseriesbasedonconvolutionneuralnetworks
AT haichaozhang anovelsimilaritymeasurementandclusteringframeworkfortimeseriesbasedonconvolutionneuralnetworks
AT xinding novelsimilaritymeasurementandclusteringframeworkfortimeseriesbasedonconvolutionneuralnetworks
AT kuangronghao novelsimilaritymeasurementandclusteringframeworkfortimeseriesbasedonconvolutionneuralnetworks
AT xincai novelsimilaritymeasurementandclusteringframeworkfortimeseriesbasedonconvolutionneuralnetworks
AT xuesongtang novelsimilaritymeasurementandclusteringframeworkfortimeseriesbasedonconvolutionneuralnetworks
AT leichen novelsimilaritymeasurementandclusteringframeworkfortimeseriesbasedonconvolutionneuralnetworks
AT haichaozhang novelsimilaritymeasurementandclusteringframeworkfortimeseriesbasedonconvolutionneuralnetworks
_version_ 1724182524960178176