The Optimally Designed Variational Autoencoder Networks for Clustering and Recovery of Incomplete Multimedia Data

Clustering analysis of massive data in wireless multimedia sensor networks (WMSN) has become a hot topic. However, most data clustering algorithms have difficulty in obtaining latent nonlinear correlations of data features, resulting in a low clustering accuracy. In addition, it is difficult to extr...

Full description

Bibliographic Details
Main Authors: Xiulan Yu, Hongyu Li, Zufan Zhang, Chenquan Gan
Format: Article
Language:English
Published: MDPI AG 2019-02-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/19/4/809
id doaj-15d15139d63649a5a7d0ac8fecc64292
record_format Article
spelling doaj-15d15139d63649a5a7d0ac8fecc642922020-11-25T00:31:05ZengMDPI AGSensors1424-82202019-02-0119480910.3390/s19040809s19040809The Optimally Designed Variational Autoencoder Networks for Clustering and Recovery of Incomplete Multimedia DataXiulan Yu0Hongyu Li1Zufan Zhang2Chenquan Gan3School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, ChinaSchool of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, ChinaSchool of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, ChinaSchool of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, ChinaClustering analysis of massive data in wireless multimedia sensor networks (WMSN) has become a hot topic. However, most data clustering algorithms have difficulty in obtaining latent nonlinear correlations of data features, resulting in a low clustering accuracy. In addition, it is difficult to extract features from missing or corrupted data, so incomplete data are widely used in practical work. In this paper, the optimally designed variational autoencoder networks is proposed for extracting features of incomplete data and using high-order fuzzy c-means algorithm (HOFCM) to improve cluster performance of incomplete data. Specifically, the feature extraction model is improved by using variational autoencoder to learn the feature of incomplete data. To capture nonlinear correlations in different heterogeneous data patterns, tensor based fuzzy c-means algorithm is used to cluster low-dimensional features. The tensor distance is used as the distance measure to capture the unknown correlations of data as much as possible. Finally, in the case that the clustering results are obtained, the missing data can be restored by using the low-dimensional features. Experiments on real datasets show that the proposed algorithm not only can improve the clustering performance of incomplete data effectively, but also can fill in missing features and get better data reconstruction results.https://www.mdpi.com/1424-8220/19/4/809feature learningincomplete multimedia datafuzzy c-meansvariational autoencoder
collection DOAJ
language English
format Article
sources DOAJ
author Xiulan Yu
Hongyu Li
Zufan Zhang
Chenquan Gan
spellingShingle Xiulan Yu
Hongyu Li
Zufan Zhang
Chenquan Gan
The Optimally Designed Variational Autoencoder Networks for Clustering and Recovery of Incomplete Multimedia Data
Sensors
feature learning
incomplete multimedia data
fuzzy c-means
variational autoencoder
author_facet Xiulan Yu
Hongyu Li
Zufan Zhang
Chenquan Gan
author_sort Xiulan Yu
title The Optimally Designed Variational Autoencoder Networks for Clustering and Recovery of Incomplete Multimedia Data
title_short The Optimally Designed Variational Autoencoder Networks for Clustering and Recovery of Incomplete Multimedia Data
title_full The Optimally Designed Variational Autoencoder Networks for Clustering and Recovery of Incomplete Multimedia Data
title_fullStr The Optimally Designed Variational Autoencoder Networks for Clustering and Recovery of Incomplete Multimedia Data
title_full_unstemmed The Optimally Designed Variational Autoencoder Networks for Clustering and Recovery of Incomplete Multimedia Data
title_sort optimally designed variational autoencoder networks for clustering and recovery of incomplete multimedia data
publisher MDPI AG
series Sensors
issn 1424-8220
publishDate 2019-02-01
description Clustering analysis of massive data in wireless multimedia sensor networks (WMSN) has become a hot topic. However, most data clustering algorithms have difficulty in obtaining latent nonlinear correlations of data features, resulting in a low clustering accuracy. In addition, it is difficult to extract features from missing or corrupted data, so incomplete data are widely used in practical work. In this paper, the optimally designed variational autoencoder networks is proposed for extracting features of incomplete data and using high-order fuzzy c-means algorithm (HOFCM) to improve cluster performance of incomplete data. Specifically, the feature extraction model is improved by using variational autoencoder to learn the feature of incomplete data. To capture nonlinear correlations in different heterogeneous data patterns, tensor based fuzzy c-means algorithm is used to cluster low-dimensional features. The tensor distance is used as the distance measure to capture the unknown correlations of data as much as possible. Finally, in the case that the clustering results are obtained, the missing data can be restored by using the low-dimensional features. Experiments on real datasets show that the proposed algorithm not only can improve the clustering performance of incomplete data effectively, but also can fill in missing features and get better data reconstruction results.
topic feature learning
incomplete multimedia data
fuzzy c-means
variational autoencoder
url https://www.mdpi.com/1424-8220/19/4/809
work_keys_str_mv AT xiulanyu theoptimallydesignedvariationalautoencodernetworksforclusteringandrecoveryofincompletemultimediadata
AT hongyuli theoptimallydesignedvariationalautoencodernetworksforclusteringandrecoveryofincompletemultimediadata
AT zufanzhang theoptimallydesignedvariationalautoencodernetworksforclusteringandrecoveryofincompletemultimediadata
AT chenquangan theoptimallydesignedvariationalautoencodernetworksforclusteringandrecoveryofincompletemultimediadata
AT xiulanyu optimallydesignedvariationalautoencodernetworksforclusteringandrecoveryofincompletemultimediadata
AT hongyuli optimallydesignedvariationalautoencodernetworksforclusteringandrecoveryofincompletemultimediadata
AT zufanzhang optimallydesignedvariationalautoencodernetworksforclusteringandrecoveryofincompletemultimediadata
AT chenquangan optimallydesignedvariationalautoencodernetworksforclusteringandrecoveryofincompletemultimediadata
_version_ 1725323924553596928