Adaptive Graph Regularized Low–Rank Matrix Factorization With Noise and Outliers for Clustering

Clustering, which is a commonly used tool, has been applied in machine learning, data mining and so on, and has received extensive research. However, there are usually noise and outliers in the data, which will bring about significant errors in the clustering results. In this paper, a robust cluster...

Full description

Bibliographic Details
Main Authors: Min Zhao, Jinglei Liu
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9201009/
id doaj-ac8e3d40b83e4632bf263f0510df84cc
record_format Article
spelling doaj-ac8e3d40b83e4632bf263f0510df84cc2021-03-30T03:58:49ZengIEEEIEEE Access2169-35362020-01-01817185117186310.1109/ACCESS.2020.30250969201009Adaptive Graph Regularized Low&#x2013;Rank Matrix Factorization With Noise and Outliers for ClusteringMin Zhao0https://orcid.org/0000-0001-7505-0721Jinglei Liu1School of Computer Science and Control Engineering, Yantai University, Yantai, ChinaSchool of Computer Science and Control Engineering, Yantai University, Yantai, ChinaClustering, which is a commonly used tool, has been applied in machine learning, data mining and so on, and has received extensive research. However, there are usually noise and outliers in the data, which will bring about significant errors in the clustering results. In this paper, a robust clustering model with adaptive graph regularization (RCAG) is proposed, on which, sparse error matrix is introduced to express sparse noise, such as impulse noise, dead line, stripes, and &#x2113;<sub>1</sub> norm is introduced to alleviate the sparse noise. In addition, the &#x2113;<sub>2,1</sub> norm is also proposed mitigating the effects of outliers, and it has rotation invariance property. Therefore, our RCAG is insensitive to data noise and outliers. More importantly, the adaptive graph regularization is introduced into the RCAG to improve the clustering performance. Aiming at the optimization objective, we propose an iterative updating algorithm, named the Augmented Lagrangian Method (ALM), to update each optimization variable respectively. The convergence and time complexity of RCAG is also proved in theory. Finally, experimental results on fourteen datasets of four application scenarios, such as face image, handwriting recognition and UCI, elaborate the superiority of proposed method over seven existing classical clustering methods. The experimental results demonstrate that our approach achieves better clustering performance in ACC and Purity, which is a little less impressive in other ways.https://ieeexplore.ieee.org/document/9201009/Adaptive graph regularizationclusteringℓ₂,₁noise and outliersaugmented Lagrangian method
collection DOAJ
language English
format Article
sources DOAJ
author Min Zhao
Jinglei Liu
spellingShingle Min Zhao
Jinglei Liu
Adaptive Graph Regularized Low&#x2013;Rank Matrix Factorization With Noise and Outliers for Clustering
IEEE Access
Adaptive graph regularization
clustering
ℓ₂,₁
noise and outliers
augmented Lagrangian method
author_facet Min Zhao
Jinglei Liu
author_sort Min Zhao
title Adaptive Graph Regularized Low&#x2013;Rank Matrix Factorization With Noise and Outliers for Clustering
title_short Adaptive Graph Regularized Low&#x2013;Rank Matrix Factorization With Noise and Outliers for Clustering
title_full Adaptive Graph Regularized Low&#x2013;Rank Matrix Factorization With Noise and Outliers for Clustering
title_fullStr Adaptive Graph Regularized Low&#x2013;Rank Matrix Factorization With Noise and Outliers for Clustering
title_full_unstemmed Adaptive Graph Regularized Low&#x2013;Rank Matrix Factorization With Noise and Outliers for Clustering
title_sort adaptive graph regularized low&#x2013;rank matrix factorization with noise and outliers for clustering
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description Clustering, which is a commonly used tool, has been applied in machine learning, data mining and so on, and has received extensive research. However, there are usually noise and outliers in the data, which will bring about significant errors in the clustering results. In this paper, a robust clustering model with adaptive graph regularization (RCAG) is proposed, on which, sparse error matrix is introduced to express sparse noise, such as impulse noise, dead line, stripes, and &#x2113;<sub>1</sub> norm is introduced to alleviate the sparse noise. In addition, the &#x2113;<sub>2,1</sub> norm is also proposed mitigating the effects of outliers, and it has rotation invariance property. Therefore, our RCAG is insensitive to data noise and outliers. More importantly, the adaptive graph regularization is introduced into the RCAG to improve the clustering performance. Aiming at the optimization objective, we propose an iterative updating algorithm, named the Augmented Lagrangian Method (ALM), to update each optimization variable respectively. The convergence and time complexity of RCAG is also proved in theory. Finally, experimental results on fourteen datasets of four application scenarios, such as face image, handwriting recognition and UCI, elaborate the superiority of proposed method over seven existing classical clustering methods. The experimental results demonstrate that our approach achieves better clustering performance in ACC and Purity, which is a little less impressive in other ways.
topic Adaptive graph regularization
clustering
ℓ₂,₁
noise and outliers
augmented Lagrangian method
url https://ieeexplore.ieee.org/document/9201009/
work_keys_str_mv AT minzhao adaptivegraphregularizedlowx2013rankmatrixfactorizationwithnoiseandoutliersforclustering
AT jingleiliu adaptivegraphregularizedlowx2013rankmatrixfactorizationwithnoiseandoutliersforclustering
_version_ 1724182549384658944