The Impact of Supervised Manifold Learning on Structure Preserving and Classification Error: A Theoretical Study

In recent years, a variety of supervised manifold learning techniques have been proposed to outperform their unsupervised alternative versions in terms of classification accuracy and data structure capturing. Some dissimilarity measures have been used in these techniques to guide the dimensionality...

Full description

Bibliographic Details
Main Authors: Laureta Hajderanj, Daqing Chen, Isakh Weheliye
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9380338/
id doaj-e0c7aad72b7548a2bc6eb19b6b33c911
record_format Article
spelling doaj-e0c7aad72b7548a2bc6eb19b6b33c9112021-03-30T15:24:21ZengIEEEIEEE Access2169-35362021-01-019439094392210.1109/ACCESS.2021.30662599380338The Impact of Supervised Manifold Learning on Structure Preserving and Classification Error: A Theoretical StudyLaureta Hajderanj0https://orcid.org/0000-0002-1865-0648Daqing Chen1https://orcid.org/0000-0003-0030-1199Isakh Weheliye2School of Engineering, London South Bank University, London, U.K.School of Engineering, London South Bank University, London, U.K.School of Engineering, London South Bank University, London, U.K.In recent years, a variety of supervised manifold learning techniques have been proposed to outperform their unsupervised alternative versions in terms of classification accuracy and data structure capturing. Some dissimilarity measures have been used in these techniques to guide the dimensionality reduction process. Their good performance was empirically demonstrated; however, the relevant analysis is still missing. This paper contributes to a theoretical analysis on a) how dissimilarity measures affect maintaining manifold neighbourhood structure and b) how supervised manifold learning techniques could contribute to the reduction of classification error. This paper also provides a cross-comparison between supervised and unsupervised manifold learning approaches in terms of structure capturing using Kendall&#x2019;s Tau coefficients and co-ranking matrices. Four different metrics (including three dissimilarity measures and Euclidean distance) have been considered along with manifold learning methods such as Isomap, <inline-formula> <tex-math notation="LaTeX">${t}$ </tex-math></inline-formula>-Stochastic Neighbour Embedding (<inline-formula> <tex-math notation="LaTeX">${t}$ </tex-math></inline-formula>-SNE), and Laplacian Eigenmaps (LE), in two datasets: Breast Cancer and Swiss-Roll. This paper concludes that although the dissimilarity measures used in the manifold learning techniques can reduce classification error, they do not learn well or preserve the structure of the hidden manifold in the high dimensional space, but instead, they destroy the structure of the data. Based on the findings of this paper, it is advisable to use supervised manifold learning techniques as a pre-processing step in classification. In addition, it is not advisable to apply supervised manifold learning for visualization purposes since the two-dimensional representation using supervised manifold learning does not improve the preservation of data structure.https://ieeexplore.ieee.org/document/9380338/Classification errorstructure capturingmanifold learningsupervised manifold learningvisualization
collection DOAJ
language English
format Article
sources DOAJ
author Laureta Hajderanj
Daqing Chen
Isakh Weheliye
spellingShingle Laureta Hajderanj
Daqing Chen
Isakh Weheliye
The Impact of Supervised Manifold Learning on Structure Preserving and Classification Error: A Theoretical Study
IEEE Access
Classification error
structure capturing
manifold learning
supervised manifold learning
visualization
author_facet Laureta Hajderanj
Daqing Chen
Isakh Weheliye
author_sort Laureta Hajderanj
title The Impact of Supervised Manifold Learning on Structure Preserving and Classification Error: A Theoretical Study
title_short The Impact of Supervised Manifold Learning on Structure Preserving and Classification Error: A Theoretical Study
title_full The Impact of Supervised Manifold Learning on Structure Preserving and Classification Error: A Theoretical Study
title_fullStr The Impact of Supervised Manifold Learning on Structure Preserving and Classification Error: A Theoretical Study
title_full_unstemmed The Impact of Supervised Manifold Learning on Structure Preserving and Classification Error: A Theoretical Study
title_sort impact of supervised manifold learning on structure preserving and classification error: a theoretical study
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description In recent years, a variety of supervised manifold learning techniques have been proposed to outperform their unsupervised alternative versions in terms of classification accuracy and data structure capturing. Some dissimilarity measures have been used in these techniques to guide the dimensionality reduction process. Their good performance was empirically demonstrated; however, the relevant analysis is still missing. This paper contributes to a theoretical analysis on a) how dissimilarity measures affect maintaining manifold neighbourhood structure and b) how supervised manifold learning techniques could contribute to the reduction of classification error. This paper also provides a cross-comparison between supervised and unsupervised manifold learning approaches in terms of structure capturing using Kendall&#x2019;s Tau coefficients and co-ranking matrices. Four different metrics (including three dissimilarity measures and Euclidean distance) have been considered along with manifold learning methods such as Isomap, <inline-formula> <tex-math notation="LaTeX">${t}$ </tex-math></inline-formula>-Stochastic Neighbour Embedding (<inline-formula> <tex-math notation="LaTeX">${t}$ </tex-math></inline-formula>-SNE), and Laplacian Eigenmaps (LE), in two datasets: Breast Cancer and Swiss-Roll. This paper concludes that although the dissimilarity measures used in the manifold learning techniques can reduce classification error, they do not learn well or preserve the structure of the hidden manifold in the high dimensional space, but instead, they destroy the structure of the data. Based on the findings of this paper, it is advisable to use supervised manifold learning techniques as a pre-processing step in classification. In addition, it is not advisable to apply supervised manifold learning for visualization purposes since the two-dimensional representation using supervised manifold learning does not improve the preservation of data structure.
topic Classification error
structure capturing
manifold learning
supervised manifold learning
visualization
url https://ieeexplore.ieee.org/document/9380338/
work_keys_str_mv AT lauretahajderanj theimpactofsupervisedmanifoldlearningonstructurepreservingandclassificationerroratheoreticalstudy
AT daqingchen theimpactofsupervisedmanifoldlearningonstructurepreservingandclassificationerroratheoreticalstudy
AT isakhweheliye theimpactofsupervisedmanifoldlearningonstructurepreservingandclassificationerroratheoreticalstudy
AT lauretahajderanj impactofsupervisedmanifoldlearningonstructurepreservingandclassificationerroratheoreticalstudy
AT daqingchen impactofsupervisedmanifoldlearningonstructurepreservingandclassificationerroratheoreticalstudy
AT isakhweheliye impactofsupervisedmanifoldlearningonstructurepreservingandclassificationerroratheoreticalstudy
_version_ 1724179588506976256