Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data
In this paper, we investigate the potential of unsupervised feature selection techniques for classification tasks, where only sparse training data are available. This is motivated by the fact that unsupervised feature selection techniques combine the advantages of standard dimensionality reduction t...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2018-09-01
|
Series: | Remote Sensing |
Subjects: | |
Online Access: | http://www.mdpi.com/2072-4292/10/10/1564 |
id |
doaj-0e4e96b1891549f39bb9c2d776802609 |
---|---|
record_format |
Article |
spelling |
doaj-0e4e96b1891549f39bb9c2d7768026092020-11-25T02:29:16ZengMDPI AGRemote Sensing2072-42922018-09-011010156410.3390/rs10101564rs10101564Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral DataPatrick Erik Bradley0Sina Keller1Martin Weinmann2Institute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology (KIT), Englerstr. 7, 76131 Karlsruhe, GermanyInstitute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology (KIT), Englerstr. 7, 76131 Karlsruhe, GermanyInstitute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology (KIT), Englerstr. 7, 76131 Karlsruhe, GermanyIn this paper, we investigate the potential of unsupervised feature selection techniques for classification tasks, where only sparse training data are available. This is motivated by the fact that unsupervised feature selection techniques combine the advantages of standard dimensionality reduction techniques (which only rely on the given feature vectors and not on the corresponding labels) and supervised feature selection techniques (which retain a subset of the original set of features). Thus, feature selection becomes independent of the given classification task and, consequently, a subset of generally versatile features is retained. We present different techniques relying on the topology of the given sparse training data. Thereby, the topology is described with an ultrametricity index. For the latter, we take into account the Murtagh Ultrametricity Index (MUI) which is defined on the basis of triangles within the given data and the Topological Ultrametricity Index (TUI) which is defined on the basis of a specific graph structure. In a case study addressing the classification of high-dimensional hyperspectral data based on sparse training data, we demonstrate the performance of the proposed unsupervised feature selection techniques in comparison to standard dimensionality reduction and supervised feature selection techniques on four commonly used benchmark datasets. The achieved classification results reveal that involving supervised feature selection techniques leads to similar classification results as involving unsupervised feature selection techniques, while the latter perform feature selection independently from the given classification task and thus deliver generally versatile features.http://www.mdpi.com/2072-4292/10/10/1564unsupervised feature selectionultrametricitysparse training dataclassificationland coverland usehyperspectral imageryROSIS dataAVIRIS dataEnMAP data |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Patrick Erik Bradley Sina Keller Martin Weinmann |
spellingShingle |
Patrick Erik Bradley Sina Keller Martin Weinmann Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data Remote Sensing unsupervised feature selection ultrametricity sparse training data classification land cover land use hyperspectral imagery ROSIS data AVIRIS data EnMAP data |
author_facet |
Patrick Erik Bradley Sina Keller Martin Weinmann |
author_sort |
Patrick Erik Bradley |
title |
Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data |
title_short |
Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data |
title_full |
Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data |
title_fullStr |
Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data |
title_full_unstemmed |
Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data |
title_sort |
unsupervised feature selection based on ultrametricity and sparse training data: a case study for the classification of high-dimensional hyperspectral data |
publisher |
MDPI AG |
series |
Remote Sensing |
issn |
2072-4292 |
publishDate |
2018-09-01 |
description |
In this paper, we investigate the potential of unsupervised feature selection techniques for classification tasks, where only sparse training data are available. This is motivated by the fact that unsupervised feature selection techniques combine the advantages of standard dimensionality reduction techniques (which only rely on the given feature vectors and not on the corresponding labels) and supervised feature selection techniques (which retain a subset of the original set of features). Thus, feature selection becomes independent of the given classification task and, consequently, a subset of generally versatile features is retained. We present different techniques relying on the topology of the given sparse training data. Thereby, the topology is described with an ultrametricity index. For the latter, we take into account the Murtagh Ultrametricity Index (MUI) which is defined on the basis of triangles within the given data and the Topological Ultrametricity Index (TUI) which is defined on the basis of a specific graph structure. In a case study addressing the classification of high-dimensional hyperspectral data based on sparse training data, we demonstrate the performance of the proposed unsupervised feature selection techniques in comparison to standard dimensionality reduction and supervised feature selection techniques on four commonly used benchmark datasets. The achieved classification results reveal that involving supervised feature selection techniques leads to similar classification results as involving unsupervised feature selection techniques, while the latter perform feature selection independently from the given classification task and thus deliver generally versatile features. |
topic |
unsupervised feature selection ultrametricity sparse training data classification land cover land use hyperspectral imagery ROSIS data AVIRIS data EnMAP data |
url |
http://www.mdpi.com/2072-4292/10/10/1564 |
work_keys_str_mv |
AT patrickerikbradley unsupervisedfeatureselectionbasedonultrametricityandsparsetrainingdataacasestudyfortheclassificationofhighdimensionalhyperspectraldata AT sinakeller unsupervisedfeatureselectionbasedonultrametricityandsparsetrainingdataacasestudyfortheclassificationofhighdimensionalhyperspectraldata AT martinweinmann unsupervisedfeatureselectionbasedonultrametricityandsparsetrainingdataacasestudyfortheclassificationofhighdimensionalhyperspectraldata |
_version_ |
1724834235412381696 |