Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data

In this paper, we investigate the potential of unsupervised feature selection techniques for classification tasks, where only sparse training data are available. This is motivated by the fact that unsupervised feature selection techniques combine the advantages of standard dimensionality reduction t...

Full description

Bibliographic Details
Main Authors: Patrick Erik Bradley, Sina Keller, Martin Weinmann
Format: Article
Language:English
Published: MDPI AG 2018-09-01
Series:Remote Sensing
Subjects:
Online Access:http://www.mdpi.com/2072-4292/10/10/1564
id doaj-0e4e96b1891549f39bb9c2d776802609
record_format Article
spelling doaj-0e4e96b1891549f39bb9c2d7768026092020-11-25T02:29:16ZengMDPI AGRemote Sensing2072-42922018-09-011010156410.3390/rs10101564rs10101564Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral DataPatrick Erik Bradley0Sina Keller1Martin Weinmann2Institute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology (KIT), Englerstr. 7, 76131 Karlsruhe, GermanyInstitute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology (KIT), Englerstr. 7, 76131 Karlsruhe, GermanyInstitute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology (KIT), Englerstr. 7, 76131 Karlsruhe, GermanyIn this paper, we investigate the potential of unsupervised feature selection techniques for classification tasks, where only sparse training data are available. This is motivated by the fact that unsupervised feature selection techniques combine the advantages of standard dimensionality reduction techniques (which only rely on the given feature vectors and not on the corresponding labels) and supervised feature selection techniques (which retain a subset of the original set of features). Thus, feature selection becomes independent of the given classification task and, consequently, a subset of generally versatile features is retained. We present different techniques relying on the topology of the given sparse training data. Thereby, the topology is described with an ultrametricity index. For the latter, we take into account the Murtagh Ultrametricity Index (MUI) which is defined on the basis of triangles within the given data and the Topological Ultrametricity Index (TUI) which is defined on the basis of a specific graph structure. In a case study addressing the classification of high-dimensional hyperspectral data based on sparse training data, we demonstrate the performance of the proposed unsupervised feature selection techniques in comparison to standard dimensionality reduction and supervised feature selection techniques on four commonly used benchmark datasets. The achieved classification results reveal that involving supervised feature selection techniques leads to similar classification results as involving unsupervised feature selection techniques, while the latter perform feature selection independently from the given classification task and thus deliver generally versatile features.http://www.mdpi.com/2072-4292/10/10/1564unsupervised feature selectionultrametricitysparse training dataclassificationland coverland usehyperspectral imageryROSIS dataAVIRIS dataEnMAP data
collection DOAJ
language English
format Article
sources DOAJ
author Patrick Erik Bradley
Sina Keller
Martin Weinmann
spellingShingle Patrick Erik Bradley
Sina Keller
Martin Weinmann
Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data
Remote Sensing
unsupervised feature selection
ultrametricity
sparse training data
classification
land cover
land use
hyperspectral imagery
ROSIS data
AVIRIS data
EnMAP data
author_facet Patrick Erik Bradley
Sina Keller
Martin Weinmann
author_sort Patrick Erik Bradley
title Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data
title_short Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data
title_full Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data
title_fullStr Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data
title_full_unstemmed Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data
title_sort unsupervised feature selection based on ultrametricity and sparse training data: a case study for the classification of high-dimensional hyperspectral data
publisher MDPI AG
series Remote Sensing
issn 2072-4292
publishDate 2018-09-01
description In this paper, we investigate the potential of unsupervised feature selection techniques for classification tasks, where only sparse training data are available. This is motivated by the fact that unsupervised feature selection techniques combine the advantages of standard dimensionality reduction techniques (which only rely on the given feature vectors and not on the corresponding labels) and supervised feature selection techniques (which retain a subset of the original set of features). Thus, feature selection becomes independent of the given classification task and, consequently, a subset of generally versatile features is retained. We present different techniques relying on the topology of the given sparse training data. Thereby, the topology is described with an ultrametricity index. For the latter, we take into account the Murtagh Ultrametricity Index (MUI) which is defined on the basis of triangles within the given data and the Topological Ultrametricity Index (TUI) which is defined on the basis of a specific graph structure. In a case study addressing the classification of high-dimensional hyperspectral data based on sparse training data, we demonstrate the performance of the proposed unsupervised feature selection techniques in comparison to standard dimensionality reduction and supervised feature selection techniques on four commonly used benchmark datasets. The achieved classification results reveal that involving supervised feature selection techniques leads to similar classification results as involving unsupervised feature selection techniques, while the latter perform feature selection independently from the given classification task and thus deliver generally versatile features.
topic unsupervised feature selection
ultrametricity
sparse training data
classification
land cover
land use
hyperspectral imagery
ROSIS data
AVIRIS data
EnMAP data
url http://www.mdpi.com/2072-4292/10/10/1564
work_keys_str_mv AT patrickerikbradley unsupervisedfeatureselectionbasedonultrametricityandsparsetrainingdataacasestudyfortheclassificationofhighdimensionalhyperspectraldata
AT sinakeller unsupervisedfeatureselectionbasedonultrametricityandsparsetrainingdataacasestudyfortheclassificationofhighdimensionalhyperspectraldata
AT martinweinmann unsupervisedfeatureselectionbasedonultrametricityandsparsetrainingdataacasestudyfortheclassificationofhighdimensionalhyperspectraldata
_version_ 1724834235412381696