Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data

In this paper, we investigate the potential of unsupervised feature selection techniques for classification tasks, where only sparse training data are available. This is motivated by the fact that unsupervised feature selection techniques combine the advantages of standard dimensionality reduction t...

Full description

Bibliographic Details
Main Authors:	Patrick Erik Bradley, Sina Keller, Martin Weinmann
Format:	Article
Language:	English
Published:	MDPI AG 2018-09-01
Series:	Remote Sensing
Subjects:	unsupervised feature selection ultrametricity sparse training data classification land cover land use hyperspectral imagery ROSIS data AVIRIS data EnMAP data
Online Access:	http://www.mdpi.com/2072-4292/10/10/1564

id	doaj-0e4e96b1891549f39bb9c2d776802609
record_format	Article
spelling	doaj-0e4e96b1891549f39bb9c2d7768026092020-11-25T02:29:16ZengMDPI AGRemote Sensing2072-42922018-09-011010156410.3390/rs10101564rs10101564Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral DataPatrick Erik Bradley0Sina Keller1Martin Weinmann2Institute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology (KIT), Englerstr. 7, 76131 Karlsruhe, GermanyInstitute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology (KIT), Englerstr. 7, 76131 Karlsruhe, GermanyInstitute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology (KIT), Englerstr. 7, 76131 Karlsruhe, GermanyIn this paper, we investigate the potential of unsupervised feature selection techniques for classification tasks, where only sparse training data are available. This is motivated by the fact that unsupervised feature selection techniques combine the advantages of standard dimensionality reduction techniques (which only rely on the given feature vectors and not on the corresponding labels) and supervised feature selection techniques (which retain a subset of the original set of features). Thus, feature selection becomes independent of the given classification task and, consequently, a subset of generally versatile features is retained. We present different techniques relying on the topology of the given sparse training data. Thereby, the topology is described with an ultrametricity index. For the latter, we take into account the Murtagh Ultrametricity Index (MUI) which is defined on the basis of triangles within the given data and the Topological Ultrametricity Index (TUI) which is defined on the basis of a specific graph structure. In a case study addressing the classification of high-dimensional hyperspectral data based on sparse training data, we demonstrate the performance of the proposed unsupervised feature selection techniques in comparison to standard dimensionality reduction and supervised feature selection techniques on four commonly used benchmark datasets. The achieved classification results reveal that involving supervised feature selection techniques leads to similar classification results as involving unsupervised feature selection techniques, while the latter perform feature selection independently from the given classification task and thus deliver generally versatile features.http://www.mdpi.com/2072-4292/10/10/1564unsupervised feature selectionultrametricitysparse training dataclassificationland coverland usehyperspectral imageryROSIS dataAVIRIS dataEnMAP data
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Patrick Erik Bradley Sina Keller Martin Weinmann
spellingShingle	Patrick Erik Bradley Sina Keller Martin Weinmann Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data Remote Sensing unsupervised feature selection ultrametricity sparse training data classification land cover land use hyperspectral imagery ROSIS data AVIRIS data EnMAP data
author_facet	Patrick Erik Bradley Sina Keller Martin Weinmann
author_sort	Patrick Erik Bradley
title	Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data
title_short	Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data
title_full	Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data
title_fullStr	Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data
title_full_unstemmed	Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data
title_sort	unsupervised feature selection based on ultrametricity and sparse training data: a case study for the classification of high-dimensional hyperspectral data
publisher	MDPI AG
series	Remote Sensing
issn	2072-4292
publishDate	2018-09-01
description	In this paper, we investigate the potential of unsupervised feature selection techniques for classification tasks, where only sparse training data are available. This is motivated by the fact that unsupervised feature selection techniques combine the advantages of standard dimensionality reduction techniques (which only rely on the given feature vectors and not on the corresponding labels) and supervised feature selection techniques (which retain a subset of the original set of features). Thus, feature selection becomes independent of the given classification task and, consequently, a subset of generally versatile features is retained. We present different techniques relying on the topology of the given sparse training data. Thereby, the topology is described with an ultrametricity index. For the latter, we take into account the Murtagh Ultrametricity Index (MUI) which is defined on the basis of triangles within the given data and the Topological Ultrametricity Index (TUI) which is defined on the basis of a specific graph structure. In a case study addressing the classification of high-dimensional hyperspectral data based on sparse training data, we demonstrate the performance of the proposed unsupervised feature selection techniques in comparison to standard dimensionality reduction and supervised feature selection techniques on four commonly used benchmark datasets. The achieved classification results reveal that involving supervised feature selection techniques leads to similar classification results as involving unsupervised feature selection techniques, while the latter perform feature selection independently from the given classification task and thus deliver generally versatile features.
topic	unsupervised feature selection ultrametricity sparse training data classification land cover land use hyperspectral imagery ROSIS data AVIRIS data EnMAP data
url	http://www.mdpi.com/2072-4292/10/10/1564
work_keys_str_mv	AT patrickerikbradley unsupervisedfeatureselectionbasedonultrametricityandsparsetrainingdataacasestudyfortheclassificationofhighdimensionalhyperspectraldata AT sinakeller unsupervisedfeatureselectionbasedonultrametricityandsparsetrainingdataacasestudyfortheclassificationofhighdimensionalhyperspectraldata AT martinweinmann unsupervisedfeatureselectionbasedonultrametricityandsparsetrainingdataacasestudyfortheclassificationofhighdimensionalhyperspectraldata
_version_	1724834235412381696

Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data

Similar Items