Novel feature selection method based on Stochastic Methods Coupled to Support Vector Machines using H- NMR data (data of olive and hazelnut oils)
One of the principal inconveniences that analysis and information processing presents is that of the representation of dataset. Normally, one encounters a high number of samples, each one with thousands of variables, and in many cases with irrelevant information and noise. Therefore,...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Universidad Industrial de Santander
2014-12-01
|
Series: | Revista Ion |
Subjects: | |
Online Access: | http://revistas.uis.edu.co/index.php/revistaion/article/view/4594/4814 |
id |
doaj-6f438c010d924a53b75eb52c7f87e1ca |
---|---|
record_format |
Article |
spelling |
doaj-6f438c010d924a53b75eb52c7f87e1ca2020-11-25T01:08:02ZengUniversidad Industrial de SantanderRevista Ion0120-100X2145-84802014-12-012721728Novel feature selection method based on Stochastic Methods Coupled to Support Vector Machines using H- NMR data (data of olive and hazelnut oils)Oscar Eduardo Gualdron0Claudia Isaza1Cristhian Manuel DuranUniversidad de PamplonaUniversidad de PamplonaOne of the principal inconveniences that analysis and information processing presents is that of the representation of dataset. Normally, one encounters a high number of samples, each one with thousands of variables, and in many cases with irrelevant information and noise. Therefore, in order to represent findings in a clearer way, it is necessary to reduce the amount of variables. In this paper, a novel variable selection technique for multivariable data analysis, inspired on stochastic methods and designed to work with support vector machines (SVM), is described. The approach is demonstrated in a food application involving the detection of adulteration of olive oil (more expensive) with hazelnut oil (cheaper). Fingerprinting by H NMR spectroscopy was used to analyze the different samples. Results show that it is possible to reduce the number of variables without affecting classification results.http://revistas.uis.edu.co/index.php/revistaion/article/view/4594/4814eature selectionH-NMRsimulated annealingsupport vector machineolive oilhazelnut oil |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Oscar Eduardo Gualdron Claudia Isaza Cristhian Manuel Duran |
spellingShingle |
Oscar Eduardo Gualdron Claudia Isaza Cristhian Manuel Duran Novel feature selection method based on Stochastic Methods Coupled to Support Vector Machines using H- NMR data (data of olive and hazelnut oils) Revista Ion eature selection H-NMR simulated annealing support vector machine olive oil hazelnut oil |
author_facet |
Oscar Eduardo Gualdron Claudia Isaza Cristhian Manuel Duran |
author_sort |
Oscar Eduardo Gualdron |
title |
Novel feature selection method based on Stochastic Methods Coupled to Support Vector Machines using H- NMR data (data of olive and hazelnut oils) |
title_short |
Novel feature selection method based on Stochastic Methods Coupled to Support Vector Machines using H- NMR data (data of olive and hazelnut oils) |
title_full |
Novel feature selection method based on Stochastic Methods Coupled to Support Vector Machines using H- NMR data (data of olive and hazelnut oils) |
title_fullStr |
Novel feature selection method based on Stochastic Methods Coupled to Support Vector Machines using H- NMR data (data of olive and hazelnut oils) |
title_full_unstemmed |
Novel feature selection method based on Stochastic Methods Coupled to Support Vector Machines using H- NMR data (data of olive and hazelnut oils) |
title_sort |
novel feature selection method based on stochastic methods coupled to support vector machines using h- nmr data (data of olive and hazelnut oils) |
publisher |
Universidad Industrial de Santander |
series |
Revista Ion |
issn |
0120-100X 2145-8480 |
publishDate |
2014-12-01 |
description |
One of the principal inconveniences that analysis and information processing presents is that of the representation of dataset. Normally, one encounters a high number of samples, each one with thousands of variables, and in many cases with irrelevant information and noise. Therefore, in order to represent findings in a clearer way, it is necessary to reduce the amount of variables. In this paper, a novel variable selection technique for multivariable data analysis, inspired on stochastic methods and designed to work with support vector machines (SVM), is described. The approach is demonstrated in a food application involving the detection of adulteration of olive oil (more expensive) with hazelnut oil (cheaper). Fingerprinting by H NMR spectroscopy was used to analyze the different samples. Results show that it is possible to reduce the number of variables without affecting classification results. |
topic |
eature selection H-NMR simulated annealing support vector machine olive oil hazelnut oil |
url |
http://revistas.uis.edu.co/index.php/revistaion/article/view/4594/4814 |
work_keys_str_mv |
AT oscareduardogualdron novelfeatureselectionmethodbasedonstochasticmethodscoupledtosupportvectormachinesusinghnmrdatadataofoliveandhazelnutoils AT claudiaisaza novelfeatureselectionmethodbasedonstochasticmethodscoupledtosupportvectormachinesusinghnmrdatadataofoliveandhazelnutoils AT cristhianmanuelduran novelfeatureselectionmethodbasedonstochasticmethodscoupledtosupportvectormachinesusinghnmrdatadataofoliveandhazelnutoils |
_version_ |
1725184667529773056 |