Multivariate mixed kernel density estimators and their application in machine learning for classification of biological objects based on spectral measurements

A problem of non-parametric multivariate density estimation for machine learning and data augmentation is considered. A new mixed density estimation method based on calculating the convolution of independently obtained kernel density estimates for unknown distributions of informative features and a...

Full description

Bibliographic Details
Main Authors: Alexander Sirota, Artem Donskikh, Alexey Akimov, Dmitry Minakov
Format: Article
Language:English
Published: Samara National Research University 2019-08-01
Series:Компьютерная оптика
Subjects:
Online Access:http://computeroptics.smr.ru/KO/PDF/KO43-4/430421.pdf
id doaj-5f0a06694e3342b29b64dcefa270186f
record_format Article
spelling doaj-5f0a06694e3342b29b64dcefa270186f2020-11-25T01:38:42ZengSamara National Research UniversityКомпьютерная оптика0134-24522412-61792019-08-0143467769110.18287/2412-6179-2019-43-4-677-691Multivariate mixed kernel density estimators and their application in machine learning for classification of biological objects based on spectral measurementsAlexander Sirota0Artem Donskikh1Alexey Akimov2Dmitry Minakov3Voronezh State University, Voronezh, RussiaVoronezh State University, Voronezh, RussiaVoronezh State University, Voronezh, RussiaVoronezh State University, Voronezh, RussiaA problem of non-parametric multivariate density estimation for machine learning and data augmentation is considered. A new mixed density estimation method based on calculating the convolution of independently obtained kernel density estimates for unknown distributions of informative features and a known (or independently estimated) density for non-informative interference occurring during measurements is proposed. Properties of the mixed density estimates obtained using this method are analyzed. The method is compared with a conventional Parzen-Rosenblatt window method applied directly to the training data. The equivalence of the mixed kernel density estimator and the data augmentation procedure based on the known (or estimated) statistical model of interference is theoretically and experimentally proven. The applicability of the mixed density estimators for training of machine learning algorithms for the classification of biological objects (elements of grain mixtures) based on spectral measurements in the visible and near-infrared regions is evaluated.http://computeroptics.smr.ru/KO/PDF/KO43-4/430421.pdfmachine learningpattern classificationdata augmentationkernel density estimationspectral measurements
collection DOAJ
language English
format Article
sources DOAJ
author Alexander Sirota
Artem Donskikh
Alexey Akimov
Dmitry Minakov
spellingShingle Alexander Sirota
Artem Donskikh
Alexey Akimov
Dmitry Minakov
Multivariate mixed kernel density estimators and their application in machine learning for classification of biological objects based on spectral measurements
Компьютерная оптика
machine learning
pattern classification
data augmentation
kernel density estimation
spectral measurements
author_facet Alexander Sirota
Artem Donskikh
Alexey Akimov
Dmitry Minakov
author_sort Alexander Sirota
title Multivariate mixed kernel density estimators and their application in machine learning for classification of biological objects based on spectral measurements
title_short Multivariate mixed kernel density estimators and their application in machine learning for classification of biological objects based on spectral measurements
title_full Multivariate mixed kernel density estimators and their application in machine learning for classification of biological objects based on spectral measurements
title_fullStr Multivariate mixed kernel density estimators and their application in machine learning for classification of biological objects based on spectral measurements
title_full_unstemmed Multivariate mixed kernel density estimators and their application in machine learning for classification of biological objects based on spectral measurements
title_sort multivariate mixed kernel density estimators and their application in machine learning for classification of biological objects based on spectral measurements
publisher Samara National Research University
series Компьютерная оптика
issn 0134-2452
2412-6179
publishDate 2019-08-01
description A problem of non-parametric multivariate density estimation for machine learning and data augmentation is considered. A new mixed density estimation method based on calculating the convolution of independently obtained kernel density estimates for unknown distributions of informative features and a known (or independently estimated) density for non-informative interference occurring during measurements is proposed. Properties of the mixed density estimates obtained using this method are analyzed. The method is compared with a conventional Parzen-Rosenblatt window method applied directly to the training data. The equivalence of the mixed kernel density estimator and the data augmentation procedure based on the known (or estimated) statistical model of interference is theoretically and experimentally proven. The applicability of the mixed density estimators for training of machine learning algorithms for the classification of biological objects (elements of grain mixtures) based on spectral measurements in the visible and near-infrared regions is evaluated.
topic machine learning
pattern classification
data augmentation
kernel density estimation
spectral measurements
url http://computeroptics.smr.ru/KO/PDF/KO43-4/430421.pdf
work_keys_str_mv AT alexandersirota multivariatemixedkerneldensityestimatorsandtheirapplicationinmachinelearningforclassificationofbiologicalobjectsbasedonspectralmeasurements
AT artemdonskikh multivariatemixedkerneldensityestimatorsandtheirapplicationinmachinelearningforclassificationofbiologicalobjectsbasedonspectralmeasurements
AT alexeyakimov multivariatemixedkerneldensityestimatorsandtheirapplicationinmachinelearningforclassificationofbiologicalobjectsbasedonspectralmeasurements
AT dmitryminakov multivariatemixedkerneldensityestimatorsandtheirapplicationinmachinelearningforclassificationofbiologicalobjectsbasedonspectralmeasurements
_version_ 1725052071174995968