STATISTICAL METHODS IN AI: RARE EVENT LEARNING USING ASSOCIATIVE RULES AND HIGHER-ORDER STATISTICS

Rare event learning has not been actively researched since lately due to the unavailability of algorithms which deal with big samples. The research addresses spatio-temporal streams from multi-resolution sensors to find actionable items from a perspective of real-time algorithms. This computing fram...

Full description

Bibliographic Details
Main Authors: V. Iyer, S. Shetty, S. S. Iyengar
Format: Article
Language:English
Published: Copernicus Publications 2015-07-01
Series:ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Online Access:http://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/II-4-W2/119/2015/isprsannals-II-4-W2-119-2015.pdf
id doaj-18c18ebcd5c94830b539abf7bebf3e32
record_format Article
spelling doaj-18c18ebcd5c94830b539abf7bebf3e322020-11-25T00:27:22ZengCopernicus PublicationsISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences2194-90422194-90502015-07-01II-4/W211913010.5194/isprsannals-II-4-W2-119-2015STATISTICAL METHODS IN AI: RARE EVENT LEARNING USING ASSOCIATIVE RULES AND HIGHER-ORDER STATISTICSV. Iyer0S. Shetty1S. S. Iyengar2Electrical and Computer Engineering, Tennessee State University, Nashville, TN 37210, USAElectrical and Computer Engineering, Tennessee State University, Nashville, TN 37210, USASchool of Computing and Information Sciences, Florida International University, Miami, FL 33199, USARare event learning has not been actively researched since lately due to the unavailability of algorithms which deal with big samples. The research addresses spatio-temporal streams from multi-resolution sensors to find actionable items from a perspective of real-time algorithms. This computing framework is independent of the number of input samples, application domain, labelled or label-less streams. A sampling overlap algorithm such as Brooks-Iyengar is used for dealing with noisy sensor streams. We extend the existing noise pre-processing algorithms using Data-Cleaning trees. Pre-processing using ensemble of trees using bagging and multi-target regression showed robustness to random noise and missing data. As spatio-temporal streams are highly statistically correlated, we prove that a temporal window based sampling from sensor data streams converges after n samples using Hoeffding bounds. Which can be used for fast prediction of new samples in real-time. The Data-cleaning tree model uses a nonparametric node splitting technique, which can be learned in an iterative way which scales linearly in memory consumption for any size input stream. The improved task based ensemble extraction is compared with non-linear computation models using various SVM kernels for speed and accuracy. We show using empirical datasets the explicit rule learning computation is linear in time and is only dependent on the number of leafs present in the tree ensemble. The use of unpruned trees (<i>t</i>) in our proposed ensemble always yields minimum number (<i>m</i>) of leafs keeping pre-processing computation to <i>n</i> &times; <i>t</i> log <i>m</i> compared to <i>N<sup>2</sup></i> for Gram Matrix. We also show that the task based feature induction yields higher Qualify of Data (QoD) in the feature space compared to kernel methods using Gram Matrix.http://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/II-4-W2/119/2015/isprsannals-II-4-W2-119-2015.pdf
collection DOAJ
language English
format Article
sources DOAJ
author V. Iyer
S. Shetty
S. S. Iyengar
spellingShingle V. Iyer
S. Shetty
S. S. Iyengar
STATISTICAL METHODS IN AI: RARE EVENT LEARNING USING ASSOCIATIVE RULES AND HIGHER-ORDER STATISTICS
ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
author_facet V. Iyer
S. Shetty
S. S. Iyengar
author_sort V. Iyer
title STATISTICAL METHODS IN AI: RARE EVENT LEARNING USING ASSOCIATIVE RULES AND HIGHER-ORDER STATISTICS
title_short STATISTICAL METHODS IN AI: RARE EVENT LEARNING USING ASSOCIATIVE RULES AND HIGHER-ORDER STATISTICS
title_full STATISTICAL METHODS IN AI: RARE EVENT LEARNING USING ASSOCIATIVE RULES AND HIGHER-ORDER STATISTICS
title_fullStr STATISTICAL METHODS IN AI: RARE EVENT LEARNING USING ASSOCIATIVE RULES AND HIGHER-ORDER STATISTICS
title_full_unstemmed STATISTICAL METHODS IN AI: RARE EVENT LEARNING USING ASSOCIATIVE RULES AND HIGHER-ORDER STATISTICS
title_sort statistical methods in ai: rare event learning using associative rules and higher-order statistics
publisher Copernicus Publications
series ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
issn 2194-9042
2194-9050
publishDate 2015-07-01
description Rare event learning has not been actively researched since lately due to the unavailability of algorithms which deal with big samples. The research addresses spatio-temporal streams from multi-resolution sensors to find actionable items from a perspective of real-time algorithms. This computing framework is independent of the number of input samples, application domain, labelled or label-less streams. A sampling overlap algorithm such as Brooks-Iyengar is used for dealing with noisy sensor streams. We extend the existing noise pre-processing algorithms using Data-Cleaning trees. Pre-processing using ensemble of trees using bagging and multi-target regression showed robustness to random noise and missing data. As spatio-temporal streams are highly statistically correlated, we prove that a temporal window based sampling from sensor data streams converges after n samples using Hoeffding bounds. Which can be used for fast prediction of new samples in real-time. The Data-cleaning tree model uses a nonparametric node splitting technique, which can be learned in an iterative way which scales linearly in memory consumption for any size input stream. The improved task based ensemble extraction is compared with non-linear computation models using various SVM kernels for speed and accuracy. We show using empirical datasets the explicit rule learning computation is linear in time and is only dependent on the number of leafs present in the tree ensemble. The use of unpruned trees (<i>t</i>) in our proposed ensemble always yields minimum number (<i>m</i>) of leafs keeping pre-processing computation to <i>n</i> &times; <i>t</i> log <i>m</i> compared to <i>N<sup>2</sup></i> for Gram Matrix. We also show that the task based feature induction yields higher Qualify of Data (QoD) in the feature space compared to kernel methods using Gram Matrix.
url http://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/II-4-W2/119/2015/isprsannals-II-4-W2-119-2015.pdf
work_keys_str_mv AT viyer statisticalmethodsinairareeventlearningusingassociativerulesandhigherorderstatistics
AT sshetty statisticalmethodsinairareeventlearningusingassociativerulesandhigherorderstatistics
AT ssiyengar statisticalmethodsinairareeventlearningusingassociativerulesandhigherorderstatistics
_version_ 1725340283972878336