Active Learning in Black-Box Settings

Active learning refers to the settings in which a machine learning algorithm (learner) is able to select data from which it learns (selecting points and then obtaining their labels), and by doing so aims to achieve better accuracy (e.g., by avoiding obtaining training data that is redundant or unimp...

Full description

Bibliographic Details
Main Authors: Neil Rubens, Vera Sheinman, Ryota Tomioka, Masashi Sugiyama
Format: Article
Language:English
Published: Austrian Statistical Society 2016-02-01
Series:Austrian Journal of Statistics
Online Access:http://www.ajs.or.at/index.php/ajs/article/view/204
id doaj-5266542de9894436bfb77976aa3099e4
record_format Article
spelling doaj-5266542de9894436bfb77976aa3099e42021-04-22T12:34:43ZengAustrian Statistical SocietyAustrian Journal of Statistics1026-597X2016-02-01401&210.17713/ajs.v40i1&2.204Active Learning in Black-Box SettingsNeil Rubens0Vera Sheinman1Ryota Tomioka2Masashi Sugiyama3University of Electro-Communications, Tokyo, JapanJapanese Institute of Educational Measurement, Tokyo, JapanUniversity of Tokyo, Tokyo, JapanTokyo Institute of Technology, Tokyo, JapanActive learning refers to the settings in which a machine learning algorithm (learner) is able to select data from which it learns (selecting points and then obtaining their labels), and by doing so aims to achieve better accuracy (e.g., by avoiding obtaining training data that is redundant or unimportant). Active learning is particularly useful in cases where the labeling cost is high. A common assumption is that an active learning algorithm is aware of the details of the underlying learning algorithm for which it obtains the data. However, in many practical settings, obtaining precise details of the learning algorithm may not be feasible, making the underlying algorithm in essence a black box – no knowledge of the internal workings of the algorithm is available, and only the inputs and corresponding output estimates are accessible. This makes many of the traditional approaches not applicable, or at the least not effective. Hence our motivation is to use the only data that is accessible in black box settings – output estimates. We note that accuracy will improve only if the learner’s output estimates change. Therefore we propose active learning criterion that utilizes the information contained within the changes of output estimates.http://www.ajs.or.at/index.php/ajs/article/view/204
collection DOAJ
language English
format Article
sources DOAJ
author Neil Rubens
Vera Sheinman
Ryota Tomioka
Masashi Sugiyama
spellingShingle Neil Rubens
Vera Sheinman
Ryota Tomioka
Masashi Sugiyama
Active Learning in Black-Box Settings
Austrian Journal of Statistics
author_facet Neil Rubens
Vera Sheinman
Ryota Tomioka
Masashi Sugiyama
author_sort Neil Rubens
title Active Learning in Black-Box Settings
title_short Active Learning in Black-Box Settings
title_full Active Learning in Black-Box Settings
title_fullStr Active Learning in Black-Box Settings
title_full_unstemmed Active Learning in Black-Box Settings
title_sort active learning in black-box settings
publisher Austrian Statistical Society
series Austrian Journal of Statistics
issn 1026-597X
publishDate 2016-02-01
description Active learning refers to the settings in which a machine learning algorithm (learner) is able to select data from which it learns (selecting points and then obtaining their labels), and by doing so aims to achieve better accuracy (e.g., by avoiding obtaining training data that is redundant or unimportant). Active learning is particularly useful in cases where the labeling cost is high. A common assumption is that an active learning algorithm is aware of the details of the underlying learning algorithm for which it obtains the data. However, in many practical settings, obtaining precise details of the learning algorithm may not be feasible, making the underlying algorithm in essence a black box – no knowledge of the internal workings of the algorithm is available, and only the inputs and corresponding output estimates are accessible. This makes many of the traditional approaches not applicable, or at the least not effective. Hence our motivation is to use the only data that is accessible in black box settings – output estimates. We note that accuracy will improve only if the learner’s output estimates change. Therefore we propose active learning criterion that utilizes the information contained within the changes of output estimates.
url http://www.ajs.or.at/index.php/ajs/article/view/204
work_keys_str_mv AT neilrubens activelearninginblackboxsettings
AT verasheinman activelearninginblackboxsettings
AT ryotatomioka activelearninginblackboxsettings
AT masashisugiyama activelearninginblackboxsettings
_version_ 1721514338771009536