Resource-Bounded Information Acquisition and Learning

In many scenarios it is desirable to augment existing data with information acquired from an external source. For example, information from the Web can be used to fill missing values in a database or to correct errors. In many machine learning and data mining scenarios, acquiring additional feature...

Full description

Bibliographic Details
Main Author: Kanani, Pallika H
Format: Others
Published: ScholarWorks@UMass Amherst 2012
Subjects:
Online Access:https://scholarworks.umass.edu/open_access_dissertations/581
https://scholarworks.umass.edu/cgi/viewcontent.cgi?article=1580&context=open_access_dissertations
id ndltd-UMASS-oai-scholarworks.umass.edu-open_access_dissertations-1580
record_format oai_dc
spelling ndltd-UMASS-oai-scholarworks.umass.edu-open_access_dissertations-15802020-12-02T14:39:49Z Resource-Bounded Information Acquisition and Learning Kanani, Pallika H In many scenarios it is desirable to augment existing data with information acquired from an external source. For example, information from the Web can be used to fill missing values in a database or to correct errors. In many machine learning and data mining scenarios, acquiring additional feature values can lead to improved data quality and accuracy. However, there is often a cost associated with such information acquisition, and we typically need to operate under limited resources. In this thesis, I explore different aspects of Resource-bounded Information Acquisition and Learning. The process of acquiring information from an external source involves multiple steps, such as deciding what subset of information to obtain, locating the documents that contain the required information, acquiring relevant documents, extracting the specific piece of information, and combining it with existing information to make useful decisions. The problem of Resource-bounded Information Acquisition (RBIA) involves saving resources at each stage of the information acquisition process. I explore four special cases of the RBIA problem, propose general principles for efficiently acquiring external information in real-world domains, and demonstrate their effectiveness using extensive experiments. For example, in some of these domains I show how interdependency between fields or records in the data can also be exploited to achieve cost reduction. Finally, I propose a general framework for RBIA, that takes into account the state of the database at each point of time, dynamically adapts to the results of all the steps in the acquisition process so far, as well as the properties of each step, and carries them out striving to acquire most information with least amount resources. 2012-05-01T07:00:00Z text application/pdf https://scholarworks.umass.edu/open_access_dissertations/581 https://scholarworks.umass.edu/cgi/viewcontent.cgi?article=1580&context=open_access_dissertations Open Access Dissertations ScholarWorks@UMass Amherst Information Acquisition Information Extraction Machine Learning Reinforcement Learning Resource-bounded Web Mining Computer Sciences
collection NDLTD
format Others
sources NDLTD
topic Information Acquisition
Information Extraction
Machine Learning
Reinforcement Learning
Resource-bounded
Web Mining
Computer Sciences
spellingShingle Information Acquisition
Information Extraction
Machine Learning
Reinforcement Learning
Resource-bounded
Web Mining
Computer Sciences
Kanani, Pallika H
Resource-Bounded Information Acquisition and Learning
description In many scenarios it is desirable to augment existing data with information acquired from an external source. For example, information from the Web can be used to fill missing values in a database or to correct errors. In many machine learning and data mining scenarios, acquiring additional feature values can lead to improved data quality and accuracy. However, there is often a cost associated with such information acquisition, and we typically need to operate under limited resources. In this thesis, I explore different aspects of Resource-bounded Information Acquisition and Learning. The process of acquiring information from an external source involves multiple steps, such as deciding what subset of information to obtain, locating the documents that contain the required information, acquiring relevant documents, extracting the specific piece of information, and combining it with existing information to make useful decisions. The problem of Resource-bounded Information Acquisition (RBIA) involves saving resources at each stage of the information acquisition process. I explore four special cases of the RBIA problem, propose general principles for efficiently acquiring external information in real-world domains, and demonstrate their effectiveness using extensive experiments. For example, in some of these domains I show how interdependency between fields or records in the data can also be exploited to achieve cost reduction. Finally, I propose a general framework for RBIA, that takes into account the state of the database at each point of time, dynamically adapts to the results of all the steps in the acquisition process so far, as well as the properties of each step, and carries them out striving to acquire most information with least amount resources.
author Kanani, Pallika H
author_facet Kanani, Pallika H
author_sort Kanani, Pallika H
title Resource-Bounded Information Acquisition and Learning
title_short Resource-Bounded Information Acquisition and Learning
title_full Resource-Bounded Information Acquisition and Learning
title_fullStr Resource-Bounded Information Acquisition and Learning
title_full_unstemmed Resource-Bounded Information Acquisition and Learning
title_sort resource-bounded information acquisition and learning
publisher ScholarWorks@UMass Amherst
publishDate 2012
url https://scholarworks.umass.edu/open_access_dissertations/581
https://scholarworks.umass.edu/cgi/viewcontent.cgi?article=1580&context=open_access_dissertations
work_keys_str_mv AT kananipallikah resourceboundedinformationacquisitionandlearning
_version_ 1719365907529072640