Knowledge Acquisition, Delivery and Prediction through Text Mining

The World Wide Web is an abundant source for Textual Web Mining research. Data can be acquired from Web texts and converted to Information or Knowledge for immediate consumption. Studying the acquisition and consumption of Web text can provide a glimpse into the social/behavioral aspects of Web Us...

Full description

Bibliographic Details
Main Author: Schumaker, Robert P.
Other Authors: Chen, Hsinchun
Language:EN
Published: The University of Arizona. 2007
Online Access:http://hdl.handle.net/10150/194680
id ndltd-arizona.edu-oai-arizona.openrepository.com-10150-194680
record_format oai_dc
spelling ndltd-arizona.edu-oai-arizona.openrepository.com-10150-1946802015-10-23T04:41:25Z Knowledge Acquisition, Delivery and Prediction through Text Mining Schumaker, Robert P. Chen, Hsinchun Chen, Hsinchun Zhang, Zhu Zhao, Leon Nunamaker, Jay The World Wide Web is an abundant source for Textual Web Mining research. Data can be acquired from Web texts and converted to Information or Knowledge for immediate consumption. Studying the acquisition and consumption of Web text can provide a glimpse into the social/behavioral aspects of Web Users and Web Content Providers. Patterns embedded within textual data can be similarly identified through technical means and even anticipated.Seven essays explore the important algorithmic and computational aspects needed in the analysis of acquiring, delivering and making predictions from Web texts. Chapters 2 and 3 describe the knowledge acquisition process and feasibility of leveraging Web users. While the knowledge acquired from Web users was not as refined as that from domain experts, the knowledge gathered was found to be of acceptable quality. From our analysis of dialog systems, it was found that Web users were more likely to augment the breadth of existing knowledge by adding new response sets to the knowledge base. Chapters 4 and 5 look at the aspects of knowledge delivery to Web users. Using a dialog system, we observe the acceptance and satisfaction levels of dialog responses in general conversation, domain knowledge and the combination of both knowledge bases. Chapters 6 through 8 consider the prediction facet of knowledge using textual financial news articles and stock prices. This section focuses on comparing different model parameters and textual representations to best describe future prices as well as an examination of document representation based on the sector and industry a company is engaged in. From these analyses we found that Sector-based aggregation led to the best price predictions.Together these essays effectively leverage large amounts of textual Web data to represent knowledge in meaningful ways to end users. These essays also provide the blueprints for several real-world applications. The approaches and techniques described borrow from referent disciplines of linguistics, finance, computer science, statistics as well as MIS and demonstrate potentially useful applications for dialog systems, quantitative stock prediction and other knowledge management processes in which textual data can be accurately represented and forecast; thus improving the exchange of human knowledge. 2007 text Electronic Dissertation http://hdl.handle.net/10150/194680 659747135 2058 EN Copyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction or presentation (such as public display or performance) of protected items is prohibited except with permission of the author. The University of Arizona.
collection NDLTD
language EN
sources NDLTD
description The World Wide Web is an abundant source for Textual Web Mining research. Data can be acquired from Web texts and converted to Information or Knowledge for immediate consumption. Studying the acquisition and consumption of Web text can provide a glimpse into the social/behavioral aspects of Web Users and Web Content Providers. Patterns embedded within textual data can be similarly identified through technical means and even anticipated.Seven essays explore the important algorithmic and computational aspects needed in the analysis of acquiring, delivering and making predictions from Web texts. Chapters 2 and 3 describe the knowledge acquisition process and feasibility of leveraging Web users. While the knowledge acquired from Web users was not as refined as that from domain experts, the knowledge gathered was found to be of acceptable quality. From our analysis of dialog systems, it was found that Web users were more likely to augment the breadth of existing knowledge by adding new response sets to the knowledge base. Chapters 4 and 5 look at the aspects of knowledge delivery to Web users. Using a dialog system, we observe the acceptance and satisfaction levels of dialog responses in general conversation, domain knowledge and the combination of both knowledge bases. Chapters 6 through 8 consider the prediction facet of knowledge using textual financial news articles and stock prices. This section focuses on comparing different model parameters and textual representations to best describe future prices as well as an examination of document representation based on the sector and industry a company is engaged in. From these analyses we found that Sector-based aggregation led to the best price predictions.Together these essays effectively leverage large amounts of textual Web data to represent knowledge in meaningful ways to end users. These essays also provide the blueprints for several real-world applications. The approaches and techniques described borrow from referent disciplines of linguistics, finance, computer science, statistics as well as MIS and demonstrate potentially useful applications for dialog systems, quantitative stock prediction and other knowledge management processes in which textual data can be accurately represented and forecast; thus improving the exchange of human knowledge.
author2 Chen, Hsinchun
author_facet Chen, Hsinchun
Schumaker, Robert P.
author Schumaker, Robert P.
spellingShingle Schumaker, Robert P.
Knowledge Acquisition, Delivery and Prediction through Text Mining
author_sort Schumaker, Robert P.
title Knowledge Acquisition, Delivery and Prediction through Text Mining
title_short Knowledge Acquisition, Delivery and Prediction through Text Mining
title_full Knowledge Acquisition, Delivery and Prediction through Text Mining
title_fullStr Knowledge Acquisition, Delivery and Prediction through Text Mining
title_full_unstemmed Knowledge Acquisition, Delivery and Prediction through Text Mining
title_sort knowledge acquisition, delivery and prediction through text mining
publisher The University of Arizona.
publishDate 2007
url http://hdl.handle.net/10150/194680
work_keys_str_mv AT schumakerrobertp knowledgeacquisitiondeliveryandpredictionthroughtextmining
_version_ 1718099329063845888