An information theoretic approach to natural language processing.
A new method of natural language processing, based on the theory of information is described. Parsing of a sentence is accomplished not in a sequential manner, but in a fashion that begins by searching for the main verb of the sentence, then for the object, subject and perhaps for a prepositional ph...
Main Author: | |
---|---|
Other Authors: | |
Language: | en |
Published: |
The University of Arizona.
1994
|
Online Access: | http://hdl.handle.net/10150/186886 |
id |
ndltd-arizona.edu-oai-arizona.openrepository.com-10150-186886 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-arizona.edu-oai-arizona.openrepository.com-10150-1868862015-12-17T03:00:47Z An information theoretic approach to natural language processing. Grubbs, Elmer Andrew. Schooley, Larry C. Hill, Fredrick J. A new method of natural language processing, based on the theory of information is described. Parsing of a sentence is accomplished not in a sequential manner, but in a fashion that begins by searching for the main verb of the sentence, then for the object, subject and perhaps for a prepositional phrase. As each new part of speech is located, the uncertainty of the sentence's meaning is reduced. When the uncertainty reaches zero, the parsing is complete, and the machine performs the task assigned by the input sentence. The process is modeled by a Markov Chain, which can often be used for the internal representation of the sentence. All of this work is done for communication with an intelligent task oriented machine, but the theoretical basis for extending this to other, more complicated domains is also described. A description of a methodology for extending the theory, so that it can be used for the implementation of a machine that learns is also described in this paper. By using belief networks, the machine constructs additions to its basic Markov Chain in order to handle new verbs and objects, which were not included in the original programming. Once implemented, the system will then treat the new word as if it had originally been programmed into the machine. Finally, several prototypes are described which have been written to validate the theory presented. The information theoretic system contained herein is compared to other techniques of natural language processing, and shown to have significant advantages. 1994 text Dissertation-Reproduction (electronic) http://hdl.handle.net/10150/186886 9507016 en Copyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction or presentation (such as public display or performance) of protected items is prohibited except with permission of the author. The University of Arizona. |
collection |
NDLTD |
language |
en |
sources |
NDLTD |
description |
A new method of natural language processing, based on the theory of information is described. Parsing of a sentence is accomplished not in a sequential manner, but in a fashion that begins by searching for the main verb of the sentence, then for the object, subject and perhaps for a prepositional phrase. As each new part of speech is located, the uncertainty of the sentence's meaning is reduced. When the uncertainty reaches zero, the parsing is complete, and the machine performs the task assigned by the input sentence. The process is modeled by a Markov Chain, which can often be used for the internal representation of the sentence. All of this work is done for communication with an intelligent task oriented machine, but the theoretical basis for extending this to other, more complicated domains is also described. A description of a methodology for extending the theory, so that it can be used for the implementation of a machine that learns is also described in this paper. By using belief networks, the machine constructs additions to its basic Markov Chain in order to handle new verbs and objects, which were not included in the original programming. Once implemented, the system will then treat the new word as if it had originally been programmed into the machine. Finally, several prototypes are described which have been written to validate the theory presented. The information theoretic system contained herein is compared to other techniques of natural language processing, and shown to have significant advantages. |
author2 |
Schooley, Larry C. |
author_facet |
Schooley, Larry C. Grubbs, Elmer Andrew. |
author |
Grubbs, Elmer Andrew. |
spellingShingle |
Grubbs, Elmer Andrew. An information theoretic approach to natural language processing. |
author_sort |
Grubbs, Elmer Andrew. |
title |
An information theoretic approach to natural language processing. |
title_short |
An information theoretic approach to natural language processing. |
title_full |
An information theoretic approach to natural language processing. |
title_fullStr |
An information theoretic approach to natural language processing. |
title_full_unstemmed |
An information theoretic approach to natural language processing. |
title_sort |
information theoretic approach to natural language processing. |
publisher |
The University of Arizona. |
publishDate |
1994 |
url |
http://hdl.handle.net/10150/186886 |
work_keys_str_mv |
AT grubbselmerandrew aninformationtheoreticapproachtonaturallanguageprocessing AT grubbselmerandrew informationtheoreticapproachtonaturallanguageprocessing |
_version_ |
1718152911411740672 |