Disambiguating words with self-organizing maps

Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2011. Cataloged from PDF version of thesis. Includes bibliographical references (p. 77). Today, powerful programs readily parse English text; understanding, however, is another...

Bibliographic Details
Main Author: Couturier, Martin Marcel
Other Authors: Patrick H. Winston.
Format: Others
Language: English
Published: Massachusetts Institute of Technology 2011
Subjects: Electrical Engineering and Computer Science.
Online Access: http://hdl.handle.net/1721.1/66413
id ndltd-MIT-oai-dspace.mit.edu-1721.1-66413
record_format oai_dc
spelling ndltd-MIT-oai-dspace.mit.edu-1721.1-66413
2019-05-02T15:35:18Z
Disambiguating words with self-organizing maps
Couturier, Martin Marcel
Patrick H. Winston.
Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science.
Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science.
Electrical Engineering and Computer Science.
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2011.
Cataloged from PDF version of thesis.
Includes bibliographical references (p. 77).
Today, powerful programs readily parse English text; understanding, however, is another matter. In this thesis, I take a step toward understanding by introducing CLARIFY, a program that disambiguates words. CLARIFY identifies patterns in observed word contexts, and uses these patterns to select the optimal word sense for any specific situation. CLARIFY learns successful patterns by manipulating an accelerated Self-Organizing Map to save these example contexts and then references them to perform further context-based disambiguation within the language. Through this process and after training on 125 examples, CLARIFY can now decipher that shrimp in the sentence "The shrimp goes to the store." is a small-person, not relying on a literal definition of each word as a separate element but looking at the sentence as a fluid solution of many elements, thereby making the inference crustacean absurd. CLARIFY is implemented in 1500 lines of Java.
by Martin Marcel Couturier.
M.Eng.
2011-10-17T21:23:17Z
2011-10-17T21:23:17Z
2011
2011
Thesis
http://hdl.handle.net/1721.1/66413
755091036
eng
M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission.
http://dspace.mit.edu/handle/1721.1/7582
77 p.
application/pdf
Massachusetts Institute of Technology
collection NDLTD
language English
format Others
sources NDLTD
topic Electrical Engineering and Computer Science.
description Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2011.
Cataloged from PDF version of thesis.
Includes bibliographical references (p. 77).
Today, powerful programs readily parse English text; understanding, however, is another matter. In this thesis, I take a step toward understanding by introducing CLARIFY, a program that disambiguates words. CLARIFY identifies patterns in observed word contexts, and uses these patterns to select the optimal word sense for any specific situation. CLARIFY learns successful patterns by manipulating an accelerated Self-Organizing Map to save these example contexts and then references them to perform further context-based disambiguation within the language. Through this process and after training on 125 examples, CLARIFY can now decipher that shrimp in the sentence "The shrimp goes to the store." is a small-person, not relying on a literal definition of each word as a separate element but looking at the sentence as a fluid solution of many elements, thereby making the inference crustacean absurd. CLARIFY is implemented in 1500 lines of Java.
by Martin Marcel Couturier.
M.Eng.
author2 Patrick H. Winston.
author_facet Patrick H. Winston.
Couturier, Martin Marcel
author Couturier, Martin Marcel
author_sort Couturier, Martin Marcel
title Disambiguating words with self-organizing maps
publisher Massachusetts Institute of Technology
publishDate 2011
url http://hdl.handle.net/1721.1/66413