Metody vytváření subjektivního slovníku pro indonézštinu
In this work, we created subjectivity lexicons of positive and negative expres- sions for Indonesian language by automatically translating English lexicons, and by intersecting and unioning the translation results. We compared the perfor- mances of the resulting lexicons using a simple prediction me...
Main Author: | |
---|---|
Other Authors: | |
Format: | Dissertation |
Language: | English |
Published: |
2013
|
Online Access: | http://www.nusl.cz/ntk/nusl-324076 |
id |
ndltd-nusl.cz-oai-invenio.nusl.cz-324076 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-nusl.cz-oai-invenio.nusl.cz-3240762021-03-29T05:12:24Z Metody vytváření subjektivního slovníku pro indonézštinu Methods for Creating Subjectivity Lexicon for Indonesian Franky, Bojar, Ondřej Kuboň, Vladislav In this work, we created subjectivity lexicons of positive and negative expres- sions for Indonesian language by automatically translating English lexicons, and by intersecting and unioning the translation results. We compared the perfor- mances of the resulting lexicons using a simple prediction method that compares the number of occurrences of positive and negative expressions in a sentence. We also experimented with weighting the expressions by their frequency and relative frequency in unannotated data. A modification in prediction method using ma- chine learning was later used to better incorporate the information that cannot be captured by the simple prediction. We showed that the lexicons were able to reach high recall but low precision when predicting whether a sentence is eval- uative (positive or negative) or not (neutral). Scoring the expressions improve the recall or precision but with comparable decrease in the other measure. The machine learning prediction was able to minimize the sensitivity of the perfor- mances to the size of the lexicon, but further experiments are required to explore the best choice for the prediction method. 1 2013 info:eu-repo/semantics/masterThesis http://www.nusl.cz/ntk/nusl-324076 eng info:eu-repo/semantics/restrictedAccess |
collection |
NDLTD |
language |
English |
format |
Dissertation |
sources |
NDLTD |
description |
In this work, we created subjectivity lexicons of positive and negative expres- sions for Indonesian language by automatically translating English lexicons, and by intersecting and unioning the translation results. We compared the perfor- mances of the resulting lexicons using a simple prediction method that compares the number of occurrences of positive and negative expressions in a sentence. We also experimented with weighting the expressions by their frequency and relative frequency in unannotated data. A modification in prediction method using ma- chine learning was later used to better incorporate the information that cannot be captured by the simple prediction. We showed that the lexicons were able to reach high recall but low precision when predicting whether a sentence is eval- uative (positive or negative) or not (neutral). Scoring the expressions improve the recall or precision but with comparable decrease in the other measure. The machine learning prediction was able to minimize the sensitivity of the perfor- mances to the size of the lexicon, but further experiments are required to explore the best choice for the prediction method. 1 |
author2 |
Bojar, Ondřej |
author_facet |
Bojar, Ondřej Franky, |
author |
Franky, |
spellingShingle |
Franky, Metody vytváření subjektivního slovníku pro indonézštinu |
author_sort |
Franky, |
title |
Metody vytváření subjektivního slovníku pro indonézštinu |
title_short |
Metody vytváření subjektivního slovníku pro indonézštinu |
title_full |
Metody vytváření subjektivního slovníku pro indonézštinu |
title_fullStr |
Metody vytváření subjektivního slovníku pro indonézštinu |
title_full_unstemmed |
Metody vytváření subjektivního slovníku pro indonézštinu |
title_sort |
metody vytváření subjektivního slovníku pro indonézštinu |
publishDate |
2013 |
url |
http://www.nusl.cz/ntk/nusl-324076 |
work_keys_str_mv |
AT franky metodyvytvarenisubjektivnihoslovnikuproindonezstinu AT franky methodsforcreatingsubjectivitylexiconforindonesian |
_version_ |
1719389450908205056 |