AUTOMATIC RETRIEVAL AND THE FORMALIZATION OF MULTI WORDS EXPRESSIONS WITH F-WORDS IN THE CORPUS OF CONTEMPORARY AMERICAN ENGLISH
The research problems in this research are 1) how lexicogrammar takes role in determining polarity of F-Word1 and 2) how to formalize it for corpus processing. The data is obtained from the Contemporary American English Corpus (COCA). In this corpus, F-word is proven to be highest in frequency as co...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Universitas Gadjah Mada
2016-01-01
|
Series: | Humaniora |
Subjects: | |
Online Access: | https://jurnal.ugm.ac.id/jurnal-humaniora/article/view/8709 |
id |
doaj-7ff3fdee71754d3d94b5f81f07a92de2 |
---|---|
record_format |
Article |
spelling |
doaj-7ff3fdee71754d3d94b5f81f07a92de22020-11-24T21:42:55ZengUniversitas Gadjah MadaHumaniora0852-08012302-92692016-01-0127215617310.22146/jh.v27i2.87097175AUTOMATIC RETRIEVAL AND THE FORMALIZATION OF MULTI WORDS EXPRESSIONS WITH F-WORDS IN THE CORPUS OF CONTEMPORARY AMERICAN ENGLISHPrihantoro Prihantoro0Faculty of Humanities Diponegoro University Semarang, IndonesiaThe research problems in this research are 1) how lexicogrammar takes role in determining polarity of F-Word1 and 2) how to formalize it for corpus processing. The data is obtained from the Contemporary American English Corpus (COCA). In this corpus, F-word is proven to be highest in frequency as compared to its distribution across corpora. Corpus methodology is applied by sending queries to retrieve F-Words to COCA interface. Tokens combination surrounding F-words resulted in the phrase and clause unit accompanying F-words, which are significant cues to determine F-word polarity. The polarity is later proven to be not necessarily negative. I also designed a computational resource to allow the retrieval of F-words offline so that users might apply it to any digital text collections.https://jurnal.ugm.ac.id/jurnal-humaniora/article/view/8709F-words, Corpus, Information Retrieval, American English, Polarity |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Prihantoro Prihantoro |
spellingShingle |
Prihantoro Prihantoro AUTOMATIC RETRIEVAL AND THE FORMALIZATION OF MULTI WORDS EXPRESSIONS WITH F-WORDS IN THE CORPUS OF CONTEMPORARY AMERICAN ENGLISH Humaniora F-words, Corpus, Information Retrieval, American English, Polarity |
author_facet |
Prihantoro Prihantoro |
author_sort |
Prihantoro Prihantoro |
title |
AUTOMATIC RETRIEVAL AND THE FORMALIZATION OF MULTI WORDS EXPRESSIONS WITH F-WORDS IN THE CORPUS OF CONTEMPORARY AMERICAN ENGLISH |
title_short |
AUTOMATIC RETRIEVAL AND THE FORMALIZATION OF MULTI WORDS EXPRESSIONS WITH F-WORDS IN THE CORPUS OF CONTEMPORARY AMERICAN ENGLISH |
title_full |
AUTOMATIC RETRIEVAL AND THE FORMALIZATION OF MULTI WORDS EXPRESSIONS WITH F-WORDS IN THE CORPUS OF CONTEMPORARY AMERICAN ENGLISH |
title_fullStr |
AUTOMATIC RETRIEVAL AND THE FORMALIZATION OF MULTI WORDS EXPRESSIONS WITH F-WORDS IN THE CORPUS OF CONTEMPORARY AMERICAN ENGLISH |
title_full_unstemmed |
AUTOMATIC RETRIEVAL AND THE FORMALIZATION OF MULTI WORDS EXPRESSIONS WITH F-WORDS IN THE CORPUS OF CONTEMPORARY AMERICAN ENGLISH |
title_sort |
automatic retrieval and the formalization of multi words expressions with f-words in the corpus of contemporary american english |
publisher |
Universitas Gadjah Mada |
series |
Humaniora |
issn |
0852-0801 2302-9269 |
publishDate |
2016-01-01 |
description |
The research problems in this research are 1) how lexicogrammar takes role in determining polarity of F-Word1 and 2) how to formalize it for corpus processing. The data is obtained from the Contemporary American English Corpus (COCA). In this corpus, F-word is proven to be highest in frequency as compared to its distribution across corpora. Corpus methodology is applied by sending queries to retrieve F-Words to COCA interface. Tokens combination surrounding F-words resulted in the phrase and clause unit accompanying F-words, which are significant cues to determine F-word polarity. The polarity is later proven to be not necessarily negative. I also designed a computational resource to allow the retrieval of F-words offline so that users might apply it to any digital text collections. |
topic |
F-words, Corpus, Information Retrieval, American English, Polarity |
url |
https://jurnal.ugm.ac.id/jurnal-humaniora/article/view/8709 |
work_keys_str_mv |
AT prihantoroprihantoro automaticretrievalandtheformalizationofmultiwordsexpressionswithfwordsinthecorpusofcontemporaryamericanenglish |
_version_ |
1725916442914717696 |