The fast vocabulary-based algorithm for natural language word form analysis
In the field of Natural Language Processing, identifying word forms and, more precisely, identifying part-of-speech and grammatical information for each of the words in the input text usually comprises the very first level of text processing (or immediately follows splitting the text into words, sho...
Main Author: | Rozanov Alexey |
---|---|
Format: | Article |
Language: | English |
Published: | EDP Sciences 2016-01-01 |
Series: | ITM Web of Conferences |
Online Access: | http://dx.doi.org/10.1051/itmconf/20160603013 |
id |
doaj-76ac77828e7948a98b43564a19c6dc3a |
---|---|
record_format |
Article |
spelling |
doaj-76ac77828e7948a98b43564a19c6dc3a; 2021-02-02T05:27:42Z; eng; EDP Sciences; ITM Web of Conferences; 2271-2097; 2016-01-01; 6; 03013; 10.1051/itmconf/20160603013; itmconf_ics2016_03013; The fast vocabulary-based algorithm for natural language word form analysis; Rozanov Alexey; 0; Ryazan State Radioengineering University, Department of computational and applied mathematics; In the field of Natural Language Processing, identifying word forms and, more precisely, identifying part-of-speech and grammatical information for each of the words in the input text usually comprises the very first level of text processing (or immediately follows splitting the text into words, should such a task be non-trivial); therefore, the development of approaches to speed up word form analysis is of significant interest. In this work, using the work [1] as a basis, we present an approach to the analysis of word forms for natural languages with postfix inflection, following the work done in [3]. We propose a way of representing the postfix inflection rules associated with a natural language and an algorithm for word form analysis based on it. In conclusion, we provide benchmark data indicating an increase in speed compared to known analysis methods.; http://dx.doi.org/10.1051/itmconf/20160603013 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Rozanov Alexey |
title |
The fast vocabulary-based algorithm for natural language word form analysis |
publisher |
EDP Sciences |
series |
ITM Web of Conferences |
issn |
2271-2097 |
publishDate |
2016-01-01 |
description |
In the field of Natural Language Processing, identifying word forms and, more precisely, identifying part-of-speech and grammatical information for each of the words in the input text usually comprises the very first level of text processing (or immediately follows splitting the text into words, should such a task be non-trivial); therefore, the development of approaches to speed up word form analysis is of significant interest. In this work, using the work [1] as a basis, we present an approach to the analysis of word forms for natural languages with postfix inflection, following the work done in [3]. We propose a way of representing the postfix inflection rules associated with a natural language and an algorithm for word form analysis based on it. In conclusion, we provide benchmark data indicating an increase in speed compared to known analysis methods. |
url |
http://dx.doi.org/10.1051/itmconf/20160603013 |