The fast vocabulary-based algorithm for natural language word form analysis

In the field of Natural Language Processing, identifying word forms and, more precisely, identifying part-of-speech and grammatical information for each of the words in the input text usually comprises the very first level of text processing (or immediately follows splitting the text into words, sho...

Bibliographic Details
Main Author: Rozanov Alexey
Format: Article
Language: English
Published: EDP Sciences 2016-01-01
Series: ITM Web of Conferences
Online Access: http://dx.doi.org/10.1051/itmconf/20160603013
id doaj-76ac77828e7948a98b43564a19c6dc3a
record_format Article
spelling doaj-76ac77828e7948a98b43564a19c6dc3a; 2021-02-02T05:27:42Z; eng; EDP Sciences; ITM Web of Conferences; 2271-2097; 2016-01-01; 6; 03013; 10.1051/itmconf/20160603013; itmconf_ics2016_03013; The fast vocabulary-based algorithm for natural language word form analysis; Rozanov Alexey; Ryazan State Radioengineering University, Department of computational and applied mathematics
collection DOAJ
language English
format Article
sources DOAJ
author Rozanov Alexey
title The fast vocabulary-based algorithm for natural language word form analysis
publisher EDP Sciences
series ITM Web of Conferences
issn 2271-2097
publishDate 2016-01-01
description In the field of Natural Language Processing, identifying word forms and, more precisely, identifying part-of-speech and grammatical information for each word in the input text usually constitutes the very first level of text processing (or immediately follows splitting the text into words, should that task be non-trivial). The development of approaches that speed up word form analysis is therefore of significant interest. In this work, using the work [1] as a basis, we present an approach to the analysis of word forms for natural languages with postfix inflection, following the work done in [3]. We propose a way of representing the postfix inflection rules associated with a natural language and an algorithm for word form analysis based on it. In conclusion, we provide benchmark data indicating the increase in speed compared to known analysis methods.
url http://dx.doi.org/10.1051/itmconf/20160603013