Romanian Linguistic Resources On Very Large Scale

This paper suggests a methodology for building a technological environment for linguistic processing, intended to conserve, update and exploit, for research, for public and for commercial purposes, strategic linguistic resources of the Romanian language, rooted in textual data contributed daily and...

Bibliographic Details
Main Author: Dan Cristea
Format: Article
Language: English
Published: Institute of Mathematics and Computer Science of the Academy of Sciences of Moldova 2011-10-01
Series: Computer Science Journal of Moldova
Online Access: http://www.math.md/files/csjm/v19-n2/v19-n2-(pp130-145).pdf
id doaj-c298adb9f1384c1eb1f06138e08bd505
record_format Article
spelling doaj-c298adb9f1384c1eb1f06138e08bd505; 2020-11-24T21:10:36Z; eng; Institute of Mathematics and Computer Science of the Academy of Sciences of Moldova; Computer Science Journal of Moldova; ISSN 1561-4042; 2011-10-01; vol. 19, no. 2(56), pp. 130-145; Romanian Linguistic Resources On Very Large Scale; Dan Cristea (Faculty of Computer Science, "Alexandru Ioan Cuza" University of Iasi; Institute of Computer Science, Romanian Academy, the Iasi branch)
collection DOAJ
language English
format Article
sources DOAJ
author Dan Cristea
title Romanian Linguistic Resources On Very Large Scale
publisher Institute of Mathematics and Computer Science of the Academy of Sciences of Moldova
series Computer Science Journal of Moldova
issn 1561-4042
publishDate 2011-10-01
description This paper suggests a methodology for building a technological environment for linguistic processing, intended to conserve, update and exploit, for research, for public and for commercial purposes, strategic linguistic resources of the Romanian language, rooted in textual data contributed daily and in the long run by important editorial houses and mass-media institutions. In essence, it describes a technology able to receive, store and continuously process large amounts of textual data, received from voluntary contributors, on a daily basis. Apart from storing linguistic data à la longue for the benefit of preserving the language, the results of the processing will be returned to three categories of users: the researchers working on Romanian language and computational linguistics, the contributors of the resources, and the public at large.
url http://www.math.md/files/csjm/v19-n2/v19-n2-(pp130-145).pdf