Romanian Linguistic Resources On Very Large Scale
This paper suggests a methodology for building a technological environment for linguistic processing, intended to conserve, update and exploit, for research, for public and for commercial purposes, strategic linguistic resources of the Romanian language, rooted in textual data contributed daily and...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Institute of Mathematics and Computer Science of the Academy of Sciences of Moldova
2011-10-01
|
Series: | Computer Science Journal of Moldova |
Online Access: | http://www.math.md/files/csjm/v19-n2/v19-n2-(pp130-145).pdf |
id |
doaj-c298adb9f1384c1eb1f06138e08bd505 |
---|---|
record_format |
Article |
spelling |
doaj-c298adb9f1384c1eb1f06138e08bd5052020-11-24T21:10:36ZengInstitute of Mathematics and Computer Science of the Academy of Sciences of MoldovaComputer Science Journal of Moldova1561-40422011-10-01192(56)130145Romanian Linguistic Resources On Very Large ScaleDan Cristea0Faculty of Computer Science, ``Alexandru Ioan Cuza'' University of Iasi; Institute of Computer Science, Romanian Academy, the Iasi branchThis paper suggests a methodology for building a technological environment for linguistic processing, intended to conserve, update and exploit, for research, for public and for commercial purposes, strategic linguistic resources of the Romanian language, rooted in textual data contributed daily and in the long run by important editorial houses and mass-media institutions. In essence, it describes a technology able to receive, store and continuously process large amounts of textual data, received from voluntary contributors, on a daily basis. Apart from storing linguistic data \textit{\`{a} la longue} for the benefit of preserving the language, the results of the processing will be returned to three categories of users: the researchers working on Romanian language and computational linguistics, the contributors of the resources, and the public at large. http://www.math.md/files/csjm/v19-n2/v19-n2-(pp130-145).pdf |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Dan Cristea |
spellingShingle |
Dan Cristea Romanian Linguistic Resources On Very Large Scale Computer Science Journal of Moldova |
author_facet |
Dan Cristea |
author_sort |
Dan Cristea |
title |
Romanian Linguistic Resources On Very Large Scale |
title_short |
Romanian Linguistic Resources On Very Large Scale |
title_full |
Romanian Linguistic Resources On Very Large Scale |
title_fullStr |
Romanian Linguistic Resources On Very Large Scale |
title_full_unstemmed |
Romanian Linguistic Resources On Very Large Scale |
title_sort |
romanian linguistic resources on very large scale |
publisher |
Institute of Mathematics and Computer Science of the Academy of Sciences of Moldova |
series |
Computer Science Journal of Moldova |
issn |
1561-4042 |
publishDate |
2011-10-01 |
description |
This paper suggests a methodology for building a technological environment for linguistic processing, intended to conserve, update and exploit, for research, for public and for commercial purposes, strategic linguistic resources of the Romanian language, rooted in textual data contributed daily and in the long run by important editorial houses and mass-media institutions. In essence, it describes a technology able to receive, store and continuously process large amounts of textual data, received from
voluntary contributors, on a daily basis. Apart from storing linguistic data \textit{\`{a} la longue} for the benefit of preserving the language, the results of the processing will be returned to three categories of users: the researchers working on
Romanian language and computational linguistics, the contributors of the resources, and the public at large. |
url |
http://www.math.md/files/csjm/v19-n2/v19-n2-(pp130-145).pdf |
work_keys_str_mv |
AT dancristea romanianlinguisticresourcesonverylargescale |
_version_ |
1716755916582289408 |