Lietuvių mokslo kalbos tekstynas

<p><strong>BUILDING </strong><strong>OF THE CORPUS OF ACADEMIC LITHUANIAN</strong></p><p><em>Summary</em></p><p>The paper sets out to describe the initial stages of the design of the corpus of academic Lithuanian. Due to the increasin...

Full description

Bibliographic Details
Main Authors: Aurelija Usonienė, Jonė Grigaliūnienė, Birutė Ryvitytė, Linas Būtėnas, Erika Jasionytė
Format: Article
Language:deu
Published: Vilnius University 2011-12-01
Series:Baltistica
Subjects:
Online Access:http://www.baltistica.lt/index.php/baltistica/article/view/1212
id doaj-8fce32ecbbd844798105131223eb849a
record_format Article
spelling doaj-8fce32ecbbd844798105131223eb849a2020-11-24T22:43:54ZdeuVilnius UniversityBaltistica0132-65032345-00452011-12-0143110111410.15388/baltistica.43.1.12121134Lietuvių mokslo kalbos tekstynasAurelija UsonienėJonė GrigaliūnienėBirutė RyvitytėLinas BūtėnasErika Jasionytė<p><strong>BUILDING </strong><strong>OF THE CORPUS OF ACADEMIC LITHUANIAN</strong></p><p><em>Summary</em></p><p>The paper sets out to describe the initial stages of the design of the corpus of academic Lithuanian. Due to the increasing interest and numerous corpora-based studies in academic discourse (especially of academic English) all over the world, there is an obvious need to provide easily accessible electronic resources of academic Lithuanian to facilitate modern linguistic research, interdisciplinary studies, lexicographical practice, and terminology studies in theory and practice. The Corpus of Academic Lithuanian (CorALit) is being compiled at the University of Vilnius (Faculty of Philology and Faculty of Mathematics and Informatics). The building of the corpus is being carried out within the framework of the 2007—2013 national high-tech development programme launched by the Government of Lithuania and supervised by the Lithuanian State Science and Study Foundation (<a href="http://www.vmsfondas.lt/index.php?lang=en" target="_blank">http://www.vmsfondas.lt/index.php?lang=en</a>). The main issue in the process of corpus design is representativeness which is determined by the following factors: the number of research and study fields represented, the range of genres included (i.e. balance) and the way text chunks for each genre are selected (i.e. sampling).</p><p>The Corpus of Academic Lithuanian aims at representing the main fields of study and research developed in Lithuania and listed in Order No.30 of the Minister of Education and Science of 9 January 1998 “Concerning the Classification of Study and Research Areas, Fields and Branches” as well as the most typical genres that academic community uses for the creation, dissemination and evaluation of new knowledge and internal communication. Since at present there is no reliable scientific measure for corpus balance, the project team will have to rely on intuition and best estimates based on the experience of academic language corpora already compiled in other countries (the UK, USA, etc.). The compilation of the corpus also involves negotiations, sometimes rather time-consuming, with publishers and authors for copyright. Last but not least, technical aspects of corpus design are touched upon. The main purpose of corpus compilation is to make it easily accessible for large numbers of users, and this means changing the format of computer files and text coding in accordance with TEI P5 Guidelines. TEI P5 format will allow users to access the first synchronic corpus of written academic Lithuanian as a major resource of authentic language data via a simple internet search.</p>http://www.baltistica.lt/index.php/baltistica/article/view/1212tekstynas
collection DOAJ
language deu
format Article
sources DOAJ
author Aurelija Usonienė
Jonė Grigaliūnienė
Birutė Ryvitytė
Linas Būtėnas
Erika Jasionytė
spellingShingle Aurelija Usonienė
Jonė Grigaliūnienė
Birutė Ryvitytė
Linas Būtėnas
Erika Jasionytė
Lietuvių mokslo kalbos tekstynas
Baltistica
tekstynas
author_facet Aurelija Usonienė
Jonė Grigaliūnienė
Birutė Ryvitytė
Linas Būtėnas
Erika Jasionytė
author_sort Aurelija Usonienė
title Lietuvių mokslo kalbos tekstynas
title_short Lietuvių mokslo kalbos tekstynas
title_full Lietuvių mokslo kalbos tekstynas
title_fullStr Lietuvių mokslo kalbos tekstynas
title_full_unstemmed Lietuvių mokslo kalbos tekstynas
title_sort lietuvių mokslo kalbos tekstynas
publisher Vilnius University
series Baltistica
issn 0132-6503
2345-0045
publishDate 2011-12-01
description <p><strong>BUILDING </strong><strong>OF THE CORPUS OF ACADEMIC LITHUANIAN</strong></p><p><em>Summary</em></p><p>The paper sets out to describe the initial stages of the design of the corpus of academic Lithuanian. Due to the increasing interest and numerous corpora-based studies in academic discourse (especially of academic English) all over the world, there is an obvious need to provide easily accessible electronic resources of academic Lithuanian to facilitate modern linguistic research, interdisciplinary studies, lexicographical practice, and terminology studies in theory and practice. The Corpus of Academic Lithuanian (CorALit) is being compiled at the University of Vilnius (Faculty of Philology and Faculty of Mathematics and Informatics). The building of the corpus is being carried out within the framework of the 2007—2013 national high-tech development programme launched by the Government of Lithuania and supervised by the Lithuanian State Science and Study Foundation (<a href="http://www.vmsfondas.lt/index.php?lang=en" target="_blank">http://www.vmsfondas.lt/index.php?lang=en</a>). The main issue in the process of corpus design is representativeness which is determined by the following factors: the number of research and study fields represented, the range of genres included (i.e. balance) and the way text chunks for each genre are selected (i.e. sampling).</p><p>The Corpus of Academic Lithuanian aims at representing the main fields of study and research developed in Lithuania and listed in Order No.30 of the Minister of Education and Science of 9 January 1998 “Concerning the Classification of Study and Research Areas, Fields and Branches” as well as the most typical genres that academic community uses for the creation, dissemination and evaluation of new knowledge and internal communication. Since at present there is no reliable scientific measure for corpus balance, the project team will have to rely on intuition and best estimates based on the experience of academic language corpora already compiled in other countries (the UK, USA, etc.). The compilation of the corpus also involves negotiations, sometimes rather time-consuming, with publishers and authors for copyright. Last but not least, technical aspects of corpus design are touched upon. The main purpose of corpus compilation is to make it easily accessible for large numbers of users, and this means changing the format of computer files and text coding in accordance with TEI P5 Guidelines. TEI P5 format will allow users to access the first synchronic corpus of written academic Lithuanian as a major resource of authentic language data via a simple internet search.</p>
topic tekstynas
url http://www.baltistica.lt/index.php/baltistica/article/view/1212
work_keys_str_mv AT aurelijausoniene lietuviumokslokalbostekstynas
AT jonegrigaliuniene lietuviumokslokalbostekstynas
AT biruteryvityte lietuviumokslokalbostekstynas
AT linasbutenas lietuviumokslokalbostekstynas
AT erikajasionyte lietuviumokslokalbostekstynas
_version_ 1725693898346463232