Converting raw transcripts into an annotated and turn-aligned TEI-XML corpus: the example of the Corpus of Serbian Forms of Address
This paper describes the procedure for building a TEI-XML corpus of spoken Serbian from raw transcripts. The corpus consists of semi-structured interviews gathered to investigate forms of address in Serbian. The interviews were thoroughly transcribed according to G...
Main Author: | Dolores Lemmenmeier-Batinić |
---|---|
Format: | Article |
Language: | English |
Published: | Znanstvena založba Filozofske fakultete Univerze v Ljubljani (Ljubljana University Press, Faculty of Arts), 2021-07-01 |
Series: | Slovenščina 2.0: Empirične, aplikativne in interdisciplinarne raziskave |
Online Access: | https://revije.ff.uni-lj.si/slovenscina2/article/view/9869 |
Similar Items
- On the neologisms in Serbian from the viewpoint of corpus preparation for the compilation of the Matica srpska multivolume dictionary of contemporary Serbian
  by: Dragićević Rajna M.
  Published: (2020-01-01)
- SPIRAL CONSTRUCTION OF SYNTACTICALLY ANNOTATED SPOKEN LANGUAGE CORPUS
  by: Inagaki, Yasuyoshi, et al.
  Published: (2003)
- Distribution of verbal overgeneralizations in the Serbian corpus of early child language
  by: Anđelković Darinka, et al.
  Published: (2017-01-01)
- Introduction : Grammaticalité et annotations de corpus d’anglais oral – perspectives et problèmes
  by: Sylvie Hancil
  Published: (2018-07-01)
- Methodology and Experience of Building the Retrospective Corpus of Lithuanian Broadcast Media
  by: Laima Nevinskaitė
  Published: (2013-10-01)