SCAP-TT: Tagging and lemmatising Spanish tourism discourse, and beyond
In this research note we report on the first results of SCAP, the Spanish Corpus Annotation Project, applied to tourism discourse (SCAP_tur). In particular, we present and assess a new TreeTagger parameter set for Spanish (SCAP-TT), which has been trained for the Part-of-Speech tagging (POS-tagging)...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | deu |
Published: |
Asociación Europea de Lenguas para Fines Específicos
2017-04-01
|
Series: | Ibérica |
Subjects: | |
Online Access: | http://www.aelfe.org/documents/33_RN_IBERICA_01.pdf |
Summary: | In this research note we report on the first results of SCAP, the Spanish Corpus Annotation Project, applied to tourism discourse (SCAP_tur). In particular, we present and assess a new TreeTagger parameter set for Spanish (SCAP-TT), which has been trained for the Part-of-Speech tagging (POS-tagging) and lemmatisation of Spanish promotional tourism texts. Although SCAP-TT has been trained for specialized tourism discourse, we also show promising results for the annotation of other text genres such as essays and literary texts. |
---|---|
ISSN: | 1139-7241 2340-2784 |