SCAP-TT: Tagging and lemmatising Spanish tourism discourse, and beyond

In this research note we report on the first results of SCAP, the Spanish Corpus Annotation Project, applied to tourism discourse (SCAP_tur). In particular, we present and assess a new TreeTagger parameter set for Spanish (SCAP-TT), which has been trained for the Part-of-Speech tagging (POS-tagging)...

Full description

Bibliographic Details
Main Authors: Patrick Goethals, Els Lefever, Lieve Macken
Format: Article
Language:deu
Published: Asociación Europea de Lenguas para Fines Específicos 2017-04-01
Series:Ibérica
Subjects:
Online Access:http://www.aelfe.org/documents/33_RN_IBERICA_01.pdf
Description
Summary:In this research note we report on the first results of SCAP, the Spanish Corpus Annotation Project, applied to tourism discourse (SCAP_tur). In particular, we present and assess a new TreeTagger parameter set for Spanish (SCAP-TT), which has been trained for the Part-of-Speech tagging (POS-tagging) and lemmatisation of Spanish promotional tourism texts. Although SCAP-TT has been trained for specialized tourism discourse, we also show promising results for the annotation of other text genres such as essays and literary texts.
ISSN:1139-7241
2340-2784