A corpus of spoken Faroese

Bibliographic Details
Main Author: Janne Bondi Johannessen
Format: Article
Language: English
Published: Septentrio Academic Publishing 2009-01-01
Series: Nordlyd: Tromsø University Working Papers on Language & Linguistics
Online Access: https://septentrio.uit.no/index.php/nordlyd/article/view/224
Description
Summary: The paper describes the new Corpus of Spoken Faroese. While the corpus is still under development with respect to content (number of informants, dialects and words), it is included in the larger Nordic Dialect Corpus, which means that all technical solutions are already in place. As a result, the Faroese corpus is fully operable, albeit with a rather limited number of words at present. The recordings have all been made, but transcription and tagging remain to be done for most of them; these are expected to be finished by the end of 2009. At the moment, there are nine conversations in the corpus. In the paper I describe some of the search and result-handling options the corpus offers, exemplifying with Faroese, and I also try to shed light on some linguistic questions using the corpus.
ISSN: 1503-8599