Crowdsourcing the Paldaruo Speech Corpus of Welsh for Speech Technology

Collecting speech data for a low-resource language is challenging when funding and resources are limited. This paper describes the process of designing, creating and using the Paldaruo Speech Corpus for developing speech technology for Welsh. Specifically, this paper focuses on the crowdsourcing of...

Full description

Bibliographic Details
Main Authors: Sarah Cooper, Dewi Bryn Jones, Delyth Prys
Format: Article
Language:English
Published: MDPI AG 2019-07-01
Series:Information
Subjects:
Online Access:https://www.mdpi.com/2078-2489/10/8/247