Crowdsourcing a text corpus for a low resource language
Low resourced languages, such as South Africa's isiXhosa, have a limited number of digitised texts, making it challenging to build language corpora and the information retrieval services, such as search and translation that depend on them. Researchers have been unable to assemble isiXhosa corpo...
Main Author: | Packham, Sean |
---|---|
Other Authors: | Suleman, Hussein |
Format: | Dissertation |
Language: | English |
Published: |
University of Cape Town
2016
|
Subjects: | |
Online Access: | http://hdl.handle.net/11427/20436 |
Similar Items
-
Text-to-Speech Synthesis Using Found Data for Low-Resource Languages
by: Cooper, Erica Lindsay
Published: (2019) -
Operations on text in a database programming language
by: Andreev, Maxim.
Published: (1999) -
Text operators in a relational programming language
by: Xie, Jiantao, 1979-
Published: (2005) -
A Manual for Web Corpus Crawling of Low Resource Languages
by: Armin Hoenen, et al.
Published: (2020-05-01) -
Transfer learning for low-resource natural language analysis
by: Zhang, Yuan, Ph. D. Massachusetts Institute of Technology
Published: (2017)