Crowdsourcing a text corpus for a low resource language

Low resourced languages, such as South Africa's isiXhosa, have a limited number of digitised texts, making it challenging to build language corpora and the information retrieval services, such as search and translation that depend on them. Researchers have been unable to assemble isiXhosa corpo...

Full description

Bibliographic Details
Main Author: Packham, Sean
Other Authors: Suleman, Hussein
Format: Dissertation
Language:English
Published: University of Cape Town 2016
Subjects:
Online Access:http://hdl.handle.net/11427/20436