Exchanging image processing and OCR components in a Setswana digitisation pipeline

As more natural language processing (NLP) applications benefit from neural network based approaches, it makes sense to re-evaluate existing work in NLP. A complete pipeline for digitisation includes several components hand- ling the material in sequence. Image processing after scanning the document...

Full description

Bibliographic Details
Main Authors: Gideon Jozua Kotzé, Friedel Wolff
Format: Article
Language:English
Published: South African Institute of Computer Scientists and Information Technologists 2020-12-01
Series:South African Computer Journal
Subjects:
Online Access:https://sacj.cs.uct.ac.za/index.php/sacj/article/view/707