The comparisons of OCR tools: a conversion case in the Malaysian Hansard Corpus development / Anis Nadiah Che Abdul Rahman ...[et al.]
Optical Character Recognition (OCR) is a tool in computational technology that allows a recognition of printed characters by manipulating photoelectric devices and computer software. It runs by converting images or texts that are scanned beforehand into machine-readable and editable texts. There are...
Main Authors: | Che Abdul Rahman, Anis Nadiah (Author), Ho Abdullah, Imran (Author), Zainuddin, Intan Safinaz (Author), Jaludin, Azhar (Author) |
---|---|
Format: | Article |
Language: | English |
Published: |
Penerbit UiTM,
2019-12.
|
Subjects: | |
Online Access: | Get fulltext View Fulltext in UiTM IR |
Similar Items
-
"Is Selangor in deep water?": a corpus-driven account of air/water in the Malaysian Hansard Corpus (MHC)
by: Norsimah Mat Awal, et al.
Published: (2019) -
A corpus driven analysis of representations around the word 'ekonomi' in Malaysian Hansard Corpus
by: Nor Fariza Mohd Nor, et al.
Published: (2019) -
Alkohol (arak dan etanol) dalam makanan halal / A Anis Najiha and W.A. Wan Nadiah
by: A, Anis Najiha, et al.
Published: (2015) -
Domain-specific stop words in Malaysian Parliamentary Debates 1959 - 2018
by: Anis Nadiah Che Abdul Rahman, et al.
Published: (2021) -
COHESION IN HANSARD
by: Šimoliūnaitė, Justina
Published: (2010)