Neurale netwerke as moontlike woordafkappingstegniek vir Afrikaans

Text in Afrikaans === Summaries in Afrikaans and English === In Afrikaans, soos in NederJands en Duits, word saamgestelde woorde aanmekaar geskryf. Nuwe woorde word dus voortdurend geskep deur woorde aanmekaar te haak Dit bemoeilik die proses van woordafkapping tydens teksprosessering, wat deesdae...

Full description

Bibliographic Details
Main Author: Fick, Machteld
Other Authors: Swanepoel, Carel Johannes
Format: Others
Language:af
Published: 2009
Subjects:
Online Access:Fick, Machteld (2002) Neurale netwerke as moontlike woordafkappingstegniek vir Afrikaans, University of South Africa, Pretoria, <http://hdl.handle.net/10500/584>
http://hdl.handle.net/10500/584
id ndltd-netd.ac.za-oai-union.ndltd.org-unisa-oai-uir.unisa.ac.za-10500-584
record_format oai_dc
spelling ndltd-netd.ac.za-oai-union.ndltd.org-unisa-oai-uir.unisa.ac.za-10500-5842018-11-19T17:13:57Z Neurale netwerke as moontlike woordafkappingstegniek vir Afrikaans Fick, Machteld Swanepoel, Carel Johannes Neural networks Backpropagation Feed-forward Training algorithm Transfer function Resilient backpropagation Early termination Encoding Hyphenation Syllabification 410.285 Afrikaans language -- Syllabication Afrikaans language -- Data processing Syllabication -- Data processing Neural networks (Computer science) Back propagation (Artificial intelligence) Text in Afrikaans Summaries in Afrikaans and English In Afrikaans, soos in NederJands en Duits, word saamgestelde woorde aanmekaar geskryf. Nuwe woorde word dus voortdurend geskep deur woorde aanmekaar te haak Dit bemoeilik die proses van woordafkapping tydens teksprosessering, wat deesdae deur rekenaars gedoen word, aangesien die verwysingsbron gedurig verander. Daar bestaan verskeie afkappingsalgoritmes en tegnieke, maar die resultate is onbevredigend. Afrikaanse woorde met korrekte lettergreepverdeling is net die elektroniese weergawe van die handwoordeboek van die Afrikaanse Taal (HAT) onttrek. 'n Neutrale netwerk ( vorentoevoer-terugpropagering) is met sowat. 5 000 van hierdie woorde afgerig. Die neurale netwerk is verfyn deur 'n gcskikte afrigtingsalgoritme en oorfragfunksie vir die probleem asook die optimale aantal verborge lae en aantal neurone in elke laag te bepaal. Die neurale netwerk is met 5 000 nuwe woorde getoets en dit het 97,56% van moontlike posisies korrek as of geldige of ongeldige afkappingsposisies geklassifiseer. Verder is 510 woorde uit tydskrifartikels met die neurale netwerk getoets en 98,75% van moontlike posisies is korrek geklassifiseer. In Afrikaans, like in Dutch and German, compound words are written as one word. New words are therefore created by simply joining words. Word hyphenation during typesetting by computer is a problem, because the source of reference changes all the time. Several algorithms and techniques for hyphenation exist, but results are not satisfactory. Afrikaans words with correct syllabification were extracted from the electronic version of the Handwoordeboek van die Afrikaans Taal (HAT). A neural network (feedforward backpropagation) was trained with about 5 000 of these words. The neural network was refined by heuristically finding a suitable training algorithm and transfer function for the problem as well as determining the optimal number of layers and number of neurons in each layer. The neural network was tested with 5 000 words not the training data. It classified 97,56% of possible points in these words correctly as either valid or invalid hyphenation points. Furthermore, 510 words from articles in a magazine were tested with the neural network and 98,75% of possible positions were classified correctly. Computing M.Sc. (Operasionele Navorsing) 2009-08-25T10:44:56Z 2009-08-25T10:44:56Z 2002-09 Dissertation Fick, Machteld (2002) Neurale netwerke as moontlike woordafkappingstegniek vir Afrikaans, University of South Africa, Pretoria, <http://hdl.handle.net/10500/584> http://hdl.handle.net/10500/584 af 1 online resource (ii, 102 pages)
collection NDLTD
language af
format Others
sources NDLTD
topic Neural networks
Backpropagation
Feed-forward
Training algorithm
Transfer function
Resilient backpropagation
Early termination
Encoding
Hyphenation
Syllabification
410.285
Afrikaans language -- Syllabication
Afrikaans language -- Data processing
Syllabication -- Data processing
Neural networks (Computer science)
Back propagation (Artificial intelligence)
spellingShingle Neural networks
Backpropagation
Feed-forward
Training algorithm
Transfer function
Resilient backpropagation
Early termination
Encoding
Hyphenation
Syllabification
410.285
Afrikaans language -- Syllabication
Afrikaans language -- Data processing
Syllabication -- Data processing
Neural networks (Computer science)
Back propagation (Artificial intelligence)
Fick, Machteld
Neurale netwerke as moontlike woordafkappingstegniek vir Afrikaans
description Text in Afrikaans === Summaries in Afrikaans and English === In Afrikaans, soos in NederJands en Duits, word saamgestelde woorde aanmekaar geskryf. Nuwe woorde word dus voortdurend geskep deur woorde aanmekaar te haak Dit bemoeilik die proses van woordafkapping tydens teksprosessering, wat deesdae deur rekenaars gedoen word, aangesien die verwysingsbron gedurig verander. Daar bestaan verskeie afkappingsalgoritmes en tegnieke, maar die resultate is onbevredigend. Afrikaanse woorde met korrekte lettergreepverdeling is net die elektroniese weergawe van die handwoordeboek van die Afrikaanse Taal (HAT) onttrek. 'n Neutrale netwerk ( vorentoevoer-terugpropagering) is met sowat. 5 000 van hierdie woorde afgerig. Die neurale netwerk is verfyn deur 'n gcskikte afrigtingsalgoritme en oorfragfunksie vir die probleem asook die optimale aantal verborge lae en aantal neurone in elke laag te bepaal. Die neurale netwerk is met 5 000 nuwe woorde getoets en dit het 97,56% van moontlike posisies korrek as of geldige of ongeldige afkappingsposisies geklassifiseer. Verder is 510 woorde uit tydskrifartikels met die neurale netwerk getoets en 98,75% van moontlike posisies is korrek geklassifiseer. === In Afrikaans, like in Dutch and German, compound words are written as one word. New words are therefore created by simply joining words. Word hyphenation during typesetting by computer is a problem, because the source of reference changes all the time. Several algorithms and techniques for hyphenation exist, but results are not satisfactory. Afrikaans words with correct syllabification were extracted from the electronic version of the Handwoordeboek van die Afrikaans Taal (HAT). A neural network (feedforward backpropagation) was trained with about 5 000 of these words. The neural network was refined by heuristically finding a suitable training algorithm and transfer function for the problem as well as determining the optimal number of layers and number of neurons in each layer. The neural network was tested with 5 000 words not the training data. It classified 97,56% of possible points in these words correctly as either valid or invalid hyphenation points. Furthermore, 510 words from articles in a magazine were tested with the neural network and 98,75% of possible positions were classified correctly. === Computing === M.Sc. (Operasionele Navorsing)
author2 Swanepoel, Carel Johannes
author_facet Swanepoel, Carel Johannes
Fick, Machteld
author Fick, Machteld
author_sort Fick, Machteld
title Neurale netwerke as moontlike woordafkappingstegniek vir Afrikaans
title_short Neurale netwerke as moontlike woordafkappingstegniek vir Afrikaans
title_full Neurale netwerke as moontlike woordafkappingstegniek vir Afrikaans
title_fullStr Neurale netwerke as moontlike woordafkappingstegniek vir Afrikaans
title_full_unstemmed Neurale netwerke as moontlike woordafkappingstegniek vir Afrikaans
title_sort neurale netwerke as moontlike woordafkappingstegniek vir afrikaans
publishDate 2009
url Fick, Machteld (2002) Neurale netwerke as moontlike woordafkappingstegniek vir Afrikaans, University of South Africa, Pretoria, <http://hdl.handle.net/10500/584>
http://hdl.handle.net/10500/584
work_keys_str_mv AT fickmachteld neuralenetwerkeasmoontlikewoordafkappingstegniekvirafrikaans
_version_ 1718792611235364864