Neural Networks for Part-of-Speech Tagging

The aim of this thesis is to explore the viability of artificial neural networks using a purely contextual word representation as a solution for part-of-speech tagging. Furthermore, the effects of deep learning and increased contextual information of the network are explored. This was achieved by cr...

Full description

Bibliographic Details
Main Author: Strandqvist, Wiktor
Format: Others
Language:English
Published: Linköpings universitet, Institutionen för datavetenskap 2016
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-129296
id ndltd-UPSALLA1-oai-DiVA.org-liu-129296
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-liu-1292962018-01-11T05:11:42ZNeural Networks for Part-of-Speech TaggingengStrandqvist, WiktorLinköpings universitet, Institutionen för datavetenskap2016artificial neural networkpart-of-speech tagginglanguage technologyLanguage Technology (Computational Linguistics)Språkteknologi (språkvetenskaplig databehandling)The aim of this thesis is to explore the viability of artificial neural networks using a purely contextual word representation as a solution for part-of-speech tagging. Furthermore, the effects of deep learning and increased contextual information of the network are explored. This was achieved by creating an artificial neural network written in Python. The input vectors employed were created by Word2Vec. This system was compared to a baseline using a tagger with handcrafted features in respect to accuracy and precision. The results show that the use of artificial neural networks using a purely contextual word representation shows promise, but ultimately falls roughly two percent short of the baseline. The suspected reason for this is the suboptimal representation for rare words. The use of deeper network architectures shows an insignificant improvement, indicating that the data sets used might be too small. The use of additional context information provided a higher accuracy, but started to decline after a context size of one. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-129296application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic artificial neural network
part-of-speech tagging
language technology
Language Technology (Computational Linguistics)
Språkteknologi (språkvetenskaplig databehandling)
spellingShingle artificial neural network
part-of-speech tagging
language technology
Language Technology (Computational Linguistics)
Språkteknologi (språkvetenskaplig databehandling)
Strandqvist, Wiktor
Neural Networks for Part-of-Speech Tagging
description The aim of this thesis is to explore the viability of artificial neural networks using a purely contextual word representation as a solution for part-of-speech tagging. Furthermore, the effects of deep learning and increased contextual information of the network are explored. This was achieved by creating an artificial neural network written in Python. The input vectors employed were created by Word2Vec. This system was compared to a baseline using a tagger with handcrafted features in respect to accuracy and precision. The results show that the use of artificial neural networks using a purely contextual word representation shows promise, but ultimately falls roughly two percent short of the baseline. The suspected reason for this is the suboptimal representation for rare words. The use of deeper network architectures shows an insignificant improvement, indicating that the data sets used might be too small. The use of additional context information provided a higher accuracy, but started to decline after a context size of one.
author Strandqvist, Wiktor
author_facet Strandqvist, Wiktor
author_sort Strandqvist, Wiktor
title Neural Networks for Part-of-Speech Tagging
title_short Neural Networks for Part-of-Speech Tagging
title_full Neural Networks for Part-of-Speech Tagging
title_fullStr Neural Networks for Part-of-Speech Tagging
title_full_unstemmed Neural Networks for Part-of-Speech Tagging
title_sort neural networks for part-of-speech tagging
publisher Linköpings universitet, Institutionen för datavetenskap
publishDate 2016
url http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-129296
work_keys_str_mv AT strandqvistwiktor neuralnetworksforpartofspeechtagging
_version_ 1718604409738362880