Character-based recurrent neural networks for morphological relational reasoning
We present a model for predicting inflected word forms based on morphological analogies. Previous work includes rule-based algorithms that determine and copy affixes from one word to another, with limited support for varying inflectional patterns. In related tasks such as morphological reinflection,...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Polish Academy of Sciences
2019-11-01
|
Series: | Journal of Language Modelling |
Subjects: | |
Online Access: | https://jlm.ipipan.waw.pl/index.php/JLM/article/view/218 |
id |
doaj-2c160c7007c14a54ab2dce0be52f42e1 |
---|---|
record_format |
Article |
spelling |
doaj-2c160c7007c14a54ab2dce0be52f42e12021-03-25T10:36:08ZengPolish Academy of SciencesJournal of Language Modelling2299-856X2299-84702019-11-0171139–170139–17010.15398/jlm.v7i1.218162Character-based recurrent neural networks for morphological relational reasoningOlof Mogren0https://orcid.org/0000-0002-9567-2218Richard Johansson1https://orcid.org/0000-0002-9429-4884RISE AIGothenburg universityWe present a model for predicting inflected word forms based on morphological analogies. Previous work includes rule-based algorithms that determine and copy affixes from one word to another, with limited support for varying inflectional patterns. In related tasks such as morphological reinflection, the algorithm is provided with an explicit enumeration of morphological features which may not be available in all cases. In contrast, our model is feature-free: instead of explicitly representing morphological features, the model is given a demo pair that implicitly specifies a morphological relation (such as write:writes specifying infinitive:present). Given this demo relation and a query word (e.g. watch), the model predicts the target word (e.g. watches). To address this task, we devise a character-based recurrent neural network architecture using three separate encoders and one decoder. Our experimental evaluation on five different languages shows tha the exact form can be predicted with high accuracy, consistently beating the baseline methods. Particularly, for English the prediction accuracy is 95.60%. The solution is not limited to copying affixes from the demo relation, but generalizes to words with varying inflectional patterns, and can abstract away from the orthographic level to the level of morphological forms.https://jlm.ipipan.waw.pl/index.php/JLM/article/view/218morphologysyntaxmachine learninganalogiesrecurrent neural networkscharacter-level modelling |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Olof Mogren Richard Johansson |
spellingShingle |
Olof Mogren Richard Johansson Character-based recurrent neural networks for morphological relational reasoning Journal of Language Modelling morphology syntax machine learning analogies recurrent neural networks character-level modelling |
author_facet |
Olof Mogren Richard Johansson |
author_sort |
Olof Mogren |
title |
Character-based recurrent neural networks for morphological relational reasoning |
title_short |
Character-based recurrent neural networks for morphological relational reasoning |
title_full |
Character-based recurrent neural networks for morphological relational reasoning |
title_fullStr |
Character-based recurrent neural networks for morphological relational reasoning |
title_full_unstemmed |
Character-based recurrent neural networks for morphological relational reasoning |
title_sort |
character-based recurrent neural networks for morphological relational reasoning |
publisher |
Polish Academy of Sciences |
series |
Journal of Language Modelling |
issn |
2299-856X 2299-8470 |
publishDate |
2019-11-01 |
description |
We present a model for predicting inflected word forms based on morphological analogies. Previous work includes rule-based algorithms that determine and copy affixes from one word to another, with limited support for varying inflectional patterns. In related tasks such as morphological reinflection, the algorithm is provided with an explicit enumeration of morphological features which may not be available in all cases. In contrast, our model is feature-free: instead of explicitly representing morphological features, the model is given a demo pair that implicitly specifies a morphological relation (such as write:writes specifying infinitive:present). Given this demo relation and a query word (e.g. watch), the model predicts the target word (e.g. watches). To address this task, we devise a character-based recurrent neural network architecture using three separate encoders and one decoder.
Our experimental evaluation on five different languages shows tha the exact form can be predicted with high accuracy, consistently beating the baseline methods. Particularly, for English the prediction accuracy is 95.60%. The solution is not limited to copying affixes from the demo relation, but generalizes to words with varying inflectional patterns, and can abstract away from the orthographic level to the level of morphological forms. |
topic |
morphology syntax machine learning analogies recurrent neural networks character-level modelling |
url |
https://jlm.ipipan.waw.pl/index.php/JLM/article/view/218 |
work_keys_str_mv |
AT olofmogren characterbasedrecurrentneuralnetworksformorphologicalrelationalreasoning AT richardjohansson characterbasedrecurrentneuralnetworksformorphologicalrelationalreasoning |
_version_ |
1724203603547127808 |