Character-based recurrent neural networks for morphological relational reasoning

We present a model for predicting inflected word forms based on morphological analogies. Previous work includes rule-based algorithms that determine and copy affixes from one word to another, with limited support for varying inflectional patterns. In related tasks such as morphological reinflection,...

Full description

Bibliographic Details
Main Authors: Olof Mogren, Richard Johansson
Format: Article
Language:English
Published: Polish Academy of Sciences 2019-11-01
Series:Journal of Language Modelling
Subjects:
Online Access:https://jlm.ipipan.waw.pl/index.php/JLM/article/view/218
id doaj-2c160c7007c14a54ab2dce0be52f42e1
record_format Article
spelling doaj-2c160c7007c14a54ab2dce0be52f42e12021-03-25T10:36:08ZengPolish Academy of SciencesJournal of Language Modelling2299-856X2299-84702019-11-0171139–170139–17010.15398/jlm.v7i1.218162Character-based recurrent neural networks for morphological relational reasoningOlof Mogren0https://orcid.org/0000-0002-9567-2218Richard Johansson1https://orcid.org/0000-0002-9429-4884RISE AIGothenburg universityWe present a model for predicting inflected word forms based on morphological analogies. Previous work includes rule-based algorithms that determine and copy affixes from one word to another, with limited support for varying inflectional patterns. In related tasks such as morphological reinflection, the algorithm is provided with an explicit enumeration of morphological features which may not be available in all cases. In contrast, our model is feature-free: instead of explicitly representing morphological features, the model is given a demo pair that implicitly specifies a morphological relation (such as write:writes specifying infinitive:present). Given this demo relation and a query word (e.g. watch), the model predicts the target word (e.g. watches). To address this task, we devise a character-based recurrent neural network architecture using three separate encoders and one decoder. Our experimental evaluation on five different languages shows tha the exact form can be predicted with high accuracy, consistently beating the baseline methods. Particularly, for English the prediction accuracy is 95.60%. The solution is not limited to copying affixes from the demo relation, but generalizes to words with varying inflectional patterns, and can abstract away from the orthographic level to the level of morphological forms.https://jlm.ipipan.waw.pl/index.php/JLM/article/view/218morphologysyntaxmachine learninganalogiesrecurrent neural networkscharacter-level modelling
collection DOAJ
language English
format Article
sources DOAJ
author Olof Mogren
Richard Johansson
spellingShingle Olof Mogren
Richard Johansson
Character-based recurrent neural networks for morphological relational reasoning
Journal of Language Modelling
morphology
syntax
machine learning
analogies
recurrent neural networks
character-level modelling
author_facet Olof Mogren
Richard Johansson
author_sort Olof Mogren
title Character-based recurrent neural networks for morphological relational reasoning
title_short Character-based recurrent neural networks for morphological relational reasoning
title_full Character-based recurrent neural networks for morphological relational reasoning
title_fullStr Character-based recurrent neural networks for morphological relational reasoning
title_full_unstemmed Character-based recurrent neural networks for morphological relational reasoning
title_sort character-based recurrent neural networks for morphological relational reasoning
publisher Polish Academy of Sciences
series Journal of Language Modelling
issn 2299-856X
2299-8470
publishDate 2019-11-01
description We present a model for predicting inflected word forms based on morphological analogies. Previous work includes rule-based algorithms that determine and copy affixes from one word to another, with limited support for varying inflectional patterns. In related tasks such as morphological reinflection, the algorithm is provided with an explicit enumeration of morphological features which may not be available in all cases. In contrast, our model is feature-free: instead of explicitly representing morphological features, the model is given a demo pair that implicitly specifies a morphological relation (such as write:writes specifying infinitive:present). Given this demo relation and a query word (e.g. watch), the model predicts the target word (e.g. watches). To address this task, we devise a character-based recurrent neural network architecture using three separate encoders and one decoder. Our experimental evaluation on five different languages shows tha the exact form can be predicted with high accuracy, consistently beating the baseline methods. Particularly, for English the prediction accuracy is 95.60%. The solution is not limited to copying affixes from the demo relation, but generalizes to words with varying inflectional patterns, and can abstract away from the orthographic level to the level of morphological forms.
topic morphology
syntax
machine learning
analogies
recurrent neural networks
character-level modelling
url https://jlm.ipipan.waw.pl/index.php/JLM/article/view/218
work_keys_str_mv AT olofmogren characterbasedrecurrentneuralnetworksformorphologicalrelationalreasoning
AT richardjohansson characterbasedrecurrentneuralnetworksformorphologicalrelationalreasoning
_version_ 1724203603547127808