Evaluation of English–Slovak Neural and Statistical Machine Translation

This study is focused on the comparison of phrase-based statistical machine translation (SMT) systems and neural machine translation (NMT) systems using automatic metrics for translation quality evaluation for the language pair of English and Slovak. As the statistical approach is the predecessor of...

Full description

Bibliographic Details
Main Authors: Lucia Benkova, Dasa Munkova, Ľubomír Benko, Michal Munk
Format: Article
Language:English
Published: MDPI AG 2021-03-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/11/7/2948
id doaj-07d7ce75d894484a8dd5a862678e598f
record_format Article
spelling doaj-07d7ce75d894484a8dd5a862678e598f2021-03-26T00:04:33ZengMDPI AGApplied Sciences2076-34172021-03-01112948294810.3390/app11072948Evaluation of English–Slovak Neural and Statistical Machine TranslationLucia Benkova0Dasa Munkova1Ľubomír Benko2Michal Munk3Department of Informatics, Constantine the Philosopher University in Nitra, Tr. A. Hlinku 1, SK-949 01 Nitra, SlovakiaDepartment of Translation Studies, Constantine the Philosopher University in Nitra, Štefánikova 67, SK-949 74 Nitra, SlovakiaDepartment of Informatics, Constantine the Philosopher University in Nitra, Tr. A. Hlinku 1, SK-949 01 Nitra, SlovakiaDepartment of Informatics, Constantine the Philosopher University in Nitra, Tr. A. Hlinku 1, SK-949 01 Nitra, SlovakiaThis study is focused on the comparison of phrase-based statistical machine translation (SMT) systems and neural machine translation (NMT) systems using automatic metrics for translation quality evaluation for the language pair of English and Slovak. As the statistical approach is the predecessor of neural machine translation, it was assumed that the neural network approach would generate results with a better quality. An experiment was performed using residuals to compare the scores of automatic metrics of the accuracy (BLEU_n) of the statistical machine translation with those of the neural machine translation. The results showed that the assumption of better neural machine translation quality regardless of the system used was confirmed. There were statistically significant differences between the SMT and NMT in favor of the NMT based on all BLEU_n scores. The neural machine translation achieved a better quality of translation of journalistic texts from English into Slovak, regardless of if it was a system trained on general texts, such as Google Translate, or specific ones, such as the European Commission’s (EC’s) tool, which was trained on a specific-domain.https://www.mdpi.com/2076-3417/11/7/2948neural machine translationstatistical machine translationtext analysisautomatic evaluationSlovak languageEnglish language
collection DOAJ
language English
format Article
sources DOAJ
author Lucia Benkova
Dasa Munkova
Ľubomír Benko
Michal Munk
spellingShingle Lucia Benkova
Dasa Munkova
Ľubomír Benko
Michal Munk
Evaluation of English–Slovak Neural and Statistical Machine Translation
Applied Sciences
neural machine translation
statistical machine translation
text analysis
automatic evaluation
Slovak language
English language
author_facet Lucia Benkova
Dasa Munkova
Ľubomír Benko
Michal Munk
author_sort Lucia Benkova
title Evaluation of English–Slovak Neural and Statistical Machine Translation
title_short Evaluation of English–Slovak Neural and Statistical Machine Translation
title_full Evaluation of English–Slovak Neural and Statistical Machine Translation
title_fullStr Evaluation of English–Slovak Neural and Statistical Machine Translation
title_full_unstemmed Evaluation of English–Slovak Neural and Statistical Machine Translation
title_sort evaluation of english–slovak neural and statistical machine translation
publisher MDPI AG
series Applied Sciences
issn 2076-3417
publishDate 2021-03-01
description This study is focused on the comparison of phrase-based statistical machine translation (SMT) systems and neural machine translation (NMT) systems using automatic metrics for translation quality evaluation for the language pair of English and Slovak. As the statistical approach is the predecessor of neural machine translation, it was assumed that the neural network approach would generate results with a better quality. An experiment was performed using residuals to compare the scores of automatic metrics of the accuracy (BLEU_n) of the statistical machine translation with those of the neural machine translation. The results showed that the assumption of better neural machine translation quality regardless of the system used was confirmed. There were statistically significant differences between the SMT and NMT in favor of the NMT based on all BLEU_n scores. The neural machine translation achieved a better quality of translation of journalistic texts from English into Slovak, regardless of if it was a system trained on general texts, such as Google Translate, or specific ones, such as the European Commission’s (EC’s) tool, which was trained on a specific-domain.
topic neural machine translation
statistical machine translation
text analysis
automatic evaluation
Slovak language
English language
url https://www.mdpi.com/2076-3417/11/7/2948
work_keys_str_mv AT luciabenkova evaluationofenglishslovakneuralandstatisticalmachinetranslation
AT dasamunkova evaluationofenglishslovakneuralandstatisticalmachinetranslation
AT lubomirbenko evaluationofenglishslovakneuralandstatisticalmachinetranslation
AT michalmunk evaluationofenglishslovakneuralandstatisticalmachinetranslation
_version_ 1724203057408901120