Otedama: Fast Rule-Based Pre-Ordering for Machine Translation

We present Otedama, a fast, open-source tool for rule-based syntactic pre-ordering, a well established technique in statistical machine translation. Otedama implements both a learner for pre-ordering rules, as well as a component for applying these rules to parsed sentences. Our system is compatible...

Full description

Bibliographic Details
Main Authors: Hitschler Julian, Jehl Laura, Karimova Sariya, Ohta Mayumi, Körner Benjamin, Riezler Stefan
Format: Article
Language:English
Published: Sciendo 2016-10-01
Series:Prague Bulletin of Mathematical Linguistics
Online Access:https://doi.org/10.1515/pralin-2016-0015
id doaj-88f43c0f623c4593bf7c5dcaed838c5d
record_format Article
spelling doaj-88f43c0f623c4593bf7c5dcaed838c5d2021-09-05T13:59:53ZengSciendoPrague Bulletin of Mathematical Linguistics 1804-04622016-10-01106115916810.1515/pralin-2016-0015pralin-2016-0015Otedama: Fast Rule-Based Pre-Ordering for Machine TranslationHitschler Julian0Jehl Laura1Karimova Sariya2Ohta Mayumi3Körner Benjamin4Riezler Stefan5Computational Linguistics, Heidelberg University, GermanyComputational Linguistics, Heidelberg University, GermanyComputational Linguistics, Heidelberg University, Russian FederationComputational Linguistics, Heidelberg University, GermanyComputational Linguistics, Heidelberg University, GermanyComputational Linguistics, Heidelberg University, GermanyWe present Otedama, a fast, open-source tool for rule-based syntactic pre-ordering, a well established technique in statistical machine translation. Otedama implements both a learner for pre-ordering rules, as well as a component for applying these rules to parsed sentences. Our system is compatible with several external parsers and capable of accommodating many source and all target languages in any machine translation paradigm which uses parallel training data. We demonstrate improvements on a patent translation task over a state-of-the-art English-Japanese hierarchical phrase-based machine translation system. We compare Otedama with an existing syntax-based pre-ordering system, showing comparable translation performance at a runtime speedup of a factor of 4.5-10.https://doi.org/10.1515/pralin-2016-0015
collection DOAJ
language English
format Article
sources DOAJ
author Hitschler Julian
Jehl Laura
Karimova Sariya
Ohta Mayumi
Körner Benjamin
Riezler Stefan
spellingShingle Hitschler Julian
Jehl Laura
Karimova Sariya
Ohta Mayumi
Körner Benjamin
Riezler Stefan
Otedama: Fast Rule-Based Pre-Ordering for Machine Translation
Prague Bulletin of Mathematical Linguistics
author_facet Hitschler Julian
Jehl Laura
Karimova Sariya
Ohta Mayumi
Körner Benjamin
Riezler Stefan
author_sort Hitschler Julian
title Otedama: Fast Rule-Based Pre-Ordering for Machine Translation
title_short Otedama: Fast Rule-Based Pre-Ordering for Machine Translation
title_full Otedama: Fast Rule-Based Pre-Ordering for Machine Translation
title_fullStr Otedama: Fast Rule-Based Pre-Ordering for Machine Translation
title_full_unstemmed Otedama: Fast Rule-Based Pre-Ordering for Machine Translation
title_sort otedama: fast rule-based pre-ordering for machine translation
publisher Sciendo
series Prague Bulletin of Mathematical Linguistics
issn 1804-0462
publishDate 2016-10-01
description We present Otedama, a fast, open-source tool for rule-based syntactic pre-ordering, a well established technique in statistical machine translation. Otedama implements both a learner for pre-ordering rules, as well as a component for applying these rules to parsed sentences. Our system is compatible with several external parsers and capable of accommodating many source and all target languages in any machine translation paradigm which uses parallel training data. We demonstrate improvements on a patent translation task over a state-of-the-art English-Japanese hierarchical phrase-based machine translation system. We compare Otedama with an existing syntax-based pre-ordering system, showing comparable translation performance at a runtime speedup of a factor of 4.5-10.
url https://doi.org/10.1515/pralin-2016-0015
work_keys_str_mv AT hitschlerjulian otedamafastrulebasedpreorderingformachinetranslation
AT jehllaura otedamafastrulebasedpreorderingformachinetranslation
AT karimovasariya otedamafastrulebasedpreorderingformachinetranslation
AT ohtamayumi otedamafastrulebasedpreorderingformachinetranslation
AT kornerbenjamin otedamafastrulebasedpreorderingformachinetranslation
AT riezlerstefan otedamafastrulebasedpreorderingformachinetranslation
_version_ 1717812832409485312