Otedama: Fast Rule-Based Pre-Ordering for Machine Translation
We present Otedama, a fast, open-source tool for rule-based syntactic pre-ordering, a well established technique in statistical machine translation. Otedama implements both a learner for pre-ordering rules, as well as a component for applying these rules to parsed sentences. Our system is compatible...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Sciendo
2016-10-01
|
Series: | Prague Bulletin of Mathematical Linguistics |
Online Access: | https://doi.org/10.1515/pralin-2016-0015 |
id |
doaj-88f43c0f623c4593bf7c5dcaed838c5d |
---|---|
record_format |
Article |
spelling |
doaj-88f43c0f623c4593bf7c5dcaed838c5d2021-09-05T13:59:53ZengSciendoPrague Bulletin of Mathematical Linguistics 1804-04622016-10-01106115916810.1515/pralin-2016-0015pralin-2016-0015Otedama: Fast Rule-Based Pre-Ordering for Machine TranslationHitschler Julian0Jehl Laura1Karimova Sariya2Ohta Mayumi3Körner Benjamin4Riezler Stefan5Computational Linguistics, Heidelberg University, GermanyComputational Linguistics, Heidelberg University, GermanyComputational Linguistics, Heidelberg University, Russian FederationComputational Linguistics, Heidelberg University, GermanyComputational Linguistics, Heidelberg University, GermanyComputational Linguistics, Heidelberg University, GermanyWe present Otedama, a fast, open-source tool for rule-based syntactic pre-ordering, a well established technique in statistical machine translation. Otedama implements both a learner for pre-ordering rules, as well as a component for applying these rules to parsed sentences. Our system is compatible with several external parsers and capable of accommodating many source and all target languages in any machine translation paradigm which uses parallel training data. We demonstrate improvements on a patent translation task over a state-of-the-art English-Japanese hierarchical phrase-based machine translation system. We compare Otedama with an existing syntax-based pre-ordering system, showing comparable translation performance at a runtime speedup of a factor of 4.5-10.https://doi.org/10.1515/pralin-2016-0015 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Hitschler Julian Jehl Laura Karimova Sariya Ohta Mayumi Körner Benjamin Riezler Stefan |
spellingShingle |
Hitschler Julian Jehl Laura Karimova Sariya Ohta Mayumi Körner Benjamin Riezler Stefan Otedama: Fast Rule-Based Pre-Ordering for Machine Translation Prague Bulletin of Mathematical Linguistics |
author_facet |
Hitschler Julian Jehl Laura Karimova Sariya Ohta Mayumi Körner Benjamin Riezler Stefan |
author_sort |
Hitschler Julian |
title |
Otedama: Fast Rule-Based Pre-Ordering for Machine Translation |
title_short |
Otedama: Fast Rule-Based Pre-Ordering for Machine Translation |
title_full |
Otedama: Fast Rule-Based Pre-Ordering for Machine Translation |
title_fullStr |
Otedama: Fast Rule-Based Pre-Ordering for Machine Translation |
title_full_unstemmed |
Otedama: Fast Rule-Based Pre-Ordering for Machine Translation |
title_sort |
otedama: fast rule-based pre-ordering for machine translation |
publisher |
Sciendo |
series |
Prague Bulletin of Mathematical Linguistics |
issn |
1804-0462 |
publishDate |
2016-10-01 |
description |
We present Otedama, a fast, open-source tool for rule-based syntactic pre-ordering, a well established technique in statistical machine translation. Otedama implements both a learner for pre-ordering rules, as well as a component for applying these rules to parsed sentences. Our system is compatible with several external parsers and capable of accommodating many source and all target languages in any machine translation paradigm which uses parallel training data. We demonstrate improvements on a patent translation task over a state-of-the-art English-Japanese hierarchical phrase-based machine translation system. We compare Otedama with an existing syntax-based pre-ordering system, showing comparable translation performance at a runtime speedup of a factor of 4.5-10. |
url |
https://doi.org/10.1515/pralin-2016-0015 |
work_keys_str_mv |
AT hitschlerjulian otedamafastrulebasedpreorderingformachinetranslation AT jehllaura otedamafastrulebasedpreorderingformachinetranslation AT karimovasariya otedamafastrulebasedpreorderingformachinetranslation AT ohtamayumi otedamafastrulebasedpreorderingformachinetranslation AT kornerbenjamin otedamafastrulebasedpreorderingformachinetranslation AT riezlerstefan otedamafastrulebasedpreorderingformachinetranslation |
_version_ |
1717812832409485312 |