A Systematic Comparison of Data Selection Criteria for SMT Domain Adaptation

Data selection has been shown to significantly improve the effective use of training data: it extracts sentences from large general-domain corpora to adapt statistical machine translation (SMT) systems to in-domain data. This paper performs an in-depth analysis of three different sentence selection techn...

Bibliographic Details
Main Authors: Longyue Wang, Derek F. Wong, Lidia S. Chao, Yi Lu, Junwen Xing
Format: Article
Language: English
Published: Hindawi Limited 2014-01-01
Series: The Scientific World Journal
Online Access: http://dx.doi.org/10.1155/2014/745485