Decoding Strategies for Improving Low-Resource Machine Translation

Pre-processing and post-processing are significant aspects of natural language processing (NLP) application software. Pre-processing in neural machine translation (NMT) includes subword tokenization to alleviate the problem of unknown words, parallel corpus filtering that only filters data suitable...

Full description

Bibliographic Details
Main Authors: Chanjun Park, Yeongwook Yang, Kinam Park, Heuiseok Lim
Format: Article
Language:English
Published: MDPI AG 2020-09-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/9/10/1562