Insertion-based Decoding with Automatically Inferred Generation Order

Conventional neural autoregressive decoding commonly assumes a fixed left-to-right generation order, which may be sub-optimal. In this work, we propose a novel decoding algorithm— InDIGO—which supports flexible sequence generation in arbitrary orders through insertion operation...

Full description

Bibliographic Details
Main Authors: Gu, Jiatao, Liu, Qi, Cho, Kyunghyun
Format: Article
Language:English
Published: The MIT Press 2019-11-01
Series:Transactions of the Association for Computational Linguistics
Online Access:https://www.mitpressjournals.org/doi/abs/10.1162/tacl_a_00292
id doaj-6e1c19d77d424904a1de9dd28f606c5f
record_format Article
spelling doaj-6e1c19d77d424904a1de9dd28f606c5f2020-11-25T03:17:44ZengThe MIT PressTransactions of the Association for Computational Linguistics2307-387X2019-11-01766167610.1162/tacl_a_00292Insertion-based Decoding with Automatically Inferred Generation OrderGu, JiataoLiu, QiCho, Kyunghyun Conventional neural autoregressive decoding commonly assumes a fixed left-to-right generation order, which may be sub-optimal. In this work, we propose a novel decoding algorithm— InDIGO—which supports flexible sequence generation in arbitrary orders through insertion operations. We extend Transformer, a state-of-the-art sequence generation model, to efficiently implement the proposed approach, enabling it to be trained with either a pre-defined generation order or adaptive orders obtained from beam-search. Experiments on four real-world tasks, including word order recovery, machine translation, image caption, and code generation, demonstrate that our algorithm can generate sequences following arbitrary orders, while achieving competitive or even better performance compared with the conventional left-to-right generation. The generated sequences show that InDIGO adopts adaptive generation orders based on input information. https://www.mitpressjournals.org/doi/abs/10.1162/tacl_a_00292
collection DOAJ
language English
format Article
sources DOAJ
author Gu, Jiatao
Liu, Qi
Cho, Kyunghyun
spellingShingle Gu, Jiatao
Liu, Qi
Cho, Kyunghyun
Insertion-based Decoding with Automatically Inferred Generation Order
Transactions of the Association for Computational Linguistics
author_facet Gu, Jiatao
Liu, Qi
Cho, Kyunghyun
author_sort Gu, Jiatao
title Insertion-based Decoding with Automatically Inferred Generation Order
title_short Insertion-based Decoding with Automatically Inferred Generation Order
title_full Insertion-based Decoding with Automatically Inferred Generation Order
title_fullStr Insertion-based Decoding with Automatically Inferred Generation Order
title_full_unstemmed Insertion-based Decoding with Automatically Inferred Generation Order
title_sort insertion-based decoding with automatically inferred generation order
publisher The MIT Press
series Transactions of the Association for Computational Linguistics
issn 2307-387X
publishDate 2019-11-01
description Conventional neural autoregressive decoding commonly assumes a fixed left-to-right generation order, which may be sub-optimal. In this work, we propose a novel decoding algorithm— InDIGO—which supports flexible sequence generation in arbitrary orders through insertion operations. We extend Transformer, a state-of-the-art sequence generation model, to efficiently implement the proposed approach, enabling it to be trained with either a pre-defined generation order or adaptive orders obtained from beam-search. Experiments on four real-world tasks, including word order recovery, machine translation, image caption, and code generation, demonstrate that our algorithm can generate sequences following arbitrary orders, while achieving competitive or even better performance compared with the conventional left-to-right generation. The generated sequences show that InDIGO adopts adaptive generation orders based on input information.
url https://www.mitpressjournals.org/doi/abs/10.1162/tacl_a_00292
work_keys_str_mv AT gujiatao insertionbaseddecodingwithautomaticallyinferredgenerationorder
AT liuqi insertionbaseddecodingwithautomaticallyinferredgenerationorder
AT chokyunghyun insertionbaseddecodingwithautomaticallyinferredgenerationorder
_version_ 1724630394781827072