An Empirical Study on Deep Neural Network Models for Chinese Dialogue Generation

The task of dialogue generation has attracted increasing attention due to its diverse downstream applications, such as question-answering systems and chatbots. Recently, the deep neural network (DNN)-based dialogue generation models have achieved superior performance against conventional models util...

Full description

Bibliographic Details
Main Authors: Zhe Li, Mieradilijiang Maimaiti, Jiabao Sheng, Zunwang Ke, Wushour Silamu, Qinyong Wang, Xiuhong Li
Format: Article
Language:English
Published: MDPI AG 2020-10-01
Series:Symmetry
Subjects:
Online Access:https://www.mdpi.com/2073-8994/12/11/1756
id doaj-cb45203d520c4d9986599c86bfa28fd3
record_format Article
spelling doaj-cb45203d520c4d9986599c86bfa28fd32020-11-25T03:52:06ZengMDPI AGSymmetry2073-89942020-10-01121756175610.3390/sym12111756An Empirical Study on Deep Neural Network Models for Chinese Dialogue GenerationZhe Li0Mieradilijiang Maimaiti1Jiabao Sheng2Zunwang Ke3Wushour Silamu4Qinyong Wang5Xiuhong Li6Xinjiang Laboratory of Multi-Language Information Technology, Xinjiang Multilingual Information Technology Research Center, College of Software, Xinjiang University, Urumqi 830046, ChinaDepartment of Computer Science and Technology, Tsinghua University, Beijing 100084, ChinaCollege of Software, Xinjiang University, Urumqi 830046, ChinaCollege of Software, Xinjiang University, Urumqi 830046, ChinaCollege of Information Science and Engineering, Xinjiang University, Urumqi 830046, ChinaSchool of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, QLD 4702, AustraliaCollege of Information Science and Engineering, Xinjiang University, Urumqi 830046, ChinaThe task of dialogue generation has attracted increasing attention due to its diverse downstream applications, such as question-answering systems and chatbots. Recently, the deep neural network (DNN)-based dialogue generation models have achieved superior performance against conventional models utilizing statistical machine learning methods. However, despite that an enormous number of state-of-the-art DNN-based models have been proposed, there lacks detailed empirical comparative analysis for them on the open Chinese corpus. As a result, relevant researchers and engineers might find it hard to get an intuitive understanding of the current research progress. To address this challenge, we conducted an empirical study for state-of-the-art DNN-based dialogue generation models in various Chinese corpora. Specifically, extensive experiments were performed on several well-known single-turn and multi-turn dialogue corpora, including KdConv, Weibo, and Douban, to evaluate a wide range of dialogue generation models that are based on the symmetrical architecture of Seq2Seq, RNNSearch, transformer, generative adversarial nets, and reinforcement learning respectively. Moreover, we paid special attention to the prevalent pre-trained model for the quality of dialogue generation. Their performances were evaluated by four widely-used metrics in this area: BLEU, pseudo, distinct, and rouge. Finally, we report a case study to show example responses generated by these models separately.https://www.mdpi.com/2073-8994/12/11/1756natural language processingdialogue generationdeep learningnetwork architectureempirical investigation
collection DOAJ
language English
format Article
sources DOAJ
author Zhe Li
Mieradilijiang Maimaiti
Jiabao Sheng
Zunwang Ke
Wushour Silamu
Qinyong Wang
Xiuhong Li
spellingShingle Zhe Li
Mieradilijiang Maimaiti
Jiabao Sheng
Zunwang Ke
Wushour Silamu
Qinyong Wang
Xiuhong Li
An Empirical Study on Deep Neural Network Models for Chinese Dialogue Generation
Symmetry
natural language processing
dialogue generation
deep learning
network architecture
empirical investigation
author_facet Zhe Li
Mieradilijiang Maimaiti
Jiabao Sheng
Zunwang Ke
Wushour Silamu
Qinyong Wang
Xiuhong Li
author_sort Zhe Li
title An Empirical Study on Deep Neural Network Models for Chinese Dialogue Generation
title_short An Empirical Study on Deep Neural Network Models for Chinese Dialogue Generation
title_full An Empirical Study on Deep Neural Network Models for Chinese Dialogue Generation
title_fullStr An Empirical Study on Deep Neural Network Models for Chinese Dialogue Generation
title_full_unstemmed An Empirical Study on Deep Neural Network Models for Chinese Dialogue Generation
title_sort empirical study on deep neural network models for chinese dialogue generation
publisher MDPI AG
series Symmetry
issn 2073-8994
publishDate 2020-10-01
description The task of dialogue generation has attracted increasing attention due to its diverse downstream applications, such as question-answering systems and chatbots. Recently, the deep neural network (DNN)-based dialogue generation models have achieved superior performance against conventional models utilizing statistical machine learning methods. However, despite that an enormous number of state-of-the-art DNN-based models have been proposed, there lacks detailed empirical comparative analysis for them on the open Chinese corpus. As a result, relevant researchers and engineers might find it hard to get an intuitive understanding of the current research progress. To address this challenge, we conducted an empirical study for state-of-the-art DNN-based dialogue generation models in various Chinese corpora. Specifically, extensive experiments were performed on several well-known single-turn and multi-turn dialogue corpora, including KdConv, Weibo, and Douban, to evaluate a wide range of dialogue generation models that are based on the symmetrical architecture of Seq2Seq, RNNSearch, transformer, generative adversarial nets, and reinforcement learning respectively. Moreover, we paid special attention to the prevalent pre-trained model for the quality of dialogue generation. Their performances were evaluated by four widely-used metrics in this area: BLEU, pseudo, distinct, and rouge. Finally, we report a case study to show example responses generated by these models separately.
topic natural language processing
dialogue generation
deep learning
network architecture
empirical investigation
url https://www.mdpi.com/2073-8994/12/11/1756
work_keys_str_mv AT zheli anempiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration
AT mieradilijiangmaimaiti anempiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration
AT jiabaosheng anempiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration
AT zunwangke anempiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration
AT wushoursilamu anempiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration
AT qinyongwang anempiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration
AT xiuhongli anempiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration
AT zheli empiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration
AT mieradilijiangmaimaiti empiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration
AT jiabaosheng empiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration
AT zunwangke empiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration
AT wushoursilamu empiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration
AT qinyongwang empiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration
AT xiuhongli empiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration
_version_ 1724484329901391872