An Empirical Study on Deep Neural Network Models for Chinese Dialogue Generation
The task of dialogue generation has attracted increasing attention due to its diverse downstream applications, such as question-answering systems and chatbots. Recently, the deep neural network (DNN)-based dialogue generation models have achieved superior performance against conventional models util...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-10-01
|
Series: | Symmetry |
Subjects: | |
Online Access: | https://www.mdpi.com/2073-8994/12/11/1756 |
id |
doaj-cb45203d520c4d9986599c86bfa28fd3 |
---|---|
record_format |
Article |
spelling |
doaj-cb45203d520c4d9986599c86bfa28fd32020-11-25T03:52:06ZengMDPI AGSymmetry2073-89942020-10-01121756175610.3390/sym12111756An Empirical Study on Deep Neural Network Models for Chinese Dialogue GenerationZhe Li0Mieradilijiang Maimaiti1Jiabao Sheng2Zunwang Ke3Wushour Silamu4Qinyong Wang5Xiuhong Li6Xinjiang Laboratory of Multi-Language Information Technology, Xinjiang Multilingual Information Technology Research Center, College of Software, Xinjiang University, Urumqi 830046, ChinaDepartment of Computer Science and Technology, Tsinghua University, Beijing 100084, ChinaCollege of Software, Xinjiang University, Urumqi 830046, ChinaCollege of Software, Xinjiang University, Urumqi 830046, ChinaCollege of Information Science and Engineering, Xinjiang University, Urumqi 830046, ChinaSchool of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, QLD 4702, AustraliaCollege of Information Science and Engineering, Xinjiang University, Urumqi 830046, ChinaThe task of dialogue generation has attracted increasing attention due to its diverse downstream applications, such as question-answering systems and chatbots. Recently, the deep neural network (DNN)-based dialogue generation models have achieved superior performance against conventional models utilizing statistical machine learning methods. However, despite that an enormous number of state-of-the-art DNN-based models have been proposed, there lacks detailed empirical comparative analysis for them on the open Chinese corpus. As a result, relevant researchers and engineers might find it hard to get an intuitive understanding of the current research progress. To address this challenge, we conducted an empirical study for state-of-the-art DNN-based dialogue generation models in various Chinese corpora. Specifically, extensive experiments were performed on several well-known single-turn and multi-turn dialogue corpora, including KdConv, Weibo, and Douban, to evaluate a wide range of dialogue generation models that are based on the symmetrical architecture of Seq2Seq, RNNSearch, transformer, generative adversarial nets, and reinforcement learning respectively. Moreover, we paid special attention to the prevalent pre-trained model for the quality of dialogue generation. Their performances were evaluated by four widely-used metrics in this area: BLEU, pseudo, distinct, and rouge. Finally, we report a case study to show example responses generated by these models separately.https://www.mdpi.com/2073-8994/12/11/1756natural language processingdialogue generationdeep learningnetwork architectureempirical investigation |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zhe Li Mieradilijiang Maimaiti Jiabao Sheng Zunwang Ke Wushour Silamu Qinyong Wang Xiuhong Li |
spellingShingle |
Zhe Li Mieradilijiang Maimaiti Jiabao Sheng Zunwang Ke Wushour Silamu Qinyong Wang Xiuhong Li An Empirical Study on Deep Neural Network Models for Chinese Dialogue Generation Symmetry natural language processing dialogue generation deep learning network architecture empirical investigation |
author_facet |
Zhe Li Mieradilijiang Maimaiti Jiabao Sheng Zunwang Ke Wushour Silamu Qinyong Wang Xiuhong Li |
author_sort |
Zhe Li |
title |
An Empirical Study on Deep Neural Network Models for Chinese Dialogue Generation |
title_short |
An Empirical Study on Deep Neural Network Models for Chinese Dialogue Generation |
title_full |
An Empirical Study on Deep Neural Network Models for Chinese Dialogue Generation |
title_fullStr |
An Empirical Study on Deep Neural Network Models for Chinese Dialogue Generation |
title_full_unstemmed |
An Empirical Study on Deep Neural Network Models for Chinese Dialogue Generation |
title_sort |
empirical study on deep neural network models for chinese dialogue generation |
publisher |
MDPI AG |
series |
Symmetry |
issn |
2073-8994 |
publishDate |
2020-10-01 |
description |
The task of dialogue generation has attracted increasing attention due to its diverse downstream applications, such as question-answering systems and chatbots. Recently, the deep neural network (DNN)-based dialogue generation models have achieved superior performance against conventional models utilizing statistical machine learning methods. However, despite that an enormous number of state-of-the-art DNN-based models have been proposed, there lacks detailed empirical comparative analysis for them on the open Chinese corpus. As a result, relevant researchers and engineers might find it hard to get an intuitive understanding of the current research progress. To address this challenge, we conducted an empirical study for state-of-the-art DNN-based dialogue generation models in various Chinese corpora. Specifically, extensive experiments were performed on several well-known single-turn and multi-turn dialogue corpora, including KdConv, Weibo, and Douban, to evaluate a wide range of dialogue generation models that are based on the symmetrical architecture of Seq2Seq, RNNSearch, transformer, generative adversarial nets, and reinforcement learning respectively. Moreover, we paid special attention to the prevalent pre-trained model for the quality of dialogue generation. Their performances were evaluated by four widely-used metrics in this area: BLEU, pseudo, distinct, and rouge. Finally, we report a case study to show example responses generated by these models separately. |
topic |
natural language processing dialogue generation deep learning network architecture empirical investigation |
url |
https://www.mdpi.com/2073-8994/12/11/1756 |
work_keys_str_mv |
AT zheli anempiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration AT mieradilijiangmaimaiti anempiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration AT jiabaosheng anempiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration AT zunwangke anempiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration AT wushoursilamu anempiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration AT qinyongwang anempiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration AT xiuhongli anempiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration AT zheli empiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration AT mieradilijiangmaimaiti empiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration AT jiabaosheng empiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration AT zunwangke empiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration AT wushoursilamu empiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration AT qinyongwang empiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration AT xiuhongli empiricalstudyondeepneuralnetworkmodelsforchinesedialoguegeneration |
_version_ |
1724484329901391872 |