An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences
Sign language translation (SLT) is an important application to bridge the communication gap between deaf and hearing people. In recent years, the research on the SLT based on neural translation frameworks has attracted wide attention. Despite the progress, current SLT research is still in the initia...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2020-01-01
|
Series: | Computational Intelligence and Neuroscience |
Online Access: | http://dx.doi.org/10.1155/2020/8816125 |
id |
doaj-79c1524018b64eafbc371a039e6054b6 |
---|---|
record_format |
Article |
spelling |
doaj-79c1524018b64eafbc371a039e6054b62020-11-25T04:04:43ZengHindawi LimitedComputational Intelligence and Neuroscience1687-52651687-52732020-01-01202010.1155/2020/88161258816125An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign SentencesJiangbin Zheng0Zheng Zhao1Min Chen2Jing Chen3Chong Wu4Yidong Chen5Xiaodong Shi6Yiqi Tong7Department of Artificial Intelligence, School of Informatics, Xiamen University, Xiamen 361005, ChinaChina Mobile (Suzhou) Software Technology Co., LTD, Suzhou 215000, ChinaChina Mobile (Suzhou) Software Technology Co., LTD, Suzhou 215000, ChinaChina Mobile (Suzhou) Software Technology Co., LTD, Suzhou 215000, ChinaDepartment of Electrical Engineering, City University of Hong Kong, Kowloon, Hong KongDepartment of Artificial Intelligence, School of Informatics, Xiamen University, Xiamen 361005, ChinaDepartment of Artificial Intelligence, School of Informatics, Xiamen University, Xiamen 361005, ChinaDepartment of Artificial Intelligence, School of Informatics, Xiamen University, Xiamen 361005, ChinaSign language translation (SLT) is an important application to bridge the communication gap between deaf and hearing people. In recent years, the research on the SLT based on neural translation frameworks has attracted wide attention. Despite the progress, current SLT research is still in the initial stage. In fact, current systems perform poorly in processing long sign sentences, which often involve long-distance dependencies and require large resource consumption. To tackle this problem, we propose two explainable adaptations to the traditional neural SLT models using optimized tokenization-related modules. First, we introduce a frame stream density compression (FSDC) algorithm for detecting and reducing the redundant similar frames, which effectively shortens the long sign sentences without losing information. Then, we replace the traditional encoder in a neural machine translation (NMT) module with an improved architecture, which incorporates a temporal convolution (T-Conv) unit and a dynamic hierarchical bidirectional GRU (DH-BiGRU) unit sequentially. The improved component takes the temporal tokenization information into consideration to extract deeper information with reasonable resource consumption. Our experiments on the RWTH-PHOENIX-Weather 2014T dataset show that the proposed model outperforms the state-of-the-art baseline up to about 1.5+ BLEU-4 score gains.http://dx.doi.org/10.1155/2020/8816125 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jiangbin Zheng Zheng Zhao Min Chen Jing Chen Chong Wu Yidong Chen Xiaodong Shi Yiqi Tong |
spellingShingle |
Jiangbin Zheng Zheng Zhao Min Chen Jing Chen Chong Wu Yidong Chen Xiaodong Shi Yiqi Tong An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences Computational Intelligence and Neuroscience |
author_facet |
Jiangbin Zheng Zheng Zhao Min Chen Jing Chen Chong Wu Yidong Chen Xiaodong Shi Yiqi Tong |
author_sort |
Jiangbin Zheng |
title |
An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences |
title_short |
An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences |
title_full |
An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences |
title_fullStr |
An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences |
title_full_unstemmed |
An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences |
title_sort |
improved sign language translation model with explainable adaptations for processing long sign sentences |
publisher |
Hindawi Limited |
series |
Computational Intelligence and Neuroscience |
issn |
1687-5265 1687-5273 |
publishDate |
2020-01-01 |
description |
Sign language translation (SLT) is an important application to bridge the communication gap between deaf and hearing people. In recent years, the research on the SLT based on neural translation frameworks has attracted wide attention. Despite the progress, current SLT research is still in the initial stage. In fact, current systems perform poorly in processing long sign sentences, which often involve long-distance dependencies and require large resource consumption. To tackle this problem, we propose two explainable adaptations to the traditional neural SLT models using optimized tokenization-related modules. First, we introduce a frame stream density compression (FSDC) algorithm for detecting and reducing the redundant similar frames, which effectively shortens the long sign sentences without losing information. Then, we replace the traditional encoder in a neural machine translation (NMT) module with an improved architecture, which incorporates a temporal convolution (T-Conv) unit and a dynamic hierarchical bidirectional GRU (DH-BiGRU) unit sequentially. The improved component takes the temporal tokenization information into consideration to extract deeper information with reasonable resource consumption. Our experiments on the RWTH-PHOENIX-Weather 2014T dataset show that the proposed model outperforms the state-of-the-art baseline up to about 1.5+ BLEU-4 score gains. |
url |
http://dx.doi.org/10.1155/2020/8816125 |
work_keys_str_mv |
AT jiangbinzheng animprovedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences AT zhengzhao animprovedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences AT minchen animprovedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences AT jingchen animprovedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences AT chongwu animprovedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences AT yidongchen animprovedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences AT xiaodongshi animprovedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences AT yiqitong animprovedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences AT jiangbinzheng improvedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences AT zhengzhao improvedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences AT minchen improvedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences AT jingchen improvedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences AT chongwu improvedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences AT yidongchen improvedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences AT xiaodongshi improvedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences AT yiqitong improvedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences |
_version_ |
1715053413785403392 |