Improving Low-Resource Neural Machine Translation With Teacher-Free Knowledge Distillation

Knowledge Distillation (KD) aims to distill the knowledge of a cumbersome teacher model into a lightweight student model. Its success is generally attributed to the privileged information on similarities among categories provided by the teacher model, and in this sense, only strong teacher models are...
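
As a minimal illustration of the standard distillation objective the abstract refers to (a student matching the teacher's softened output distribution, which carries the similarities among categories), a short sketch follows. It is not the paper's teacher-free variant; the function name, temperature, and mixing weight are assumptions chosen for the example.

import torch
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, temperature=2.0, alpha=0.5):
    # Softened teacher distribution encodes the similarities among categories.
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    # KL term is scaled by T^2 so its gradients stay comparable to the hard loss.
    distill = F.kl_div(log_soft_student, soft_teacher, reduction="batchmean") * temperature ** 2
    hard = F.cross_entropy(student_logits, labels)
    return alpha * distill + (1.0 - alpha) * hard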


Bibliographic Details
Main Authors: Xinlu Zhang, Xiao Li, Yating Yang, Rui Dong
Format: Article
Language: English
Published: IEEE 2020-01-01
Series: IEEE Access
Subjects:
Online Access: https://ieeexplore.ieee.org/document/9257421/

Similar Items