A new joint CTC-attention-based speech recognition model with multi-level multi-head attention

Abstract A method called joint connectionist temporal classification (CTC)-attention-based speech recognition has recently received increasing focus and has achieved impressive performance. A hybrid end-to-end architecture that adds an extra CTC loss to the attention-based model could force extra re...

Full description

Bibliographic Details
Main Authors: Chu-Xiong Qin, Wen-Lin Zhang, Dan Qu
Format: Article
Language:English
Published: SpringerOpen 2019-10-01
Series:EURASIP Journal on Audio, Speech, and Music Processing
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13636-019-0161-0