End-to-End Mandarin Recognition based on Convolution Input

The cross-entropy criterion of mainstream neural network training is to classify and optimize each frame of acoustic data, while the continuous speech recognition uses the sequence-level transcription accuracy as a performance measure. In view of this difference, an end-to-end speech recognition sys...

Full description

Bibliographic Details
Main Authors: Wang Yanzhe, Zhang LiMin, Zhang Bingqiang, Li Zhenyu
Format: Article
Language:English
Published: EDP Sciences 2018-01-01
Series:MATEC Web of Conferences
Online Access:https://doi.org/10.1051/matecconf/201821401004