Voice Keyword Retrieval Method Using Attention Mechanism and Multimodal Information Fusion
A cross-modal speech-text retrieval method based on an interactive-learning convolutional autoencoder (CAE) is proposed. First, an interactive-learning autoencoder structure is introduced, with two inputs, speech and text, and processing stages such as encoding, hidden layer interaction, an...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: | Hindawi Limited, 2021-01-01 |
Series: | Scientific Programming |
Online Access: | http://dx.doi.org/10.1155/2021/6662841 |