Simulation of Human Ear Recognition Sound Direction Based on Convolutional Neural Network
In recent years, more and more people are applying Convolutional Neural Networks to the study of sound signals. The main reason is the translational invariance of convolution in time and space. Thereby the diversity of the sound signal can be overcome. However, in terms of sound direction recognitio...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
De Gruyter
2020-07-01
|
Series: | Journal of Intelligent Systems |
Subjects: | |
Online Access: | https://doi.org/10.1515/jisys-2019-0250 |
id |
doaj-f31548cf575547efbec9a13fa73d2164 |
---|---|
record_format |
Article |
spelling |
doaj-f31548cf575547efbec9a13fa73d21642021-10-03T07:42:34ZengDe GruyterJournal of Intelligent Systems2191-026X2020-07-0130120922310.1515/jisys-2019-0250jisys-2019-0250Simulation of Human Ear Recognition Sound Direction Based on Convolutional Neural NetworkWang Zhuhe0Li Nan1Wu Tao2Zhang Haoxuan3Feng Tao4School of Artificial Intelligence, Beijing Technology and Business University, Beijing, ChinaSchool of Artificial Intelligence, Beijing Technology and Business University, Beijing, ChinaSchool of Artificial Intelligence, Beijing Technology and Business University, Beijing, ChinaSchool of Artificial Intelligence, Beijing Technology and Business University, Beijing, ChinaSchool of Artificial Intelligence, Beijing Technology and Business University, Beijing, ChinaIn recent years, more and more people are applying Convolutional Neural Networks to the study of sound signals. The main reason is the translational invariance of convolution in time and space. Thereby the diversity of the sound signal can be overcome. However, in terms of sound direction recognition, there are also problems such as a microphone matrix being too large, and feature selection. This paper proposes a sound direction recognition using a simulated human head with microphones at both ears. Theoretically, the two microphones cannot distinguish the front and rear directions. However, we use the original data of the two channels as the input of the convolutional neural network, and the resolution effect can reach more than 0.9. For comparison, we also chose the delay feature (GCC) for sound direction recognition. Finally, we also conducted experiments that used probability distributions to identify more directions.https://doi.org/10.1515/jisys-2019-0250convolutional neural networksimulated human headdual-channel raw datagccprobability distributions |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Wang Zhuhe Li Nan Wu Tao Zhang Haoxuan Feng Tao |
spellingShingle |
Wang Zhuhe Li Nan Wu Tao Zhang Haoxuan Feng Tao Simulation of Human Ear Recognition Sound Direction Based on Convolutional Neural Network Journal of Intelligent Systems convolutional neural network simulated human head dual-channel raw data gcc probability distributions |
author_facet |
Wang Zhuhe Li Nan Wu Tao Zhang Haoxuan Feng Tao |
author_sort |
Wang Zhuhe |
title |
Simulation of Human Ear Recognition Sound Direction Based on Convolutional Neural Network |
title_short |
Simulation of Human Ear Recognition Sound Direction Based on Convolutional Neural Network |
title_full |
Simulation of Human Ear Recognition Sound Direction Based on Convolutional Neural Network |
title_fullStr |
Simulation of Human Ear Recognition Sound Direction Based on Convolutional Neural Network |
title_full_unstemmed |
Simulation of Human Ear Recognition Sound Direction Based on Convolutional Neural Network |
title_sort |
simulation of human ear recognition sound direction based on convolutional neural network |
publisher |
De Gruyter |
series |
Journal of Intelligent Systems |
issn |
2191-026X |
publishDate |
2020-07-01 |
description |
In recent years, more and more people are applying Convolutional Neural Networks to the study of sound signals. The main reason is the translational invariance of convolution in time and space. Thereby the diversity of the sound signal can be overcome. However, in terms of sound direction recognition, there are also problems such as a microphone matrix being too large, and feature selection. This paper proposes a sound direction recognition using a simulated human head with microphones at both ears. Theoretically, the two microphones cannot distinguish the front and rear directions. However, we use the original data of the two channels as the input of the convolutional neural network, and the resolution effect can reach more than 0.9. For comparison, we also chose the delay feature (GCC) for sound direction recognition. Finally, we also conducted experiments that used probability distributions to identify more directions. |
topic |
convolutional neural network simulated human head dual-channel raw data gcc probability distributions |
url |
https://doi.org/10.1515/jisys-2019-0250 |
work_keys_str_mv |
AT wangzhuhe simulationofhumanearrecognitionsounddirectionbasedonconvolutionalneuralnetwork AT linan simulationofhumanearrecognitionsounddirectionbasedonconvolutionalneuralnetwork AT wutao simulationofhumanearrecognitionsounddirectionbasedonconvolutionalneuralnetwork AT zhanghaoxuan simulationofhumanearrecognitionsounddirectionbasedonconvolutionalneuralnetwork AT fengtao simulationofhumanearrecognitionsounddirectionbasedonconvolutionalneuralnetwork |
_version_ |
1716846078723096576 |