Simulation of Human Ear Recognition Sound Direction Based on Convolutional Neural Network

In recent years, more and more people are applying Convolutional Neural Networks to the study of sound signals. The main reason is the translational invariance of convolution in time and space. Thereby the diversity of the sound signal can be overcome. However, in terms of sound direction recognitio...

Full description

Bibliographic Details
Main Authors: Wang Zhuhe, Li Nan, Wu Tao, Zhang Haoxuan, Feng Tao
Format: Article
Language:English
Published: De Gruyter 2020-07-01
Series:Journal of Intelligent Systems
Subjects:
gcc
Online Access:https://doi.org/10.1515/jisys-2019-0250
id doaj-f31548cf575547efbec9a13fa73d2164
record_format Article
spelling doaj-f31548cf575547efbec9a13fa73d21642021-10-03T07:42:34ZengDe GruyterJournal of Intelligent Systems2191-026X2020-07-0130120922310.1515/jisys-2019-0250jisys-2019-0250Simulation of Human Ear Recognition Sound Direction Based on Convolutional Neural NetworkWang Zhuhe0Li Nan1Wu Tao2Zhang Haoxuan3Feng Tao4School of Artificial Intelligence, Beijing Technology and Business University, Beijing, ChinaSchool of Artificial Intelligence, Beijing Technology and Business University, Beijing, ChinaSchool of Artificial Intelligence, Beijing Technology and Business University, Beijing, ChinaSchool of Artificial Intelligence, Beijing Technology and Business University, Beijing, ChinaSchool of Artificial Intelligence, Beijing Technology and Business University, Beijing, ChinaIn recent years, more and more people are applying Convolutional Neural Networks to the study of sound signals. The main reason is the translational invariance of convolution in time and space. Thereby the diversity of the sound signal can be overcome. However, in terms of sound direction recognition, there are also problems such as a microphone matrix being too large, and feature selection. This paper proposes a sound direction recognition using a simulated human head with microphones at both ears. Theoretically, the two microphones cannot distinguish the front and rear directions. However, we use the original data of the two channels as the input of the convolutional neural network, and the resolution effect can reach more than 0.9. For comparison, we also chose the delay feature (GCC) for sound direction recognition. Finally, we also conducted experiments that used probability distributions to identify more directions.https://doi.org/10.1515/jisys-2019-0250convolutional neural networksimulated human headdual-channel raw datagccprobability distributions
collection DOAJ
language English
format Article
sources DOAJ
author Wang Zhuhe
Li Nan
Wu Tao
Zhang Haoxuan
Feng Tao
spellingShingle Wang Zhuhe
Li Nan
Wu Tao
Zhang Haoxuan
Feng Tao
Simulation of Human Ear Recognition Sound Direction Based on Convolutional Neural Network
Journal of Intelligent Systems
convolutional neural network
simulated human head
dual-channel raw data
gcc
probability distributions
author_facet Wang Zhuhe
Li Nan
Wu Tao
Zhang Haoxuan
Feng Tao
author_sort Wang Zhuhe
title Simulation of Human Ear Recognition Sound Direction Based on Convolutional Neural Network
title_short Simulation of Human Ear Recognition Sound Direction Based on Convolutional Neural Network
title_full Simulation of Human Ear Recognition Sound Direction Based on Convolutional Neural Network
title_fullStr Simulation of Human Ear Recognition Sound Direction Based on Convolutional Neural Network
title_full_unstemmed Simulation of Human Ear Recognition Sound Direction Based on Convolutional Neural Network
title_sort simulation of human ear recognition sound direction based on convolutional neural network
publisher De Gruyter
series Journal of Intelligent Systems
issn 2191-026X
publishDate 2020-07-01
description In recent years, more and more people are applying Convolutional Neural Networks to the study of sound signals. The main reason is the translational invariance of convolution in time and space. Thereby the diversity of the sound signal can be overcome. However, in terms of sound direction recognition, there are also problems such as a microphone matrix being too large, and feature selection. This paper proposes a sound direction recognition using a simulated human head with microphones at both ears. Theoretically, the two microphones cannot distinguish the front and rear directions. However, we use the original data of the two channels as the input of the convolutional neural network, and the resolution effect can reach more than 0.9. For comparison, we also chose the delay feature (GCC) for sound direction recognition. Finally, we also conducted experiments that used probability distributions to identify more directions.
topic convolutional neural network
simulated human head
dual-channel raw data
gcc
probability distributions
url https://doi.org/10.1515/jisys-2019-0250
work_keys_str_mv AT wangzhuhe simulationofhumanearrecognitionsounddirectionbasedonconvolutionalneuralnetwork
AT linan simulationofhumanearrecognitionsounddirectionbasedonconvolutionalneuralnetwork
AT wutao simulationofhumanearrecognitionsounddirectionbasedonconvolutionalneuralnetwork
AT zhanghaoxuan simulationofhumanearrecognitionsounddirectionbasedonconvolutionalneuralnetwork
AT fengtao simulationofhumanearrecognitionsounddirectionbasedonconvolutionalneuralnetwork
_version_ 1716846078723096576